Open XLSX File

Information, tips and instructions

Convert HTML table to XLSX

Sometimes it is necessary to convert an HTML table embedded on a website into XLSX. This is useful when you need to perform further manipulations with data, format it in a diifferent way, or perform some additional calculations.

HTML is a markup language that is used for the development of Internet pages. This is the acronym that corresponds to HyperText Markup Language, which could be translated as Document Format Language for Hypertext. Emerged from SGML (Standard Generalized Markup Language) tags, a concept generally translated as "Generalized Markup Language Standard" and understood as a system that allows ordering and tagging various documents within a list. This language is the one used to specify the names of the labels that will be used when ordering, there are no rules for said organization, that is why it is said to be an open format system. HTML is responsible for developing a description of the contents that appear as texts and their structure, complementing said text with various objects (such as photographs, animations, etc.).

XLSX files are Microsoft Excel text files that are evolutionary from XLS. With the release of the 2007 version of Excel, the format was introduced as a new standard for spreadsheet documents. The goal was to establish an XML-based file format that required less storage capacity. The name XLSX is a combination of the popular XLS extension and the X for XML. This format contains a set of packages in XML, being more practical to archive, since it can be decompressed with software external to Office Suite and allows easy sending via email. The XLSX file belongs to the Office Open XML standard (also called OOXML or OpenXML), being an open file format. This type of file improves file and data management and expands the possibilities with binary files compared to previous versions of Microsoft Excel. Storing information in XML format represents an increase in security, which is essentially plain text.

The natural thing is that the conversion is from XLSX to HTML, however, it can also be done in reverse, the steps are as follows:

  • Step 1: Open the Excel spreadsheet into which you want to import the web page table. Go to the "Data" tab.
  • Step 2: Click "From Web" in the "Get External Data" group. The "New Web Query" window will open.
  • Step 3: Type or paste the address of the web page in the "Address" bar. The address can be up to 255 characters long. Click "Go" to navigate to the web page.
  • Step 4: Click the arrow icon next to the table you want to convert to your spreadsheet or click the arrow icon in the upper left of the window to import the entire web page. If you don't see the arrows on the web page, click the "Show Icons" button on the "New Web Query" toolbar.
  • Step 5: Click the "Import" button and Excel will convert the table from the web page to your spreadsheet.