Is there a way to download a list/table from a website?

0

Is there a way to extract a filtered table/list (built in HTML) from a website and store it in a file such as .txt or .xlsx?

Additional information: The website doesn't have RSS.

Any help would be greatly appreciated, since I need to convert the data to one of these formats to do calculations with it.

An example would be the following website: http://www.shanghairanking.com/Shanghairanking-Subject-Rankings/chemistry.html

Jr.Dev

Posted 2019-07-27T05:47:44.930

Reputation: 1

1

You need something to parse HTML. As a matter of fact, Google provides tools for reading tabular data and importing it into a Google spreadsheet. But unless the table is rather large, it's going to be more work than it's worth. Also, not every website is on board with that.

– Cliff Armstrong – 2019-07-27T06:03:22.023

Can you provide any website as an example? – Biswapriyo – 2019-07-27T08:04:11.527

What is your browser? – harrymc – 2019-07-27T10:58:02.233

This would be a similar example: http://www.shanghairanking.com/Shanghairanking-Subject-Rankings/atmospheric-science.html

– Jr.Dev – 2019-07-27T15:24:09.973

I can download any browser if that's going to help me. – Jr.Dev – 2019-07-27T15:24:37.373

I will take a look at what you mention @Cliff Armstrong – Jr.Dev – 2019-07-27T15:25:26.453

@Cliff Armstrong's suggestion works for that case, especially the =IMPORTHTML("url";"table";1) function from Google Sheets.

It isn't clear what the issue is. I clicked on the headings of the table at that site and it selected the entire table. I did a simple copy and paste and got everything, including headings and words for the column displayed as flags. No special tools, procedures, or apps were required. – fixer1234 – 2019-07-30T00:29:28.853

@fixer1234 I suspect he wants to continuously do this in an automated fashion for multiple different sites. – Cliff Armstrong – 2019-07-30T16:14:15.593

Exactly, it's for multiple websites. – Jr.Dev – 2019-07-30T19:48:41.980
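
For the automated, multi-site case described in the comments, a small script may be less work than copying by hand. What follows is only a rough sketch using the Python standard library, not a tested solution for the site above. The names TableParser, extract_tables, save_table, fetch and scrape_all are made up for illustration. It assumes the table is present in the HTML the server returns (not built later by JavaScript), that tables are not nested, and that the site permits automated access.

```python
import csv
from html.parser import HTMLParser
from urllib.request import urlopen


class TableParser(HTMLParser):
    """Collect every <table> in a page as a list of rows of cell text.

    Illustrative helper: nested tables are not handled.
    """

    def __init__(self):
        super().__init__()
        self.tables = []   # one list of rows per <table> seen
        self._row = None   # cells of the row currently being read
        self._cell = None  # text fragments of the cell currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._end_row()  # tolerate a missing </tr>
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._end_cell()  # tolerate a missing </td> or </th>
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._end_cell()
        elif tag in ("tr", "table"):
            self._end_row()

    def _end_cell(self):
        if self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None

    def _end_row(self):
        self._end_cell()
        if self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def extract_tables(html):
    """Return all tables found in an HTML string."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    return parser.tables


def save_table(rows, path):
    """Write one table to a CSV file, which spreadsheet programs can open."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        csv.writer(f).writerows(rows)


def fetch(url):
    """Download a page and decode it, assuming UTF-8 if no charset is given."""
    with urlopen(url, timeout=30) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def scrape_all(urls):
    """For each name -> URL pair, save the first table found as <name>.csv."""
    for name, url in urls.items():
        tables = extract_tables(fetch(url))
        if tables:
            save_table(tables[0], name + ".csv")
```

Calling scrape_all({"chemistry": "http://www.shanghairanking.com/Shanghairanking-Subject-Rankings/chemistry.html"}) would then be expected to write chemistry.csv, provided the page still serves its table as plain HTML. Whether the first table on a given page is the one you want has to be checked per site.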

No answers