scrape-yuanyuan

History

Tristan Daniël Maat 3602c56966 Add guangdong readme		2022-04-09 23:34:56 +01:00
..
articles-guangdong.zip	Add guangdong readme	2022-04-09 23:34:56 +01:00
extract-urls.js	Add typescript-language-server	2022-04-09 17:43:37 +01:00
links.txt	Move the guangdong links	2022-04-09 23:31:52 +01:00
README.md	Add guangdong readme	2022-04-09 23:34:56 +01:00
scrape.py	Structure the project a bit better	2022-04-09 16:50:15 +01:00

README.md

Guangdong scraping

Zip of full article dump: articles-guangdong.zip

Sorry, format for this one is a bit different. I got a bit more practiced after the first.

Files that are likely just links to PDFs:

.rw-r--r-- 119 tlater  9 Apr 04:07 ./2017-05-27_949.txt
.rw-r--r-- 104 tlater  9 Apr 04:07 ./2017-05-27_950.txt
.rw-r--r--  85 tlater  9 Apr 04:07 ./2017-05-27_951.txt
.rw-r--r-- 157 tlater  9 Apr 04:07 ./2017-05-27_952.txt
.rw-r--r-- 164 tlater  9 Apr 04:07 ./2017-05-27_953.txt
.rw-r--r-- 149 tlater  9 Apr 04:07 ./2017-05-27_954.txt
.rw-r--r--  85 tlater  9 Apr 04:07 ./2017-05-27_955.txt
.rw-r--r-- 387 tlater  9 Apr 04:07 ./2017-08-14_888.txt
.rw-r--r-- 355 tlater  9 Apr 04:07 ./2017-08-15_876.txt