# Province article scraping

A couple of scripts to scrape article text from various provinces for
a text analysis university course.

We need:

Qinghai
: page 14-75

Ningxia
: page 11-42

Shanxi
: page 2-18

Xinjiang
: page 10-20

The websites all have subtle differences, so there's simply a folder +
scripts for each (the scripts are simple enough that there's no need
for deduplication or anything complex). Written in python/js where
necessary for educational purposes.