From 4a1cbbe4525d44a74fd4fc7adc45eaf6e28662e0 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Tristan=20Dani=C3=ABl=20Maat?= Date: Sat, 9 Apr 2022 19:30:55 +0100 Subject: [PATCH] Document the result of the dump --- qinghai/README.md | 53 +++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 53 insertions(+) create mode 100644 qinghai/README.md diff --git a/qinghai/README.md b/qinghai/README.md new file mode 100644 index 0000000..18cea8b --- /dev/null +++ b/qinghai/README.md @@ -0,0 +1,53 @@ +## Qinghai scraping + +A few links don't exist anymore. They have the indexes 210, 453, 681, +703, 791, 871, 913, 914, 915 in `links.csv`. + +There are a few small files again, mostly pdf links: + +```console +.rw-r--r-- 101 tlater 9 Apr 19:26 ./2016-09-28_923.txt +.rw-r--r-- 133 tlater 9 Apr 19:26 ./2016-09-28_924.txt +.rw-r--r-- 116 tlater 9 Apr 19:26 ./2016-09-28_925.txt +.rw-r--r-- 147 tlater 9 Apr 19:26 ./2016-09-28_926.txt +.rw-r--r-- 111 tlater 9 Apr 19:23 ./2017-03-16_838.txt +.rw-r--r-- 36 tlater 9 Apr 19:20 ./2017-07-07_745.txt +.rw-r--r-- 82 tlater 9 Apr 19:17 ./2017-08-14_723.txt +.rw-r--r-- 211 tlater 9 Apr 19:17 ./2017-09-12_704.txt +.rw-r--r-- 97 tlater 9 Apr 19:14 ./2017-11-15_587.txt +.rw-r--r-- 156 tlater 9 Apr 19:13 ./2017-11-20_580.txt +.rw-r--r-- 283 tlater 9 Apr 19:13 ./2017-11-23_575.txt +.rw-r--r-- 39 tlater 9 Apr 19:13 ./2017-12-29_566.txt +.rw-r--r-- 39 tlater 9 Apr 19:13 ./2018-01-12_561.txt +.rw-r--r-- 165 tlater 9 Apr 19:12 ./2018-05-30_505.txt +.rw-r--r-- 145 tlater 9 Apr 19:12 ./2018-05-30_507.txt +.rw-r--r-- 391 tlater 9 Apr 19:11 ./2018-07-25_475.txt +.rw-r--r-- 158 tlater 9 Apr 19:11 ./2018-09-13_467.txt +.rw-r--r-- 204 tlater 9 Apr 19:04 ./2020-03-09_254.txt +.rw-r--r-- 124 tlater 9 Apr 19:04 ./2020-03-18_248.txt +.rw-r--r-- 228 tlater 9 Apr 19:04 ./2020-03-20_245.txt +.rw-r--r-- 186 tlater 9 Apr 19:03 ./2020-04-01_221.txt +.rw-r--r-- 67 tlater 9 Apr 19:02 ./2020-04-21_208.txt +.rw-r--r-- 174 tlater 9 Apr 19:01 ./2020-04-30_194.txt +.rw-r--r-- 147 tlater 9 Apr 19:01 ./2020-05-08_186.txt +.rw-r--r-- 189 tlater 9 Apr 19:01 ./2020-05-12_182.txt +.rw-r--r-- 82 tlater 9 Apr 19:01 ./2020-05-15_180.txt +.rw-r--r-- 119 tlater 9 Apr 19:00 ./2020-06-04_139.txt +.rw-r--r-- 201 tlater 9 Apr 19:00 ./2020-07-01_114.txt +.rw-r--r-- 113 tlater 9 Apr 18:59 ./2020-07-20_90.txt +.rw-r--r-- 115 tlater 9 Apr 18:59 ./2020-07-21_86.txt +.rw-r--r-- 99 tlater 9 Apr 18:58 ./2020-08-27_36.txt +.rw-r--r-- 99 tlater 9 Apr 18:58 ./2020-08-27_37.txt +.rw-r--r-- 130 tlater 9 Apr 18:58 ./2020-08-27_38.txt +.rw-r--r-- 130 tlater 9 Apr 18:58 ./2020-08-27_39.txt +.rw-r--r-- 190 tlater 9 Apr 18:58 ./2020-08-27_40.txt +.rw-r--r-- 190 tlater 9 Apr 18:58 ./2020-08-27_41.txt +.rw-r--r-- 184 tlater 9 Apr 18:58 ./2020-08-27_42.txt +.rw-r--r-- 184 tlater 9 Apr 18:58 ./2020-08-27_43.txt +.rw-r--r-- 127 tlater 9 Apr 18:58 ./2020-08-27_44.txt +.rw-r--r-- 127 tlater 9 Apr 18:58 ./2020-08-27_45.txt +.rw-r--r-- 94 tlater 9 Apr 18:58 ./2020-08-27_46.txt +.rw-r--r-- 94 tlater 9 Apr 18:58 ./2020-08-27_47.txt +.rw-r--r-- 88 tlater 9 Apr 18:58 ./2020-09-12_20.txt +.rw-r--r-- 200 tlater 9 Apr 18:57 ./2020-09-21_11.txt +```