It goes on each article on Legifrance, creates a JSON file with all the articles... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		steeve on March 31, 2015 \| parent \| context \| favorite \| on: French civil code now on GitHub It goes on each article on Legifrance, creates a JSON file with all the articles, which book/section they belong to, and for each article, tracks it's versions (they are dates). The crawler is in Go. Then there is a python script that takes that JSON file, creates the .md files and runs the git commands in the shell. Ultimately the sad thing is that I had to scrape this information. There were lots of pitfalls due to bad formatting and so on... Well, scraping.

rouxrc on March 31, 2015 [–]

It might not be necessary to scrap anything since this data was finally released as OpenData last year: https://www.data.gouv.fr/fr/datasets/legi-codes-lois-et-regl...

steeve on March 31, 2015 | [–]

i tried using this, but it seems it seems to be just a snapshot, not the full dataset with history (i wish i'm wrong, though)

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact