summaryrefslogtreecommitdiff
path: root/docs/proposals/common-scrapers.txt
blob: 0b1a5e1dd91fce2c47ed5ca55692f417b9176e39 (plain)
1
2
3
4
5
6
7
8
2009-07-30

After chatting with Bruce D'Arcus <bdarcus@gmail.com> on zotero-dev@googlegroups.com, we came up with an idea for how to format scraping metadata.

(1) scraper metadata file format for mapping xpaths to particular attributes in particular classes

(2) messy data cleanup utils: for cleaning up the results of scraping, common parsing utilities.
    - perl/python-style regular expressions