Difference between revisions of "Summerschool Aachen 2004/Hidden Things Lab"
Line 4: | Line 4: | ||
-- Ilja van Sprundel | -- Ilja van Sprundel | ||
+ | |||
+ | === Looking for authors of documents in Cambridge === | ||
+ | |||
+ | I have scanned the *.cam.ac.uk domain using google and wget and retrieved around 1000 M$ Word documents. I then put together a python script that makes a list of authors, documents, urls triplets. This indicates how profilic particular authors are (they have authored documents placed in many different URLs), and exposed quite a few little secrets. | ||
+ | |||
+ | -- [[user: George]] |
Revision as of 17:39, 29 September 2004
The coffee table talk
The slides of the coffee table talk are online, you can find them here.
-- Ilja van Sprundel
Looking for authors of documents in Cambridge
I have scanned the *.cam.ac.uk domain using google and wget and retrieved around 1000 M$ Word documents. I then put together a python script that makes a list of authors, documents, urls triplets. This indicates how profilic particular authors are (they have authored documents placed in many different URLs), and exposed quite a few little secrets.
-- user: George