Lecture "Web Dynamics" SS2010
Lecture "Web Dynamics"
PD Dr.-Ing. Ralf Schenkel
Dr. Marc Spaniol
Today's Web is a highly dynamic and evolving source of information. The lecture shows how to deal with this dynamics, including modeling the evolving Web, exploiting temporal variances for more effective retrieval, and collecting and archiving snapshots of the Web.
- We offer an oral re-exam on October 11. All students that were admitted to the exam can take the re-exam to improve their grade. Please register here until October 1.
- This is the schedule for the re-exam (building E1.4, room 433 (rotunda 4th floor)):
|Monday October 11
- This is the preliminary result after the oral exam:
Please note that certificates and grades in HISPOS will not be available before
the re-exam in October. Send us a mail when you need your certificate earlier.
- The last lecture took place on July 15, no lecture on July 22 (due to SIGIR)
- Lecture: Thursday, 14:15 - 15:45, room 0.01, building E1.7 (MMCI building), starting April 15
- Exercises: Monday, 14:15 - 15:45, room 0.01, building E1.7 (MMCI building), first assignment available April 29 - hand in May 6 - first exercise group May 10)
Students planning to attend the course should have some knowledge about information retrieval (successful participation in the course on Information Retrieval and Data Mining is fine), even though we will introduce all necessary concepts and techniques in the lecture.
Requirements for the certificate
- You must successfully participate in an oral exam at the end of the semester (probably July 26/27)
- You must present a convincing solution of at least one assignment in the excercise group (handing in a solution of the assignment before is not necessary)
- You should actively participate in the excercise group. You can hand in your solutions for the excercises before the lecture on the due date, and you will get back them with comments during the excercise group; this is optional, so it is ok to hand in just one written solution created by a group of students. Here, 'written' means 'on paper', it does not matter if written by hand or created with a word processor. Writing your name on the submission makes giving them back to you much easier.
- Assignment 1 (size estimation, graph models, centrality measures, Benford's law), due May 6
- Assignment 2 (PageRank vs. HITS, MapReduce, seed selection), due May 14
- Assignment 3 (change rates, parallel crawling, focused crawling, freshness), due May 27
- Assignment 4 (archving strategies, incoherence, shingling), due June 7
- Assignment 5 (time-travel search), due June 24
- Assignment 6 (web spam), due July 1
- Assignment 7 (recommendation), due July 8 (updated!)
- Julien Masanes (Ed.): Web Archiving, Springer, 2006
- Pierre Baldi, Paolo Frasconi, Padhraic Smyth: Modeling the Internet and the Web, Wiley, 2003
- Mark Levene, Alexandra Poulovassilis (Eds.): Web Dynamics, Springer, 2004
- a detailed list of references for each chapter will be provided in the slides