Each group should submit a final project report containing no more than 1000 words no later than December 15. A web site should also be available by December 15 for more details of the project including a live demo.
| topic | Papers | Time | |
|---|---|---|---|
| Crawler tutorial |
| Mahmud Sumon, Mohit Gherani | Oct 15 |
| Pagerank algorithm tutorial | Xiutian Cui, Xiaoni Cao | Oct 15 | |
| Near duplicate detection | Yi Zhang, Jiayi Wu | Oct 15 | |
| Search Engine tutorial | Shaochen Zheng, Sun Xu | Oct 15 | |
| Clustering Coefficient | |||
| Search engines |
| ||
| Web as graph |
| Adewale Agboola, Amrut Chachad | Oct 15 |
| Topic | Papers | Time | |
|---|---|---|---|
| A vision of semantic web | Tim Berners-Lee, James Hendler and Ora Lassila, The Semantic Web, A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities, Scientific American, May 2001 | ||
| DBPedia | DBpedia: A Nucleus for a Web of Open Data,
ren Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak and Zachary Ives, The Semantic Web,722--735,2007, Springer
Slides |
Joshua Chittle, John Donais | Oct 17 |
| Social Web | M Gjoka et al., A walk in facebook:uniform sampling of users in online social networks | ||
| Ontology matching | Doan, A., Madhavan, J., Domingos, P., and Halevy, A. 2002. Learning to map between ontologies on the semantic web. In Proceedings of the 11th international Conference on World Wide Web (Honolulu, Hawaii, USA, May 07 - 11, 2002). WWW '02. 662-673. | James, Jordan | Oct 15 |
The following table lists some ontologies and their corresponding graphs.
| Ontology | graph txt file | graph |
|---|---|---|
| 1. SUMO.owl (323KB) | sumoSubclass.txt(~700 edges) | subclass fdp layout |
| 2. All the relatestion extracted using jena. The java program | sumoAll.txt(~2700 edges) | sumoAll fdp layout, circle layout, twopi layout |
| 3. dbpedia.owl(~700KB) | graph txt | |
| 4. SUMO2.owl(~30MB) | graphviz txt | twopi layout(79MB) , a 3D fig produced by Wang Hao |
| 5. SUMO2 subclass (5000 edges) | graphviz txt |
fdp fig , dot fig , twopi fig , |
| 6. SUMO2 subclass & domain range(7000 edges) | graphviz txt |
fdp fig , dot fig , twopi fig , |
| 7. SUMO2 domain range (1700 edges) | graphviz txt |
fdp fig , dot fig , twopi fig , |
| 8. SUMO2 classes | graphviz txt |
fdp fig , dot fig , twopi fig , |
The graph is then drawn using graphviz. Grapphviz can create several different layouts from a graph definition. One layout is called fdp, which uses the spring model so that nodes having stronger connections are put closer on the canvas. The fdp layout reault is here . Graphviz can be downloaded here. The command to run graphviz is here. It also shows how to generate the different layouts.