Each group should submit a final project report containing no more than 1000 words no later than December 15. A web site should also be available by December 15 for more details of the project including a live demo.
topic | Papers | Time | |
---|---|---|---|
Crawler tutorial |
| Mahmud Sumon, Mohit Gherani | Oct 15 |
Pagerank algorithm tutorial | Xiutian Cui, Xiaoni Cao | Oct 15 | |
Near duplicate detection | Yi Zhang, Jiayi Wu | Oct 15 | |
Search Engine tutorial | Shaochen Zheng, Sun Xu | Oct 15 | |
Clustering Coefficient | |||
Search engines |
| ||
Web as graph |
| Adewale Agboola, Amrut Chachad | Oct 15 |
Topic | Papers | Time | |
---|---|---|---|
A vision of semantic web | Tim Berners-Lee, James Hendler and Ora Lassila, The Semantic Web, A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities, Scientific American, May 2001 | ||
DBPedia | DBpedia: A Nucleus for a Web of Open Data,
ren Auer, Christian Bizer, Georgi Kobilarov, Jens Lehmann, Richard Cyganiak and Zachary Ives, The Semantic Web,722--735,2007, Springer
Slides |
Joshua Chittle, John Donais | Oct 17 |
Social Web | M Gjoka et al., A walk in facebook:uniform sampling of users in online social networks | ||
Ontology matching | Doan, A., Madhavan, J., Domingos, P., and Halevy, A. 2002. Learning to map between ontologies on the semantic web. In Proceedings of the 11th international Conference on World Wide Web (Honolulu, Hawaii, USA, May 07 - 11, 2002). WWW '02. 662-673. | James, Jordan | Oct 15 |
The following table lists some ontologies and their corresponding graphs.
Ontology | graph txt file | graph |
---|---|---|
1. SUMO.owl (323KB) | sumoSubclass.txt(~700 edges) | subclass fdp layout |
2. All the relatestion extracted using jena. The java program | sumoAll.txt(~2700 edges) | sumoAll fdp layout, circle layout, twopi layout |
3. dbpedia.owl(~700KB) | graph txt | |
4. SUMO2.owl(~30MB) | graphviz txt | twopi layout(79MB) , a 3D fig produced by Wang Hao |
5. SUMO2 subclass (5000 edges) | graphviz txt |
fdp fig , dot fig , twopi fig , |
6. SUMO2 subclass & domain range(7000 edges) | graphviz txt |
fdp fig , dot fig , twopi fig , |
7. SUMO2 domain range (1700 edges) | graphviz txt |
fdp fig , dot fig , twopi fig , |
8. SUMO2 classes | graphviz txt |
fdp fig , dot fig , twopi fig , |
The graph is then drawn using graphviz. Grapphviz can create several different layouts from a graph definition. One layout is called fdp, which uses the spring model so that nodes having stronger connections are put closer on the canvas. The fdp layout reault is here . Graphviz can be downloaded here. The command to run graphviz is here. It also shows how to generate the different layouts.