SUN, LEILEI (2009) Smart Search Engine For Information Retrieval. Masters thesis, Durham University.
This project addresses the main research problem in information retrieval and semantic search. It proposes the smart search theory as new theory based on hypothesis that semantic meanings of a document can be described by a set of
keywords. With two experiments designed and carried out in this project, the experiment result demonstrates positive evidence that meet the smart search theory.
In the theory proposed in this project, the smart search aims to determine a set of keywords for any web documents, by which the semantic meanings of the documents can be uniquely identified. Meanwhile, the size of the set of keywords is supposed to be small enough which can be easily managed. This is the fundamental assumption for creating the smart semantic search engine. In this project, the rationale of the assumption and the theory based on it will be discussed, as well as the processes of how the theory can be applied to the keyword allocation and the data model to be
generated. Then the design of the smart search engine will be proposed, in order to create a solution to the efficiency problem while searching among huge amount of increasing information published on the web.
To achieve high efficiency in web searching, statistical method is proved to be an effective way and it can be interpreted from the semantic level. Based on the frequency of joint keywords, the keyword list can be generated and linked to each other to form a meaning structure. A data model is built when a proper keyword list is achieved and the model is applied to the design of the smart search engine.
|Item Type:||Thesis (Masters)|
|Award:||Master of Science|
|Keywords:||Search Engine, Google, Information Theory, Information Retrieval, Semantic Web|
|Faculty and Department:||Faculty of Science > Engineering and Computing Science, School of (2008-2017)|
|Copyright:||Copyright of this thesis is held by the author|
|Deposited On:||17 Dec 2009 12:42|