The user entered the query “What is the historical relation between Greek and Roma”.

1
The user entered the query “What is the historical relation between Greek and Roma”. Here are the query’s results. The user clicked the topic “Roman copies of Greek sculptures ”. Here are the results to the query with the topic. The user chose the article “Venus de’ Medici”. Here is the article. User Graphical Interface The user can choose to search by the article’s data. The user can choose to search one of the topics. Enter text for searching Search in the search results Choose an article Search by article’s data Search by article’s topics The users send questions in natural language (English/Hebrew), and receive relevant results What is…? The system tags the articles in the corpus with the topics The system builds automatically hierarchical topics by existing corpus The users can navigate in the system to understand the corpus Web system - available every where, every time rning Content Models for Semantic Search Eran Peer, Hila Shalom Advisors: Prof. Michael Elhadad, Dr. Menachem Adler Natural Sciences Engineering Sciences This project was programmed in Java and Javascript using Eclipse. The system was developed using Mallet, SolR, Morphological analyzer and Tomcat. Client HTTP Client – The functions in the client side GUI – Web based graphical user interface Server HTTP Servlet – Handles the connection with the client Morphological Analyzer – Analyze the articles into verb, noun, adjective etc. LDA – Algorithm that creates hierarchical topics from the corpus. Tags all the articles with the topics. Can be replaced with a better one. Corpus – Contains the articles and their data (author, date etc.). New and existing articles can be added or removed. Can be at English or Hebrew. SolR – System for searching quickly and efficiently in huge corpus. HTTP Client : Java script GUI Manager HTTP Servlet Manager LDA : Jar Morpholo gical Analyzer SolR Corpus to allow the user to explore a repository of textual documents, and discover material of interest even when he is not familiar with the content of the repository. Existing search engines provide powerful features to identify known documents by using a short description of their content (keywords, name of document). For example, a user wants to find information about a term that he knows, he enters the term's name in a search engine, and the search engine returns the address of the term's value at Wikipedia. The task we address is what happens when the user does not know the name of the term he is looking for, or when the term the user enters has several meanings. The system that we developed will help the user explore interactively the repository and the terms that are most significant in this repository. THE PROJECT GOAL The user can choose to remove parts of the query. The user can view the topics as a list or as a wheel The user can choose one of the topics to add it to the search. The user can choose one of the articles, to view it. The user can enter a query for new search or to search in the results. Our code at http:// code.google.com/p/semantic-search2011

description

Natural Sciences Engineering Sciences. Client. Server. HTTP Servlet. HTTP Client : Java script. Morphological Analyzer. LDA : Jar. Manager. Learning Content Models for Semantic Search. Manager. GUI. SolR. Eran Peer, Hila Shalom. - PowerPoint PPT Presentation

Transcript of The user entered the query “What is the historical relation between Greek and Roma”.

Page 1: The user entered the query “What is the historical relation between Greek and Roma”.

The user entered the query “What is the historical relation between Greek and Roma”.Here are the query’s results.

The user clicked the topic “Roman copies of Greek sculptures”. Here are the results to the query with the topic.

The user chose the article “Venus de’ Medici”.Here is the article.

User Graphical Interface

The user can choose to search by the article’s data.

The user can choose to search one of the topics.

Enter text for searching

Search in the search results

Choose an article

Search by article’s data

Search by article’s topics

The users send questions in natural language (English/Hebrew), and receive relevant results

What is…?

The system tags the articles in the corpus with the topics

The system builds automatically hierarchical topics by existing corpus

The users can navigate in the system to understand the corpus

Web system - available every where, every time

Learning Content Models for Semantic SearchEran Peer, Hila Shalom

Advisors: Prof. Michael Elhadad, Dr. Menachem Adler

Natural SciencesEngineering Sciences

This project was programmed in Java and Javascript using Eclipse.The system was developed using Mallet, SolR, Morphological analyzer and Tomcat.

Client• HTTP Client – The functions in the client side• GUI – Web based graphical user interface

Server• HTTP Servlet – Handles the connection with the client• Morphological Analyzer – Analyze the articles into verb,

noun, adjective etc.• LDA – Algorithm that creates hierarchical topics from

the corpus. Tags all the articles with the topics. Can be replaced with a better one.

• Corpus – Contains the articles and their data (author, date etc.). New and existing articles can be added or removed. Can be at English or Hebrew.

• SolR – System for searching quickly and efficiently in huge corpus.

ClientHTTP Client :

Java script

GUI

Manager

ServerHTTP

Servlet

Manager LDA : Jar

Morphological Analyzer

SolR Corpus

SYSTEM ARCHITECTURE

to allow the user to explore a repository of textual documents, and discover material of interest even

when he is not familiar with the content of the repository.

Existing search engines provide powerful features to identify known documents by using a short

description of their content (keywords, name of document). For example, a user wants to find

information about a term that he knows, he enters the term's name in a search engine, and the search

engine returns the address of the term's value at Wikipedia.

 

The task we address is what happens when the user does not know the name of the term he is

looking for, or when the term the user enters has several meanings. The system that we developed

will help the user explore interactively the repository and the terms that are most significant

in this repository.

THE PROJECT GOAL

The user can choose to remove parts of the query.

The user can view the topics as a list or as a wheel

The user can choose one of the topics to add it to the search.

The user can choose one of the articles, to view it.

The user can enter a query for new search or to search in the results.

Our code at http://code.google.com/p/semantic-search2011