SIGIR 2007 Workshop "Improving Web retrieval for non-English queries", Amsterdam, 27 July 2007
Workshop Theme:
Over 60% of the online population are non-English speakers, and the number of non-English speakers is probably growing faster than that of English speakers. Recent studies have shown that non-English queries and unclassifiable queries have nearly tripled since 1997. Most search engines were originally engineered for English. They do not take full account of inflectional semantics or, for example, of diacritics or the use of capitals.
The main conclusion from the literature is that searching with non-English and non-Latin-based queries results in lower success and requires additional user effort to achieve acceptable recall and precision. Furthermore, international search engines (such as Yahoo and Google) are relatively weaker with monolingual non-English queries.
New tools and resources are needed to support researchers in non-English retrieval. New methodologies need to be proposed to help identify problems in existing search engines. New teaching strategies should be developed to help users formulate their queries more efficiently.
Aims and Topics:
The main objectives of the workshop are to propose techniques and to evaluate tools that improve the effectiveness of existing search engines. The specific aims of the workshop are:
Evaluate search engines on non-English queries and measure the additional user effort.
Define methodologies for evaluating the effectiveness of search engines on non-English queries.
Study the user query patterns in non-English Web retrieval.
Identify the factors that influence utilization of search engines in a multicultural world.
Propose extensions to the search engines to improve non-English Web retrieval.
Propose teaching strategies for helping users improve their searching behaviour.
Identify how standard IR techniques (indexing, query representation, query reformulation, etc.) can be adapted to Web retrieval for non-English languages.
Discuss the application of natural language processing techniques for non-English Web IR.
Workshop areas of interest include, but are not limited to:
Evaluation methodologies
Analysis of query logs
Localization of search engine interfaces
Performance issues of local search engines
NLP applications in non-English IR
User studies
Image and video retrieval services
Summarization
Teaching
Maarten de Rijke http://staff.science.uva.nl/~mdr/
Authors are invited to submit full papers or posters in PDF format using the ACM template page.
The workshop accepts two types of submissions: Full papers (8 pages max) and Posters (4 pages max).
Please use the online submission system (http://www.easychair.org/iNEWS07/) to submit your paper.
PDFs should not contain author information: please exclude author names, affiliations, and any other information that identifies the authors.
| 25 May 2007 | Paper submissions due |
| 15 June 2007 | Notifications of acceptance |
| 22 June 2007 | Camera-ready copy due |
| 27 July 2007 | Workshop |
The workshop is partially funded by:
"Rede Galega de Procesamento da Linguaxe e Recuperacion de Informacion (Galician Network for Language Processing and Information Retrieval), funded by Xunta de Galicia"
If you have any questions, comments, or suggestions, please contact Fotis Lazarinis (lazarinf AT teimes.gr) and/or Jesus Vilares (jvilares AT udc.es).