ABSTRACT
People are thirsty for medical information. Existing Web search engines often cannot handle medical search well because they do not consider its special requirements. Often a medical information searcher is uncertain about his exact questions and unfamiliar with medical terminology. Therefore, he sometimes prefers to pose long queries, describing his symptoms and situation in plain English, and to receive comprehensive, relevant information from search results. This paper presents MedSearch, a specialized medical Web search engine, to address these challenges. MedSearch uses several key techniques to improve its usability and the quality of search results. First, it accepts queries of extended length and rewrites long queries as shorter ones by extracting a subset of important and representative words. This not only significantly increases the query processing speed but also improves the quality of search results. Second, it provides diversified search results. Lastly, it suggests related medical phrases to help the user quickly digest search results and refine the query. We evaluated MedSearch using medical questions posted on medical discussion forums. The results show that MedSearch can handle various medical queries effectively and efficiently.
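The abstract does not say how MedSearch selects the "important and representative words" of a long query. As a rough illustration of the general idea only, the sketch below drops stopwords and keeps the few remaining words with the highest TF-IDF score against a background corpus. The TF-IDF scoring, the tiny stopword list, the `max_terms` cutoff, and the names `tokenize` and `shorten_query` are all assumptions made for this example, not the authors' method.

```python
import math
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption for this sketch).
STOPWORDS = {
    "i", "a", "an", "the", "and", "or", "my", "me", "have", "has", "had",
    "been", "for", "of", "in", "on", "to", "is", "it", "with", "at", "also",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def shorten_query(query, corpus, max_terms=5):
    """Return up to max_terms query words with the highest TF-IDF score.

    corpus is a list of background documents used to estimate how common
    each word is. Kept words are returned in their original query order.
    """
    docs = [set(tokenize(doc)) for doc in corpus]
    n_docs = len(docs)

    # Term frequency within the query, ignoring stopwords.
    tf = Counter(t for t in tokenize(query) if t not in STOPWORDS)

    def score(term):
        df = sum(1 for doc in docs if term in doc)
        idf = math.log((n_docs + 1) / (df + 1)) + 1.0  # smoothed IDF
        return tf[term] * idf

    # Rank by score, breaking ties alphabetically for determinism.
    ranked = sorted(tf, key=lambda t: (-score(t), t))
    keep = set(ranked[:max_terms])

    # Emit kept words once each, in order of first appearance in the query.
    shortened = []
    for token in tokenize(query):
        if token in keep and token not in shortened:
            shortened.append(token)
    return shortened


background = [
    "three days of rain kept me at home",
    "my arm hurt after three days of tennis",
    "the flu often brings a fever",
    "days are short in winter",
]
long_query = (
    "I have had a severe headache and a fever for three days "
    "and also a rash on my arm"
)
print(shorten_query(long_query, background, max_terms=3))
```

With this toy background corpus, everyday words such as "three" and "days" score low because they appear in several documents, so the shortened query keeps the rarer symptom words. A real system would need a much larger corpus and, most likely, medical vocabulary knowledge.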
Index Terms
- MedSearch: a specialized search engine for medical information retrieval