skip to main content
10.1145/1458082.1458104acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

MedSearch: a specialized search engine for medical information retrieval

Authors Info & Claims
Published:26 October 2008Publication History

ABSTRACT

People are thirsty for medical information. Existing Web search engines often cannot handle medical search well because they do not consider its special requirements. Often a medical information searcher is uncertain about his exact questions and unfamiliar with medical terminology. Therefore, he sometimes prefers to pose long queries, describing his symptoms and situation in plain English, and receive comprehensive, relevant information from search results. This paper presents MedSearch, a specialized medical Web search engine, to address these challenges. MedSearch uses several key techniques to improve its usability and the quality of search results. First, it accepts queries of extended length and reforms long queries into shorter queries by extracting a subset of important and representative words. This not only significantly increases the query processing speed but also improves the quality of search results. Second, it provides diversified search results. Lastly, it suggests related medical phrases to help the user quickly digest search results and refine the query. We evaluated MedSearch using medical questions posted on medical discussion forums. The results show that MedSearch can handle various medical queries effectively and efficiently.

References

  1. A. Anagnostopoulos, A. Z. Broder, and D. Carmel. Sampling Search-Engine Results. WWW 2005: 245--256. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. E. Agichtein, E. Brill, and S. T. Dumais. Improving Web Search Ranking by Incorporating User Behavior Information. SIGIR 2006: 19--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. A. Baeza-Yates, B. A. Ribeiro-Neto. Modern Information Retrieval. ACM Press/Addison-Wesley, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. W. Boswell. Healthline.com - A Medical Search Engine. websearch.about.com/od/enginesanddirectories/a/healthline.htm.Google ScholarGoogle Scholar
  5. E. A. Brewer. Lessons from Giant-Scale Services. IEEE Internet Computing 5(4): 46--55, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Brin, L. Page. The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks 30(1-7): 107--117, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Z. Broder. Identifying and Filtering Near-Duplicate Documents. CPM 2000: 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Curbside.MD homepage. http://www.curbside.md, 2008.Google ScholarGoogle Scholar
  9. M. Charikar, C. Chekuri, and T. Feder et al. Incremental Clustering and Dynamic Information Retrieval. STOC 1997: 626--635. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Chau, H. Chen. Comparison of Three Vertical Search Spiders. IEEE Computer 36(5): 56--62, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. G. Carbonell, J. Goldstein. The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries. SIGIR 1998: 335--336. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. EasyDiagnosis medical expert system homepage. http://easydiagnosis.com.Google ScholarGoogle Scholar
  13. 'Googling' Aids Difficult Diagnoses. http://www.e-health-insider.com/news/item.cfm?ID=2258, 2006.Google ScholarGoogle Scholar
  14. Google Health homepage. http://www.google.com/Top/Health.Google ScholarGoogle Scholar
  15. D. Harman. Relevance Feedback Revisited. SIGIR 1992: 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Healthline homepage. http://www.healthline.com.Google ScholarGoogle Scholar
  17. T. H. Haveliwala, A. Gionis, and D. Klein et al. Evaluating Strategies for Similarity Search on the Web. WWW 2002: 432--442. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. A. Hearst, J. O. Pedersen. Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results. SIGIR 1996: 76--84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. K. Järvelin, J. Kekäläinen. IR Evaluation Methods for Retrieving Highly Relevant Documents. SIGIR 2000: 41--48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Kumaran, J. Allan. A Case for Shorter Queries, and Helping Users Create Them. HLT 2007.Google ScholarGoogle Scholar
  21. M. Klein, H. Easley. Checking Medical Facts Online can be OK, but don't Become a 'Cyberchondriac'. The Journal News, June 26, 2006. http://www.thejournalnews.com/apps/pbcs.dll/article?AID=/20060626/NEWS03/606260311/1019.Google ScholarGoogle Scholar
  22. Family Medicine Online homepage. http://www.hmc.psu.edu/ume/fcmonline/index.htm, 2007.Google ScholarGoogle Scholar
  23. R. Kraft, F. Maghoul, and C. Chang. Y!Q: Contextual Search at the Point of Inspiration. CIKM 2005: 816--823. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. M. Kaszkiel, J. Zobel. Passage Retrieval Revisited. SIGIR 1997: 178--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. Lawrie, B. W. Croft, and A. L. Rosenberg. Finding Topic Words for Hierarchical Summarization. SIGIR 2001: 349--357. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. X. Long, T. Suel. Optimized Query Execution in Large Search Engines with Global Page Ordering. VLDB 2003: 129--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Medical Search Engine Rated 'Better Than Google'. http://www.ehiprimarycare.com/news/item.cfm?ID=2318, 2006.Google ScholarGoogle Scholar
  28. MeSH homepage. http://www.nlm.nih.gov/mesh/meshhome.html, 2006.Google ScholarGoogle Scholar
  29. The National Coalition on Health Care. Facts on the Cost of Health Care. http://www.nchc.org/facts/2006%20Fact%20Sheets/Cost%20-%202006.pdf, 2006.Google ScholarGoogle Scholar
  30. T. Nomoto, Y. Matsumoto. A New Approach to Unsupervised Text Summarization. SIGIR 2001: 26--34. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. D. Pelleg, A. W. Moore. X-means: Extending K-means with Efficient Estimation of the Number of Clusters. ICML 2000: 727--734. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. F. Radlinski, S. Dumais. Improving Personalized Web Search Using Result Diversification. SIGIR 2006: 691--692. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. L. Rosenberger. Google Maximum Search Length Increased. lbr.library-blogs.net/google_maximum_search_length_increased.htm, 2005.Google ScholarGoogle Scholar
  34. S. E. Robertson, S. Walker, and M. Hancock-Beaulieu. Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive. TREC 1998: 199--210.Google ScholarGoogle Scholar
  35. SearchMedica - The GPs search engine. www.searchmedica.co.uk/searchmedica/EUIHomeAction.do, 2006.Google ScholarGoogle Scholar
  36. C. Sherman. Curing Medical Information Disorder. http://searchenginewatch.com/showPage.html?page=3556491, 2005.Google ScholarGoogle Scholar
  37. A. Singhal. Modern Information Retrieval: A Brief Overview. IEEE Data Eng. Bull. 24(4): 35--43, 2001.Google ScholarGoogle Scholar
  38. B. Shneiderman, D. Byrd, and W. B. Croft. Clarifying Search: A User-Interface Framework for Text Searches. D-Lib Magazine, January 1997.Google ScholarGoogle Scholar
  39. J. Shapiro, I. Taksa. Constructing Web Search Queries from the User's Information Need Expressed in a Natural Language. SAC 2003: 1157--1162. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. A. Spink, Y. Yang, and J. Jansen et al. A Study of Medical and Health Queries to Web Search Engines. Health Information and Libraries Journal 21(1): 44--51, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  41. M. Steinbach, G. Karypis, and V. Kumar. A Comparison of Document Clustering Techniques. Text Mining Workshop, KDD 2000.Google ScholarGoogle Scholar
  42. SMART Stopword List. http://www.lextek.com/manuals/onix/stopwords2.html, 2006.Google ScholarGoogle Scholar
  43. YourDiagnosis medical expert system homepage. http://www.yourdiagnosis.com.Google ScholarGoogle Scholar
  44. WebMD homepage. http://www.webmd.com.Google ScholarGoogle Scholar
  45. Q. T. Zeng, J. Crowell, and R. M. Plovnick et al. Assisting Consumer Health Information Retrieval with Query Recommendations. JAMIA 13(1): 80--90, 2006.Google ScholarGoogle Scholar
  46. O. Zamir, O. Etzioni. Web Document Clustering: A Feasibility Demonstration. SIGIR 1998: 46--54. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. C. Zhai, W. W. Cohen, and J. D. Lafferty. Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval. SIGIR 2003: 10--17. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. B. Zhang, H. Li, and Y. Liu et al. Improving Web Search Results Using Affinity Graph. SIGIR 2005: 504--511. Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. C. Ziegler, S. M. McNee, and J. A. Konstan et al. Improving Recommendation Lists through Topic Diversification. WWW 2005: 22--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. G. Luo, C. Tang, H. Yang, and X. Wei. MedSearch: A Specialized Search Engine for Medical Information. Poster at WWW 2007: 1175--1176. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Medstory homepage. http://www.medstory.com.Google ScholarGoogle Scholar
  52. M. Sahami, T.D. Heilman. A Web-Based Kernel Function for Measuring the Similarity of Short Text Snippets. WWW 2006: 377--386. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. G. Luo. iMed: An Intelligent Medical Web Search Engine. Available at pages.cs.wisc.edu/~gangluo/imed.pdf, 2008.Google ScholarGoogle Scholar
  54. G. Luo. Intelligent Output Interface for Intelligent Medical Search Engine. AAAI 2008: 1201--1206.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. G. Luo, C. Tang. On Iterative Intelligent Medical Search. SIGIR 2008: 3--10. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. MedSearch: a specialized search engine for medical information retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      CIKM '08: Proceedings of the 17th ACM conference on Information and knowledge management
      October 2008
      1562 pages
      ISBN:9781595939913
      DOI:10.1145/1458082

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 26 October 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,861of8,427submissions,22%

      Upcoming Conference

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader