User profile discovery for web search

  • T. Gopalakrishnan*
  • , P. Segottuvelan
  • , J. Sathyamoorthy
  • *Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Citation (Scopus)

    Abstract

    The web has not achieved its goal of providing easy access to online information. As its size is increasing the abandons of available info on the web cause the testing phenomenon of information overload to web users. The system implements an experiential process to approximate semantic likely-hood using page calculations and text fragments retrieved from a web search engine for two words. Specifically, we define various word co-occurrence measures using page counts and integrate those with lexical patterns extracted from text snippets. To identify the numerous semantic relations that exist between two given words, we propose a novel pattern extraction algorithm and a pattern clustering algorithm. The optimal combination of page counts-based co-occurrence measures and lexical pattern clusters is learned using support vector machines. The proposed method outperforms various baselines and previously proposed web-based semantic similarity measures on three benchmark data sets showing a high correlation with human ratings. Moreover, the proposed method significantly improves the accuracy in a community mining task.

    Original languageEnglish
    Title of host publicationProceedings - 2014 International Conference on Intelligent Computing Applications, ICICA 2014
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages377-381
    Number of pages5
    ISBN (Electronic)9781479939664
    DOIs
    Publication statusPublished - 21-11-2014
    Event2014 International Conference on Intelligent Computing Applications, ICICA 2014 - Coimbatore, Tamilnadu, India
    Duration: 06-03-201407-03-2014

    Publication series

    NameProceedings - 2014 International Conference on Intelligent Computing Applications, ICICA 2014

    Conference

    Conference2014 International Conference on Intelligent Computing Applications, ICICA 2014
    Country/TerritoryIndia
    CityCoimbatore, Tamilnadu
    Period06-03-1407-03-14

    All Science Journal Classification (ASJC) codes

    • Computer Science Applications

    Fingerprint

    Dive into the research topics of 'User profile discovery for web search'. Together they form a unique fingerprint.

    Cite this