| By Mike Svatek | Article Rating: |
|
| July 31, 2008 02:45 PM EDT | Reads: |
6,427 |
George Orwell didn't specifically mention enterprise search in his visionary book 1984, but he made a statement that still resonates today. When it comes to relevant search query results, "it's not about the statistics." Sanity comes from the human element.
Until recently, this concept was mostly ignored by enterprise search solutions. Instead, search was based on text-matching algorithms and models that methodically sifted through link structures or categorization schemes. After analysis and compilation, a surge of search results (all quantifiably correct) were released. While a statistician might be impressed by the outcome, the user who initiated the search simply drowned in a sea of homogenous data.
The question became, "Is it possible to produce a set of meaningful search results that will help people, rather than inundate them?"
In response, enterprise search technologies introduced tools that allowed "experts" to tune the search algorithm for their specific content, and introduced meta-tagging best practices to structure content and produce more meaningful results. More recently, explicit user actions such as click-throughs, ratings, and feedback were introduced to solve the issue of search relevancy. Today, advanced techniques including "social search" have evolved to take into account how humans search for, find, and consume information and products in the physical world.
Comparing Various Approaches to Solve Search
Enterprise search has long been the de facto standard for corporate knowledge databases and websites. Because this approach is typically based on text-matching algorithms, it does a thorough job in finding all documents that match a particular query term. Some highly evolved search engines use specialized classification systems or pattern recognition techniques that rely on statistical inference. As a result, enterprise search engines have experienced huge increases in performance, comprehensiveness, and automation. However, they still lack the single most important ingredient that produces relevant search results: subjectivity. Guided by a few ambiguous words typed into a text field, a search engine still has no reliable way of accurately interpreting the actual intent of an individual user.
To cope with subjectivity, and the lack of understanding around user intent, various approaches including expert-based tuning and user-explicit tuning have been developed to try and provide more useful results to the end user.
Different expert tuning approaches can include tuning based on algorithms, tuning the actual content, and tuning based on queries. The primary issues with these expert-driven approaches are that they can be severely labor intensive, biased, and lack insight as to the true intent of the user. Ultimately they proved to be too rigid and laborious over time and so a more scalable model was envisioned, one that would take into account user feedback.
Published July 31, 2008 Reads 6,427
Copyright © 2008 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By Mike Svatek
Mike Svatek is currently vice president of marketing for Baynote and has been taking Internet technologies to market for more than a decade. Prior to Baynote, he was at Interwoven where he set the product direction and marketing strategy for initiatives related to search, portal, and web content management, and drove business development activities with partners in support Interwoven's global customer base. Mike holds an MBA from the Haas School of Business, UC Berkeley, and a BBA from the McCombs School of Business at the University of Texas, Austin.
- Asynchronous Logging Using Spring
- What to Expect in 2012: Cloud Computing and Open Source Software
- Will PaaS Finally Bring Open Source Love to the Enterprise?
- AT&T Joins OpenStack, Floats Cloud Architect
- Red Hat Sets Up GlusterFS Advisory Board
- Linux Virtualization and Tired Open Source Myths
- Acquia Announces Two New Board Members
- OpenOffice.com Lives
- Cloud Computing: A Platform-First Approach
- Powering the Cloud with Open Source
- Top 10 Open Source eCommerce Software (Joomla and Drupal)
- Piston Delivers First OpenStack-Based Cloud OS
- Adobe Sends Flex to the Apache Foundation
- i-Technology in 2012: Five Industry Predictions
- Microsoft Tries Hadoop on Azure
- OpenXava 4.3: Rapid Java Web Development
- Asynchronous Logging Using Spring
- StorSimple Supports OpenStack
- What to Expect in 2012: Cloud Computing and Open Source Software
- Will PaaS Finally Bring Open Source Love to the Enterprise?
- AT&T Joins OpenStack, Floats Cloud Architect
- More Use Cases for Big Data Analytics
- Red Hat Sets Up GlusterFS Advisory Board
- Linux Virtualization and Tired Open Source Myths
- After Ubuntu, Windows Looks Increasingly Bad, Increasingly Archaic, Increasingly Unfriendly
- SCO CEO Posts Open Letter to the Open Source Community
- Simula Labs Launches Hosted Delivery Platform To Enable Enterprise Open Source Adoption
- Where Are RIA Technologies Headed in 2008?
- Source Claims SCO Will Sue Google
- How Open Is "Open"? – Industry Luminaries Join the Debate
- Latest SCO News is Plain Weird
- SCO Claims Linux Lifted ELF
- IBM Tells SCO Court It Can't Find AIX-on-Power Code
- Flashback: Investing in 'Professional Open Source' - Exclusive 2004 Interview with David Skok, Matrix Partners
- Developing an Application Using the Eclipse BIRT Report Engine API
- HP Starts Pushing Desktop Linux




















