Semantic search

Semantic search seeks to improve search accuracy by understanding the searcher's intent and the contextual meaning of terms as they appear in the searchable dataspace, whether on the Web or within a closed system, to generate more relevant results. Semantic search systems consider various points including context of search, location, intent, variation of words, synonyms, generalized and specialized queries, concept matching and natural language queries to provide relevant search results.^[1] Major web search engines like Google and Bing incorporate some elements of semantic search.

Guha et al. distinguish two major forms of search: navigational and research.^[2] In navigational search, the user is using the search engine as a navigation tool to navigate to a particular intended document. Semantic search is not applicable to navigational searches. In research search, the user provides the search engine with a phrase which is intended to denote an object about which the user is trying to gather/research information. There is no particular document which the user knows about and is trying to get to. Rather, the user is trying to locate a number of documents which together will provide the desired information. Semantic search lends itself well with this approach that is closely related with exploratory search.

Rather than using ranking algorithms such as Google's PageRank to predict relevancy, semantic search uses semantics, or the science of meaning in language, to produce highly relevant search results. In most cases, the goal is to deliver the information queried by a user rather than have a user sort through a list of loosely related keyword results. However, Google itself has subsequently also announced its own Semantic Search project.^[3]

Author Seth Grimes lists "11 approaches that join semantics to search", and Hildebrand et al. provide an overview that lists semantic search systems and identifies other uses of semantics in the search process.^[4]

Other authors primarily regard semantic search as a set of techniques for retrieving knowledge from richly structured data sources like ontologies and XML as found on the Semantic Web.^[5] Such technologies enable the formal articulation of domain knowledge at a high level of expressiveness and could enable the user to specify their intent in more detail at query time.^[6]

Disambiguation

In order to understand what a user is searching for, word sense disambiguation must occur. When a term is ambiguous, meaning it can have several meanings (for example, if one considers the lemma "bark", which can be understood as "the sound of a dog," "the skin of a tree," or "a three-masted sailing ship"), the disambiguation process is started, thanks to which the most probable meaning is chosen from all those possible.

Such processes make use of other information present in a semantic analysis system and takes into account the meanings of other words present in the sentence and in the rest of the text. The determination of every meaning, in substance, influences the disambiguation of the others, until a situation of maximum plausibility and coherence is reached for the sentence. All the fundamental information for the disambiguation process, that is, all the knowledge used by the system, is represented in the form of a semantic network, organized on a conceptual basis.

In a structure of this type, every lexical concept coincides therefore with a semantic network node and is linked to others by specific semantic relationships in a hierarchical and hereditary structure. In this way, each concept is enriched with the characteristics and meaning of the nearby nodes.

Every node of the network (called Synset) groups a set of synonyms which represent the same lexical concept (called Synsets) and can contain:

single lemmata ('seat', 'vacation'; 'work', 'quick'; 'quickly', 'more', etc.)
compounds ('non-stop', 'abat-jour', 'policeman')
collocations ('credit card', 'university degree', 'treasury stock', 'go forward', etc.)

The semantic relationships (links), which identify the semantic relationships between the synsets, are the order principals for the organization of the semantic network concepts.

Commonly used searching methodologies

Mäkelä describes five mainly used methodologies:^[7]

RDF Path Traversal - traversing the net formed by a graph of information that uses the RDF data model.
Keyword to Concept Mapping^[8]
Graph Patterns - used to formulate patterns for locating interesting connecting paths between resources. Also commonly used in data visualization.
Logics - by using inference based on OWL
Fuzzy concepts, fuzzy relations, and fuzzy logics^[9]

Ten Defining Attributes

The attributes of semantic search (those qualities that make it distinct from non-semantic search) are not all necessarily advantages by definition. For example, some attributes may improve search accuracy because of an exhaustive reiterative process but by effect overconsume time and/or resources. Accordingly, these ten attributes are merely salient features although the underlying assumption is that under perfect conditions they are generally preferable.^[10]

1- Handling morphological variations
2- Handling synonyms with correct senses
3- Handling generalizations
4- Handling concept matching
5- Handling knowledge matching
6- Handling natural language queries and questions
7- Ability to point to uninterrupted paragraph and the most relevant sentence
8- Ability to Customize and Organic Progress
9- Ability to operate without relying on statistics, user behavior, and other artificial means
10- Ability to detect its own performance

Semantic search portals

Bing
Twinword Finder - Semantic search tool for web pages.
CMANTIK - semantic search in Wikipedia and news
FindMyCarrots - semantic search engine for online travel
Google, particularly its new Knowledge Graph feature
GoPubMed - for biomedical research - first semantic search engine, launched in 2002
iGlue - semantic search engine with realtime annotator plugin/bookmarklet which adds a smart layer to every website
International Digital Media Archive - Semantic document search engine, including an archive of full semantic metadata
Kosmix – social media semantic search
Kvasir -a semantic recommendation system seamlessly integrated in web browsing
Legal Intelligence - legal search
Lexxe – beta in early 2011
NTENT -(Formerly Vertical Search Works) semantic search indexing and ad matching technology
Rendipity - image semantic search
SILVIA - semantic search indexing and image matching^[11]
Swoogle
Thinkglue - semantic video search engine
vLex – vLex Global Legal Intelligence
Yummly - food and recipe semantic search
Zaptravel - Semantic search for travel and travel experiences
Klevu - Semantic understanding of keywords. Automatically adds categories based on semantic logic.

Enterprise semantic search engines

References

↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ Lua error in package.lua at line 80: module 'strict' not found.
↑ What is Semantic Search? Hakia 2011
↑ Lua error in package.lua at line 80: module 'strict' not found.

v t e Internet search
Types	Web search engine (List) Metasearch engine Collaborative search engine Human flesh search engine
Tools	Local search Vertical search Search engine marketing Search engine optimization Search oriented architecture Selection-based search Social search Document retrieval Text mining Web crawler Multisearch Federated search Search aggregator Index/Web indexing Focused crawler Spider trap Robots exclusion standard Distributed web crawling Web archiving Website mirroring software Web search query Voice search Natural language search engine Web query classification
Applications	Image search Video search engine Enterprise search Semantic search
Protocols and standards	Z39.50 Search/Retrieve Web Service Search/Retrieve via URL OpenSearch Representational State Transfer Website Parse Template Wide area information server
See also	Search engine Desktop search Online search

Semantic search

Contents

Disambiguation

Commonly used searching methodologies

Ten Defining Attributes

Semantic search portals

Enterprise semantic search engines

See also

References

Further reading

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools