- Architecture of an enterprise search solution
- Indexing and querying business data
- LOB data and people knowledge
|Microsoft Office SharePoint Server 2007 for Search Standard Edition||Microsoft Office SharePoint Server 2007 for Search Enterprise Edition||Microsoft Office SharePoint Server 2007 Standard Edition||Microsoft Office SharePoint Server 2007 Enterprise Edition|
|Indexes||40 file types out of the box (extensible)||40 file types out of the box (extensible)||40 file types out of the box (extensible)||40 file types out of the box (extensible)|
|Supports (out of the box) search on file shares, Web sites, SharePoint sites, Exchange Public Folders, Notes database files|
|Supports search on third-party document repositories|
|Supports search for people and Expertise|
|Supports searching on structured data sources|
|Provides secure content access control|
|Provides enhanced Search Center UI|
Several metadata tags are included in the relevancy calculations. Here are a few of the things considered:
- Click Distance Browsing distance from authoritative sites (shorter distances tend to be more relevant).
- Anchor Text Hyperlinks act as annotations on their target. In addition, they tend to be highly descriptive.
- URL Depth URLs higher in the hierarchy tend to be more relevant.
- URL Matching Direct matches on text that's in URLs.
- Metadata Extraction Automatically extracts titles and authors from document text if they are missing.
- Automatic Language Detection Helps create preference for results in your language.
- File Type Biasing Certain file types tend to be more relevant (for example, PPT files are often more relevant than XLS files).
- Text Analysis Traditional text ranking based on such factors as matching terms, term frequencies, and word variants.
© 2008 Microsoft Corporation and CMP Media, LLC. All rights reserved; reproduction in part or in whole without permission is prohibited.