Updated: 2009-04-16
The crawler uses protocol handlers to access content. When creating a content source, shared services administrators specify the protocol handler that the crawler will use when crawling the URLs specified in that content source. Microsoft Office SharePoint Server 2007 provides protocol handlers for all common Internet protocols. The following table shows the protocol handlers that are installed by default.
| Protocol handler |
Used to crawl |
|
Bdc
|
Business Data Catalog - Office SharePoint Server 2007 Enterprise Edition only
|
|
Bdc2
|
Business Data Catalog URLs (internal protocol) - Office SharePoint Server 2007 Enterprise Edition only
|
|
File
|
File shares
|
|
http
|
Web sites
|
|
https
|
Web sites over Secure Sockets Layer (SSL)
|
|
Notes
|
Lotus Notes databases
|
|
Rb
|
Exchange public folders
|
|
Rbs
|
Exchange public folders over SSL
|
|
Sps
|
People profile import from Windows SharePoint Services 2.0 server farms
|
|
Sps3
|
People profile import from Windows SharePoint Services 3.0 server farms only
|
|
Sps3s
|
People profile import from Windows SharePoint Services 3.0 server farms only over SSL
|
|
Spsimport
|
People profile import
|
|
Spss
|
People profile import from Windows SharePoint Services 2.0 server farms over SSL
|
|
Sts
|
Windows SharePoint Services 3.0 root URLs (internal protocol)
|
|
Sts2
|
Windows SharePoint Services 2.0 sites
|
|
Sts2s
|
Windows SharePoint Services 2.0 sites over SSL
|
|
Sts3
|
Windows SharePoint Services 3.0 sites
|
|
Sts3s
|
Windows SharePoint Services 3.0 sites over SSL
|
If you want to crawl content that does not have a protocol handler installed, you must install a third-party or custom protocol handler before you can crawl that content. Several third-party protocol handlers, with accompanying installation instructions, are available for Office SharePoint Server 2007. Check with the third-party manufacturer for instructions when you install third-party protocol handlers.
Crawling content from enterprise content management systems
Office SharePoint Server 2007 has protocol handlers that enable it to connect to and index content from other enterprise content management systems. Because the content is indexed into an Office SharePoint Server 2007 content index, users can search and get results from these content sources by using the SharePoint search user interface.
Enterprise Search Indexing Connector 2008 for EMC Documentum
Enterprise Search Indexing Connector 2008 for IBM FileNet
See Also