Office SharePoint Server 2007 provides several settings that enable SSP and site collection administrators to control what users see in search results pages. Although you can control the search results in numerous different ways, before deployment we recommend that you:
Keywords, sometimes called keyword phrases, are the words that users type into a search box when constructing a query. When users perform a simple keyword search, for example entering the word "widget" in the search box and clicking the Go Search button, Office SharePoint Server 2007 displays the search results of all content in the selected scope that contains that keyword.
Office SharePoint Server 2007 enables site collection administrators to create an entity called a keyword that is directly related to keyword phrases of the same name that are in the index. A site collection administrator can create a keyword using one or more words. For example, a keyword can be a single word, such as "OOF", or a group of words that must be typed in a particular order, such as "out of office".
In addition to the name of the keyword, also called the keyword phrase, site collection administrators can create keywords that are composed of one or more of the following options:
-
A definition of the keyword that appears in search results
-
One or more synonyms
-
One or more best bets, which are the URLs that SSP administrators specify as being highly relevant for a particular keyword.
Note: |
|---|
|
Although you can create a keyword that does not contain any of the optional information listed above, doing so does not improve the relevance of search results.
|
Keywords enable site collection administrators to improve the relevance of end user queries. The search results for any site collection can be modified to promote specific content so that it appears more prominently in response to queries that use particular search terms. Although keywords are planned, implemented, and managed at the site collection level, it is a good idea to make your planning and implementation consistent throughout your organization.
Keyword definitions are a good way to provide easy access to information about high-priority concepts in each site collection. For each concept, site collection administrators can create a keyword so that the definition of the keyword appears in the Search Best Bets Web Part next to the search results. For example, a sales portal devoted to selling a particular line of products might provide definitions for the major items in the product line. These definitions can be used to help sales associates understand their products better, or the definitions can be displayed in search results on a public-facing portal site for customers.
Keyword example
A site collection administrator knows that end users are having difficulty finding the calendar that is used to track when team members are out of the office. Users report that when they search for the calendar, their queries produce several pages of irrelevant search results, and that they give up after looking through the first few search results pages.
The site collection administrator decides to create a keyword named "oof" containing the following items:
The site collection administrator then asks the end users to use this new keyword "oof" or its synonym "time off" when searching for the calendar.
The following figure shows an example of the default search results page that end users see when searching on the keyword "oof" from the Search Center in this scenario. Note that best bets and keyword descriptions only appear on search results pages for searches that are run using the Search Center, by default.
In the figure above, the query that the end user ran is shown (callout 1). The keyword highlighting feature also shows the keywords in the content as bold text (callout 2). The description that the site collection administrator assigned to this keyword appears in the top-right corner of the search results page, by default (callout 3). Each keyword can be associated with a definition and you can include a URL in definitions. Accordingly, it is a good idea to:
-
Identify definition sources during planning.
-
Include a separate step during planning to design a glossary of all definitions used by keywords in each site collection.
-
Create some keywords for the sole purpose of associating the keywords with a definition.
The Best Bet (callout 4) appears directly below the keyword description, if there is one. A best bet is more than a URL. It also has a title and can optionally have a description. In this example, the site collection administrator named the best bet "Out of Office page". The description that the administrator assigned to the best bet appears directly below its name, and then the URL of the best bet appears below that.
Specific documents, sites, and people with expertise in the concept associated with a search term are common uses of best bets. It is important to consider the title and description of each best bet during content planning to increase the relevance and usability of each best bet. You can associate up to 25 best bets with each keyword in the administration user interface, and many more with the object model, but it is a good idea to not overuse best bets. Effective content planning can help you identify an appropriate number of best bets for each keyword that balances the number of search results with search relevancy.
Note: |
|---|
|
Because the URL of a best bet is hard coded by the site collection administrator, it can be any URL. It can even point to content that has not been crawled.
|
You can use the same best bet for more than one keyword. If the best bet already exists, you can add it to any keyword without having to enter the properties for the best bet again and possibly introduce redundant best bets. You can also simultaneously change the URL and description for that best bet for all keywords that use it. This is particularly useful if you are using a test site during planning and before initial deployment.
Understanding synonyms
Synonyms are one or more words that closely relate to a particular keyword. For example, an effective synonym for the keyword "car" might be "auto", "automobile", or "SUV". These are all effective synonyms for that particular keyword because you would expect that some users that are searching for cars to type one of those words into the search box instead. Site collection administrators can define one or more synonyms for each keyword. The purpose of a synonym is to display the same keyword definition and best bet on the search results page that is displayed when using the keyword. Following the previous example, if end users run a search query using the synonym "out of office" they see the same keyword description and best bets that they see when they run the search query using the keyword "oof." The difference, however, is that they see search results for only content that contains the words "out of office" instead of content that contains the word "oof".
Synonyms are useful when several search terms are used for the same concept and content, so that search results are consolidated and not scattered across several search terms. The list that is updated when a site collection administrator creates keywords and adds synonyms is called a thesaurus. The thesaurus for Office SharePoint Server 2007 is compatible with the thesaurus for Microsoft Office SharePoint Portal Server 2003.
Use information architecture to identify keywords
You can analyze the content in your information architecture to compile a list of terms with which to create keywords that you can associate with specific and highly relevant content.
Relevant content is anything specific that you want people to see first or most prominently when they search using a specific keyword. Examples of relevant content for each major business concept or content area include:
Associating keywords with a best bet is helpful to encourage people to view the key documents that are needed to collaborate on key business processes. For example, a business might have a special template for expense reports, and a keyword "expense report" that promotes that template to the top of search results. Without that keyword, each employee might spend several minutes asking their colleagues for the appropriate URL or browsing the company Web site. With the keyword associated with the URL to the expense report template as a best bet, employees can quickly locate the template.
Keywords for sites are helpful for identifying the location of sites for relevant information in a large organization. For example, "holidays" could be a keyword associated with a best bet that contains the URL of the human resources site that contains information about paid time off for employees. Ideally, the best bet could contain the URL of the exact page that provides company holiday information.
Keywords that help people find other people encourage collaboration between people across the organization who have important knowledge to share, or just about important people in the company. For example, you can associate a title such as "CEO" with the chief executive officer for a company, or you can make a person's My Site a best bet for a keyword relating to their organization or area of expertise, such as "chemistry department."
Security considerations for keywords
Unlike previous versions of Office SharePoint Portal Server, keywords and best bets are not affected by security permissions, and all readers on the site collection can see all best bets and keywords for that site collection that appear in search results. Users who do not have permissions to see the page to which a best bet is linked cannot go to the page. However, they can see the description of the best bet on the search results page and the URLs to the content. This could expose information that some users were not intended to see.
Keywords are meant to provide high-priority results for all users.
If you want to target content to certain users based on their permissions, you can use audiences and targeted Web Parts in the appropriate places on the site collection.
Plan keywords across your organization
It is important to plan keywords in advance to help ensure consistency of keywords across your organization. Even though keywords are implemented at the site collection level, you should ensure consistency of keywords across site collections whenever possible.
Example of good keyword planning
Contoso Corporation has two site collections, one for the sales and one for the marketing department. A different site collection administrator is assigned to each. Because users of these site collections are spending too much time finding the corporation's customer list, these two site collection administrators decide that they need to create a keyword for their site collection and associate a best bet with that keyword that links users directly to the customer list.
To ensure that end users have a consistent experience on both site collections, the two site collection administrators collaborate to decide how the keyword and best bet should be defined.
Tip: |
|---|
|
Planning for consistency of keywords across your organization requires good collaboration among site collection administrators. Whenever possible, ensure that keywords and best bets are consistent across site collections.
|
By avoiding the use of different keywords and best bets across site collections that point to the same content, you can avoid end-user confusion. For example, if the site collection administrator of the sales site collection had created a keyword named "super list" and the site collection administrators of the marketing site collection had created a keyword named "master list", end users who use both site collections would become confused when their keyword searches do not consistently display the best bet they expect to see on the search results page. That is, an employee who primarily uses the sales site collection is accustomed to searching using the keyword "super list" and would expect it to work from either site collection.
In small organizations, the content planning team is likely to be small and organized around a single site collection, and planning for keywords might be organized by only one or two people. In large organizations, on the other hand, a larger planning team can be helpful. You should include business planners and administrators at each level to make sure all business needs are addressed.
Best bets appear in search results even if the content hasn't been crawled. This is another reason to plan keywords during initial deployment, so that high-priority content can be available in the early stages of a deployment before all content sources have been crawled. In rare cases where content cannot be crawled because search is missing a relevant IFilter or for any other technical reason, you can use best bets to make the content easier to find even though it hasn't been crawled.
Key people at each level of the organization plan keywords for their site collections. Those people use the same overall content plan, adapted for the content on the site collections that they are planning. When planning keywords, which starts before deployment and continues in waves after deployment, each set of content planners can communicate with each other to keep consistency in the overall plan.
Not all keywords will be planned before deployment. The role of your content planning teams is to identify the high-priority concepts that are most relevant to search queries in your organization, so that search queries are relevant to users from the first day of your deployment. The planning team can identify a contact person for each keyword who may or may not be someone on the planning team. After deployment, site collection administrators can expand the keyword list after identifying common search terms in the query logs.
In the planning phase, keyword list managers should consider how keywords match to queries. Keywords must match the complete string of search terms exactly, and must not use special syntax such as a plus sign (+) and a minus sign (-) when searching for content in lists. This prevents the return of multiple lists of keywords for the same search query, which streamlines search results.
Plan keyword management
The details of keyword management are mostly relevant to the daily operations of your site collections, but there are some aspects of administration that are worth considering when planning your deployment. Specifically the optional contact and publishing properties that site collection administrators can assign to properties.
Tip: |
|---|
|
The more planning you do before deployment, the less management will be needed during day to day operations.
|
In addition to the properties listed in the "Plan keywords, best bets, and synonyms" section earlier in this article, each has the following optional properties:
Keywords can be required to go through approval before they affect search results, and can also be set to start or expire after a certain amount of time. The high-priority keywords identified during initial planning are unlikely to be temporary, except for content that is relevant to people using a site collection during the initial deployment.
The contact for each keyword is the person who should be contacted when a keyword expires, if it is set to expire. Content planners for each site collection should consider who is going to be managing keywords after the initial deployment, and include at least some of those people in the planning process at the site collection level.
However, part of the planning process is anticipating who will make decisions about keywords in the future. Making those decisions during the planning process can improve the transition to regular operations of the site collection, and promote consistent and efficient use of keywords in the future.
By using the object model, you can also import and export keywords between site collections as an Excel spreadsheet, so if some best bets apply to other site collections, you can plan once and deploy on all relevant site collections. This also allows keyword managers for a site collection at the divisional or project level to suggest best bets for a central site collection in a shared services environment.
For more information about managing keywords, see the Operations Guide for Office SharePoint Server 2007.
The greater the body of content that is being searched over, the more likely it is that several pages of search results will be displayed for a particular query. This is especially true when basic keyword queries instead of advanced queries are used. To facilitate a positive end-user experience, ensure that links to the most relevant content are displayed as early as possible on the search results pages.
Office SharePoint Server 2007 enables SSP administrators to assign indexed Web pages relevance settings. Each relevance setting, which is associated with a particular Web page, determines how close to the top of the search results page the link to a particular page appears. Pages that are assigned a relevance setting are known as authoritative pages.
Authoritative page settings are one factor in prioritizing search results. For more information about search relevance, see Enterprise Search Relevance Architecture Overview (http://go.microsoft.com/fwlink/?LinkId=93736).
Authoritative page settings are configured at the SSP level and apply to all queries made using that SSP. SSP administrators can assign sites to one of four authoritative page levels:
Web pages are weighted based on how authoritative they are, with each level receiving proportionate relevance weighting. By default, all top-level pages for Web applications are automatically added as most authoritative. You can move these pages to other authoritative page levels or remove them from authoritative page settings completely.
Sites that are not assigned an authoritative page ranking are weighted based on their click distance from an authoritative site. Click distance refers to the number of links between a content item and an authoritative page linking to the content item. For more information, see the "Click Distance" section Enterprise Search Relevance Architecture Overview (http://go.microsoft.com/fwlink/?LinkId=93736).
Sites that are assigned the Sites to demote setting will typically appear towards the end of the search results after all other relevance weighting factors have been considered.
This means that they often appear on the search results page after pages that are not even specified as an authoritative page. We recommend that you use this setting for sites that contain less relevant information (for example, an archive site).
When planning authoritative page settings, consider the purpose of each site, and review its subsites. Group authoritative sites into the three levels by importance and group the sites that are not likely to be relevant as sites to demote.
Good practices to use when planning authoritative page settings include:
-
SharePoint sites and business applications central to high-priority business processes will typically be most authoritative.
-
Sites that encourage collaboration or action are likely to be more authoritative than sites that are merely informative.
-
Sites that are informative but not central to high-priority business processes or used for collaboration are likely to be in the second or third level of authoritative sites.
-
External sites will typically be less authoritative, because your organization cannot control the content on those sites.
-
You don't need to assign an authoritative page setting to every site. It is a good idea to select relevance for a small number of sites that you know are most authoritative or less relevant, and adjust the authoritative page settings during normal operations based on feedback from users and information in the query logs.
|
Worksheet action
|
|---|
|
Record your decisions about authoritative pages in the "Authoritative pages" section of the End-user search experience worksheet (http://go.microsoft.com/fwlink/?LinkId=81039).
|
Federation, a new feature that was first introduced in Search Server 2008 and is available in Office SharePoint Server 2007 by installing the Infrastructure Update for Microsoft Office Servers, can be planned with the rest of the end-user search experience. Federation enables end users to issue a query that searches multiple sources and combines results into a single search results page. These sources can include:
-
Your company's enterprise content repositories
-
Internet search engines or subscriptions services used by your company
-
Enterprise documents indexed by Office SharePoint Server 2007 in other divisions or world regions
When the end user issues a query, Office SharePoint Server 2007 formats and renders the results alongside your indexed results by using new federation Web Parts.
When you plan the experience you want for your users when they search federated locations, try to target the searching needs and habits of users in your company. Ask yourself these questions: What content do your users need to find most to be productive? What queries are they currently using? Target your federated locations to solve the key information problems in your company.
When using federation it is tempting to add many federated locations to satisfy all the possible needs of your users. Unfortunately, this leads many users to disregard federated results as clutter.
To help ensure that federated results target useful answers to queries, federated locations can match specific query formats with trigger rules. When you create a trigger rule for a federated location, the Web Part that is associated with that location displays results only for queries that match the pattern or prefix that you specify.
For example, let's say that you work at a company called Contoso, where employees manufacture a product called a widget. Employees need to find these widgets many times a day by using a ten-digit widget ID. Widgets are stored in a database that cannot be crawled by Office SharePoint Server 2007. To enable Contoso employees to search for widgets, you build a federation connector that searches the widget database. However, displaying widget information for every query would likely frustrate your users. So, you create a federated location trigger by using a pattern that recognizes ten-digit queries. Now, when users search for widget IDs, they get a top federated result from the widget database.
For more information about using triggers and trigger rules, see Work with triggers and query templates (Office SharePoint Server).
You can add and configure federated results on the search results page with either a Federated Results Web Part or a Top Federated Results Web Part. By default, the search results page contains two Federated Search Results Web Parts and one Top Federated Results Web Part. You set the federated location and its properties in the Web Parts on the search results page.
For more information about federation, see Federating search results from other locations (Office SharePoint Server).