Saturday, April 25, 2009

How do you convert the Web Content Management search seed list to a readable format?

Question
A wcmsearchseed list page is used by the WebSphere Portal search engine to crawl a Web Content Management (WCM) searchable site. However, the wcmsearchseed list page is not human readable.

Cause
By default, the wcmsearchseed list page is base64 encoded.

Answer
To see the wcmsearchseed list page in a readable format for debugging purposes, copy the Managed Web Content site content source URL, open a new Web browser window, and access the copied URL with the following URL parameters appended:

&userid=&password=
&debug=1


The number of items per page in the wcmsearchseed list is set by the SearchService.DefaultSeedPageSize property in /wcm/shared/app/config/wcmservices/SearchService.properties. The default is 200. To see more than the first page, also append the following URL parameter:

&pageNum=


The first page is &pageNum=0.
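Because pages are numbered from zero, the page that holds a given entry follows from integer division by the page size. The following Python sketch illustrates the arithmetic. It assumes the default page size of 200 and that entries fill the pages in order; the function names are illustrative and are not part of WCM.

```python
import math

# Default value of SearchService.DefaultSeedPageSize, as stated above.
DEFAULT_SEED_PAGE_SIZE = 200


def page_for_item(item_index, page_size=DEFAULT_SEED_PAGE_SIZE):
    """Return the zero-based pageNum holding the zero-based item_index.

    Assumes entries fill pages sequentially (an assumption, not confirmed
    by the product documentation quoted here).
    """
    if item_index < 0 or page_size <= 0:
        raise ValueError("item_index must be >= 0 and page_size must be > 0")
    return item_index // page_size


def page_count(total_items, page_size=DEFAULT_SEED_PAGE_SIZE):
    """Return how many pages are needed to list total_items entries."""
    if total_items < 0 or page_size <= 0:
        raise ValueError("total_items must be >= 0 and page_size must be > 0")
    return math.ceil(total_items / page_size)
```

Under these assumptions, a site with 450 searchable items would need pages &pageNum=0 through &pageNum=2.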

An example of the default wcmsearchseed list URL, with credentials but without the debug parameter, is as follows:

http://www.ibm.com:10038/wps/wcmsearchseed/searchseed?siteid=98eae7804755055eb71db746880f549b&userid=wpsadmin&password=wpsadmin_password


An example of the URL for the readable wcmsearchseed list is:

http://www.ibm.com:10038/wps/wcmsearchseed/searchseed?siteid=98eae7804755055eb71db746880f549b&userid=wpsadmin&password=wpsadmin_password&debug=1


An example of the URL for the second page of the readable wcmsearchseed list is:

http://www.ibm.com:10038/wps/wcmsearchseed/searchseed?siteid=98eae7804755055eb71db746880f549b&userid=wpsadmin&password=wpsadmin_password&debug=1&pageNum=1
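If you check several pages, it can be easier to build these URLs with a small script than to edit them by hand. The following Python sketch uses only the standard library and assumes that the content source URL already carries the siteid parameter, as in the examples above. The function name readable_seed_url is hypothetical and is not part of WebSphere Portal.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Parameters this helper manages; any existing copies are replaced.
_MANAGED = ("userid", "password", "debug", "pageNum")


def readable_seed_url(content_source_url, userid, password, page_num=None):
    """Append userid, password, debug=1 and (optionally) pageNum.

    content_source_url is the Managed Web Content site content source URL
    copied from the portal; page_num is zero-based (0 is the first page).
    """
    parts = urlsplit(content_source_url)
    # Keep existing parameters such as siteid, dropping ones we set ourselves.
    params = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in _MANAGED
    ]
    params += [("userid", userid), ("password", password), ("debug", "1")]
    if page_num is not None:
        params.append(("pageNum", str(page_num)))
    return urlunsplit(parts._replace(query=urlencode(params)))
```

Calling readable_seed_url with the example site URL, the wpsadmin credentials, and page_num=1 produces the second-page URL shown above. urlencode also escapes any special characters in the password, which a hand-edited URL would not.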