Crawl Anywhere 1.1.3 available

We just discovered that the download links for version 1.1.2 were broken. So we published the release 1.1.3 with corrects links. This new release includes one new pipeline stage and a crawler bug fix.

Solr schema

The default Sorl schemas provided now define "AND" as the defaultOperator.

Crawler

Fixe in the url normalisation. The urls http://www.domain.com/ and http://www.domain.com:80/ were considered as different. 

Pipeline

The sample xml mapping file for the SolrIndexQueueWriter stage now explains that the following mapping is mandatory

The new FieldMapping stage was added.

Leave a Reply

 

 

 

You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code class="" title="" data-url=""> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong> <pre class="" title="" data-url=""> <span class="" title="" data-url="">