Teach search crawler about internal sitemap
author Magnus Hagander <magnus@hagander.net>
Thu, 23 Mar 2017 15:31:39 +0000 (16:31 +0100)
committer Magnus Hagander <magnus@hagander.net>
Thu, 23 Mar 2017 15:31:39 +0000 (16:31 +0100)
commit 7edb14284dba2b521249731073ad0b44267cb479
tree e6dcd8235ae07c01e69c18b73399097c8504a216
parent 428f299f48e4daf507cddfeb520e643c907b3227

We only support the internal sitemap for our main website, which uses a
sitemap, so implement it only for that provider. Always probe
sitemap_internal.xml, since we don't even try to access any external
sites on it.
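
As a rough illustration of the idea, here is a minimal sketch of probing
both a public and an internal sitemap and tagging each entry with where it
came from. This is not the code in sitemapsite.py: the function names, the
"internal" flag, and the example.org URLs are all assumptions made for the
example, and fetching the documents over HTTP is left out.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urljoin

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_urls(base_url):
    """Return the sitemap URLs to probe: the public one, then the internal one.

    Hypothetical helper; the file names follow the commit message.
    """
    return [urljoin(base_url, name)
            for name in ("sitemap.xml", "sitemap_internal.xml")]


def parse_sitemap(xml_text, internal=False):
    """Yield (url, priority, internal) for each <url> entry in a sitemap.

    The `internal` flag is an assumption: it simply records which sitemap
    the entry was read from.
    """
    root = ET.fromstring(xml_text)
    for entry in root.iter(SITEMAP_NS + "url"):
        loc = entry.findtext(SITEMAP_NS + "loc")
        if not loc:
            continue  # skip malformed entries without a location
        prio = entry.findtext(SITEMAP_NS + "priority")
        # 0.5 is the default priority in the sitemap protocol.
        yield (loc.strip(), float(prio) if prio else 0.5, internal)
```

A caller would fetch each URL from sitemap_urls() in turn and pass the
response body to parse_sitemap(), setting internal=True for the second one.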
tools/search/crawler/lib/basecrawler.py
tools/search/crawler/lib/genericsite.py
tools/search/crawler/lib/sitemapsite.py
tools/search/sql/schema.sql