Community Associations

Victoria

Search Engine Demo

We built our own small search engine with the open source Stract search engine, indexing Common Crawl web crawl data for a list of community association websites (on this page) in Victoria and Saanich.

Some sample searches: "gorge", "festival", "harm reduction", "council"

Read the blog post about how we made it!

Indexes

Neighbourhoods

Saanich

Indexes

Neighbourhoods

Greater Victoria

Common Crawl Archives Map

Extracting data from Common Crawl

We use the cdxt command-line tool (part of cdx_toolkit) with the following command to get recent crawl results for each website. Note that some websites have no crawl data because their robots.txt settings block the crawler.

cdxt --crawl 1 --limit 50 --verbose warc 'secure.pickleballcanada.org/club/victoria-regional-pickleball-association/*'
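The same call can be repeated for every site on this page. The sketch below is a rough illustration, not the scripts from the repo: it builds the cdxt argument list shown above for each URL pattern and, as a side helper, checks whether a robots.txt body would block CCBot (Common Crawl's crawler), which is the usual reason a site returns no records. The names `SITE_PATTERNS`, `build_cdxt_command`, `fetch_all`, and `ccbot_allowed` are hypothetical, and only the first pattern comes from this page.

```python
import shutil
import subprocess
from urllib import robotparser

# Hypothetical site list; only this one pattern appears on the page.
SITE_PATTERNS = [
    "secure.pickleballcanada.org/club/victoria-regional-pickleball-association/*",
]


def build_cdxt_command(pattern, crawls=1, limit=50):
    """Return the argument list matching the cdxt command shown above."""
    return [
        "cdxt",
        "--crawl", str(crawls),
        "--limit", str(limit),
        "--verbose",
        "warc",
        pattern,
    ]


def ccbot_allowed(robots_txt, url):
    """Return True if the given robots.txt text lets CCBot fetch the URL."""
    parser = robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch("CCBot", url)


def fetch_all(patterns, dry_run=True):
    """Build one command per pattern; run it only when dry_run is False
    and cdxt is installed. Returns the list of commands that were built."""
    commands = [build_cdxt_command(p) for p in patterns]
    for cmd in commands:
        if dry_run or shutil.which("cdxt") is None:
            # Print instead of running, so the sketch is safe without cdxt.
            print(" ".join(cmd))
            continue
        subprocess.run(cmd, check=False)
    return commands
```

Calling `fetch_all(SITE_PATTERNS)` only prints the commands; passing `dry_run=False` runs them if cdxt is on the PATH. Passing the arguments as a list avoids shell quoting problems with the trailing `*` in each pattern.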

A GitHub repo with the scripts is here:

Source document

Publish-to: 6kgruqaeaaaa.vichex.ca/community-associations/