I have a relatively small site that I maintain for my church. I downloaded your software and it "seems" to be working, however in the extraction info pane, crawler tab, it is telling me that there are currently over 43,000 pages to scan. most of the 4,000+ already scanned seem to be in the phpBB3 directory. at present there are only 11 posts and 2 users. The program has been running for almost 30 minutes so far. gonna let it continue to see if anything changes. My site is located at
http://www.hrvsttime.org/. Is it possible that it is caught in a loop?
you could try excluding the phpBB3 in the Exclude Patterns select *phpBB3/*
I found smf created lots of links not wanted I used exclude *smf/*
happy computing from systems-edge.co.uk