Membership is FREE, giving all registered users unlimited access to every Acorn Domains feature, resource, and tool! Optional membership upgrades unlock exclusive benefits like profile signatures with links, banner placements, appearances in the weekly newsletter, and much more - customized to your membership level!

Tool Required

Status
Not open for further replies.

Admin

Administrator
Staff member
Joined
Jun 14, 2004
Posts
11,076
Reaction score
962
Does anyone have a tool / script that can run through a domain list and pull out only the domains that contain dictionary word(s), so "acorndomains" for example it know its 2 words.

Thanks
 
Well the problem is in getting a decent dictionary word list.

I have one that was taken from the pspell library (iirc). It wouldn't be hard to do the match up...
 
Word Lists

Hi,

I came across http://www.openwall.com/wordlists

Designed primarily for the black hat password cracker community but a good source of names to run past Overture and availabilty with .co.uk extension.

I'm not selling my scanned lists LOL !



JohnP
 
I can get word lists, my problem is scanning a list of domains with it

Best solution would be to create a dictionary table and load that with the dictionary words and then scan a domain list

e.g

dictionary {
id not-null integer autoincrement
word varchar 128
primary id
unique 'word'
}

easily done and efficient enough in mysql.

S
 
Im interested in doing this also if I could get my head around it. I already have a script written that matches one word dictionary domains to a dictionary (I think I gave admin a copy of this?)

I may be being slow but wont extracting domains that contain only one dictionary word or multiple dictionary words be pretty difficult? You'll have to move along the string from the second charcter and check each increment against the dictionary to see if theres a match, but how do you know what is the beginning and end of a word?

goshooting.co.uk for example contains 'go, 'gosh', 'shoot', 'shooting', 'tin' and 'hoot?'

Im assuming admin doesnt want to extract anything that just contains a dictionary word such as hellosdgsdh.co.uk, rather just domains that only contain dictionary words with no crap in them.

Im sure some genius will be able to explain?

Grant
 
I'd step through each dictionary word and try and match the domain - starting at the beginning of the domain.

If I got a match, I'd ignore the characters that I'd just matched and start stepping though the dictionary again with the remainder of the domain and see if there was a match, if that match took me to the end of the domain I'd stop, if not I'd add the just matched characters to those to ignore and start stepping though the dictionary again with the remainder.

I can easily write scripts like this in Perl to agreed specifications.

The output could be a csv with fields for things such as number of words, and if single letter, or even two letter words were used.

There could be problems with "words" like "a", any script specification would have to consider how to deal with them.
 
The way I would do it is by using MySQL's built in functionality such as:

WHERE MATCH (domains) AGAINST ('$words' IN BOOLEAN MODE)

"domains" being the database column where your domain list is.

$words' being a list of words, with each word seperated by a space, i.e. "+word1* +word2* +word3*" etc. The + sign indicates to search for the word and the * is for wildcard matching.

For example, you have the following domains in your list: hello.co.uk, house.co.uk and acorndomains.co.uk, if your word list contains the word "acorn" or even "orndo" then it would only match acorndomains.co.uk.

There are loads of other ways to do it but I believe this method to be the most efficient.
 
Domain Research Tool should do what you want, even comes with a dictionary file, but can add own (better one).

Although as Grant says you will end up with some crap as well due to matches typically on small (2 letter?) words.

Dave
 
Last edited:
Does anyone have a tool / script that can run through a domain list and pull out only the domains that contain dictionary word(s), so "acorndomains" for example it know its 2 words.

Thanks

Yes I do - how many domains? I can do this for domains with single words or two words only.

Let me know via PM and we'll come to some arrangement for payment via PayPal.

Cheers,
Paul.
 
Status
Not open for further replies.

The Rule #1

Do not insult any other member. Be polite and do business. Thank you!

Members online

Premium Members

New Threads

Domain Forum Friends

Our Mods' Businesses

*the exceptional businesses of our esteemed moderators
General chit-chat
Help Users
  • No one is chatting at the moment.
      There are no messages in the current room.
      Top Bottom