Group:  Microsoft Outlook ยป microsoft.public.outlook.interop
Thread: Am looking for a dictionary of swear words

DotNetBag
.NET Development Newsgroups

HTVi
TV Discussion Newsgroups

Our Hot Pick: Rising Antivirus 2006 - Certified by TUV & Checkmark! Get 10% discount by entering this coupon code: ONDISCOUNT10
Rising Antivirus 2006

Am looking for a dictionary of swear words
"peteZ" <peteZ[ at ]aol.com> 27.01.2006 10:49:39
I have completed my project to develop an anti-spam plugin to Outlook 2003.

It uses VS2005 and VSTO and SQL Express (written in C#) and I must say I am
thrilled with the result. I have been testing on users (like me) who get >
400 spams per day (thus the fake email addy)) and have reduced it to either
1, 2 or 3 or, more frequently no spam each day - it's detected and moved to
configured spam folders. As it gets smarter, I get less spam.

I am using a few new tricks (not Bayesian either) and am getting very
accurate results apart from a few false positves (about 0.5%) which I am not
happy about.

I have developed a word / sentence analysis algorithm that gets smarter with
each email I mark as spam (or the domain as a spam domain) but need to
crystalise a list of "bad words" ie. swear or porno words to better the
algorithm.

Does anyone now of a set of these word types that are freeware ? I can
create my own but it might be a bit of a job to build, but it's quite
possible.

I'd prefer a list that could be loaded into a SQL Server database

thanks if you can help,

- peteZ


Re: Am looking for a dictionary of swear words
"Brian Tillman" <tillman1952[ at ]yahoo.com> 30.01.2006 18:54:56
peteZ <peteZ[ at ]aol.com> wrote:

[Quoted Text]
> I have developed a word / sentence analysis algorithm that gets
> smarter with each email I mark as spam (or the domain as a spam
> domain) but need to crystalise a list of "bad words" ie. swear or
> porno words to better the algorithm.
>
> Does anyone now of a set of these word types that are freeware ?

The alt.comp.freeware newsgroup may be a jumping off point for that.
--
Brian Tillman
Thanks Brian - I'll check it out (nt)
"peteZ" <peteZ[ at ]aol.com> 31.01.2006 08:49:27


Home | Search | Terms | Imprint | Contact
Newsgroups Reader - provided by WiredBox.Net