Spell Checking Oriented Word Lists (SCOWL)
-Version 2017.01.22
-Sun Jan 22 17:39:13 2017 -0500 [fbc7107]
+Version 2017.08.24
+Thu Aug 24 14:36:19 2017 -0400 [2614b88]
by Kevin Atkinson (kevina@gnu.org)
The SCOWL is a collection of word lists split up in various sizes, and
To give you an idea of what the words in the various sizes look like
here is a sample of 25 random words found only in that size:
-10: argue arguing believe center character clear comparing corrects exact
- extreme get idea irritates kept linking notes observing occurred printers
- regulations remembered scores show signs unite
-
-20: advocates classifying commandment competent culprit cumulative
- differential earning extensions mature obeys optic orientating overloaded
- perception prisoners proofs puzzled restrictions retaining rock sister
- stuffing terrorists unfounded
-
-35: aides ascetic braiding clung conquerors dispassionate edicts equalized
- exposition gardenias glamour godmothers handlebars huffs impudent lunge
- masterful needled paddocks pots raping shouldered snooker sprawled
- tomcats
-
-40: boobed carjackings crapped floozies footsie freethinkers gassiest
- genuflecting geriatric globetrotter innovate jukebox marinaded menorah
- nannies neutralization piously premarital rekindling riverbed stilts
- stonewall swimmer tattletale twerps
-
-50: aglitter amazon blinders boggier cerebra coiffuring discernment flintlock
- interstices japanned katydid lagniappes loganberry lorgnette misdealt
- monograph peripatetic polliwogs radicalism schoolhouse seismic soppier
- suffused trisected wastrel
-
-55: bedsit candida coaxingly completions contextualization cutaways
- functionalism handsaw hardboard hyperinflation hypnotherapy inglenooks
- isotherms jobshare naffer outguns playgroup precariousness remaster
- ruched seltzers straitlaced trainspotting woad yukky
-
-60: basinfuls bastardization blueness charlatanism dater dispassion duper
- glutinously goofball greeters imitable lacewing misspeak nickering
- nonbelligerent noneffective nonindependent premix resize retarder
- southeaster steerable talky tarantellas ungoverned
-
-70: apostrophic bioecology celoms chloropicrin choli diapause dithyramb
- doorsill eluder ergotisms geomagnetically hispid inebriant lobelia
- meatman osteitides overprescribes plausibleness quadroon quincunx
- sacculate tache toxics trophoplasm unenthusiastically
-
-80: angledozers arrogancies beadledoms centesis cryobiologist entrepreneuses
- estafettes floriation forgivingness glucosamines hairlocks hoofprint
- intraventricular keffiyahs keloidal lunateds posttranslational rewinded
- sandspurs seeable sparlings starstone underarmed unmarriageableness
- upgrowing
-
-95: amusively anabrus anglophily atrophous augustin bachelry barbarianize
- coadjacence cothe hemicardia inblowing jardini lindo mallear manx
- morcellement olericulturist oversliding palsier pertinate proctoptosis
- recollectednesses rowdyishly tinta unsplattered
+10: allow apology borrow commented commenting confirming device field film
+ forgotten happen industrial insist kept log present processing register
+ representative seems string style suspected tie tying
+
+20: accuse advisable browsed dialings emphasizing farmers fatuous fighting
+ graduates honesty intentions judged mainframes mechanisms mirrors
+ newspapers partition phoned poison prevalent settings smart spokesman
+ thesis underlying
+
+35: accrued bankrupts beetling broadsides colonel compactest dissolved
+ drunkenly encumbrances engagement erects halved imprisonment miscellany
+ neurons neutralized nobly perking postured raisin ripples sprinklings
+ streaking verdicts wedged
+
+40: blabbermouths boardinghouses boardrooms bopping checklist
+ compartmentalized condos famished formidably interconnecting litigates
+ mecca nudists polygraphing pristine puppeteers quarterfinal rethinks
+ silencers soreness stepchildren toeholds trundle typecasts upstaging
+
+50: attributive barbacoa bondsman caduceus colluded consanguinity enfeeble
+ faker firetraps jocundly kestrel macerating piggishness quadruplicates
+ ragweed reassign rebelliously refract skywriting splodge storied turbot
+ umbels unlearning wanderlust
+
+55: androgyny basques bendiest busking constitutionalism conventioneer coshes
+ creditworthiness determinedly fuckheads garçon goalless groundwater
+ innocuousness millionairess preservationists professionalizes raunchily
+ shebeens shitheads stripy stroppiest superciliously tinplate wholefoods
+
+60: anapestic anodizes asphodel bargainers berkelium broncs carotene daemonic
+ defilers enshrinement instate mintage nonacceptance ovular quarterstaff
+ recharter reconsigning recontaminating reexaminations supersaturation
+ tantalization unharness unharnessed widener wonkiest
+
+70: benignantly berretta bittiness blacklegged breadsticks cinquefoils
+ crossruffs epifocal extralegally frequentative gadgeteers isomorph
+ labeler oppositional phanerogams pleochroism pourable preestablish resiny
+ salpinxes singularizes sprit summarizations technic tracheid
+
+80: amoristic anglify antiterrorists boileries deboshing evidents exenterates
+ formulizations graphitizing hamshackle infallibilist mohican multeities
+ mvule nabobesses panini pheochromocytomata quadruplexed rampick resculpt
+ schmooz scotias staws sterlingness valiantness
+
+95: additionist anchoretism antiepithelial appropre casewood colocephalous
+ flusteration haldu hishes interpervasiveness knutty nonencroachment
+ occluse orthoformic pentateuch plumbership preconflict saumont
+ slopselling tangleproof transbaikal unaccordance uniphase unmixableness
+ woom
And here is a count on the number of words in each spelling category
Size Words Names Running Total %
10 4,426 13 4,439 0.7
- 20 8,124 0 12,563 1.9
- 35 37,258 222 50,043 7.6
- 40 6,853 491 57,387 8.7
- 50 25,224 18,221 100,832 15.3
- 55 6,489 0 107,321 16.3
- 60 14,141 774 122,236 18.6
- 70 35,506 7,912 165,654 25.2
- 80 144,283 33,370 343,307 52.2
- 95 227,674 86,649 657,630 100.0
+ 20 8,125 0 12,564 1.9
+ 35 37,259 222 50,045 7.6
+ 40 6,853 491 57,389 8.7
+ 50 25,225 18,222 100,836 15.3
+ 55 6,489 0 107,325 16.3
+ 60 14,268 779 122,372 18.6
+ 70 35,418 7,911 165,701 25.2
+ 80 144,259 33,372 343,332 52.2
+ 95 227,669 86,649 657,650 100.0
(The "Words" column does not include the name count.)
CHANGES:
+From Version 2017.01.22 to 2017.08.24
+
+ Various new words.
+
From Version 2016.11.20 to 2017.01.22
Various new words.
Add schema and scripts for creating a SQLite database from SCOWL.
Add some utility and library functions using them. This database is
- used by the new web app's (http://app.aspel.net/lookup & create).
+ used by the new web app's (http://app.aspell.net/lookup & create).
Enhance speller/make-hunspell-dict. The biggest improvement is that
it that it now generates several more dictionaries in addition to
Variant Conversion Info (VarCon)
-Version 2016.11.20
+Version 2017.08.24
Copyright 2000-2016 by Kevin Atkinson (kevina@gnu.org) and Benjamin
Titze (btitze@protonmail.ch).
preferred Canadian and British spelling ("B" and "C"). However the
American spelling is sometimes used in Canada (as indicated by "Cv",
where the lowercase "v" indicated a variant form) and the British
-spelling is sometimes used in America (as indicated the the "Av").
+spelling is sometimes used in America (as indicated the "Av").
More generally each tag consists of a spelling category (for example
"A") followed possible by a variant indicator. The spelling
to a "V". For example, if the variant is marked as "also" by
Merriam-Webster, or also if only some dictionaries acknowledge the
existence the variant. "-" is used when the variant is generally not
-listed is the dictionary but I could find some evidence of it use, or
-when it is it marked as as a archaic spelling for the word. The "x"
+listed is the dictionary but I could find some evidence of its use, or
+when it is marked as an archaic spelling for the word. The "x"
is used when the spelling is almost generally considered a
misspelling, and is only included for completeness.
form is used. For example:
A B C: practice / AV Cv: practise | <N>
A Cv: practice / AV B C: practise | <V>
-POS info is always given given in the form "<POS>" and if a definition
-is also given the the POS info is always first. The POS tags used are as
+POS info is always given in the form "<POS>" and if a definition
+is also given the POS info is always first. The POS tags used are as
follows:
<N>: Noun
<V>: Verb
A B C: coloration / B. Cv: colouration
A B C: colorations / B. Cv: colourations
A B C: coloration's / B. Cv: colouration's
- ## OED has coloration as the prefered spelling and discolouration as a
+ ## OED has coloration as the preferred spelling and discolouration as a
## variant for British Engl or some reason
In the notes ODE (not to be confused with OED) stands for Oxford
Dictionary of English, "Ox" is used for any Oxford dictionary, and
CHANGELOG:
+From 2016.11.20 to 2017.08.24
+
+ - Typo fixes thanks to Jakub Wilk
+
From 2016.06.26 to 2016.11.20
- New Australian spelling category thanks to the work of Benjamin