Masakari - the abandoned text viewer
I still haven't tried making the QB64 wrapper for DraFF. I wanted to clear the heavy fuzzy benchmark first by showcasing the showdown between DraFF and Kazahana, my two exhaustive fuzzy finders. I'm glad to share the complete package, which will be the base for further functions in my QB64 GUI assistants.

In essence, the included benchmark addresses the problem of finding hard-to-find errors and misspellings that mere spell-check wordlists cannot catch. Fuzzy searches are indispensable when the targets are phrases rather than single words.
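To make the idea of fuzzy phrase search concrete, here is a minimal sketch in Python. It is not how DraFF or Kazahana work internally (the post does not describe their algorithms); it is the textbook dynamic-programming approach to approximate substring matching, where a match may start anywhere in the text. The function name `fuzzy_find` and the `max_dist` parameter are made up for this illustration.

```python
def fuzzy_find(pattern: str, text: str, max_dist: int) -> list[tuple[int, int]]:
    """Return (end_index, distance) for every position in text where some
    substring ending there is within max_dist edits of pattern.

    Illustrative sketch only; fuzzy_find and max_dist are hypothetical names.
    """
    m = len(pattern)
    # prev[i] = edit distance between pattern[:i] and the best-matching
    # substring of text that ends at the previous text position.
    prev = list(range(m + 1))
    hits = []
    for j, ch in enumerate(text):
        cur = [0] * (m + 1)  # cur[0] stays 0: a match may start anywhere
        for i in range(1, m + 1):
            cost = 0 if pattern[i - 1] == ch else 1
            cur[i] = min(
                prev[i - 1] + cost,  # match or substitution
                prev[i] + 1,         # ch is an extra character in the text
                cur[i - 1] + 1,      # a pattern character is missing from the text
            )
        if cur[m] <= max_dist:
            hits.append((j + 1, cur[m]))
        prev = cur
    return hits


if __name__ == "__main__":
    line = "starring Sylvester Stalone in"
    for end, dist in fuzzy_find("Sylvester Stallone", line, 2):
        print(end, dist)
```

A wordlist-based spell checker would look at "Sylvester" and "Stalone" separately; the sketch instead scores the whole phrase against every position in the line, which is the point being made above. The cost is proportional to pattern length times text length, so a real tool scanning a full Wikipedia dump would need something considerably faster.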

Now, the tricky question, solved:
How many unique misspellings of "Sylvester Stallone" are in the English Wikipedia XML dump from October 2024?

   

Code:
[sanmayce@djudjeto SS_vs_enwiki]$ cat enwiki-20241001-pages-articles.xml.hits.unique
Salvestor Stallone
Silvester Stallone
=Silvester Stalone
Silvester Stalone
Silvestro Stallone
:Silvestr Stallone
Silvestr Stallone]
Sylvester Stallone
Sylvester Stalone
>Sylvester Stalone
[Sylvester Stalone
|Sylvester Stalone
Sylvester Stalone
Sylvester Stalone,
Sylvester Stalone.
Sylvester Stalone<
Sylvester Stalone]
Sylvester Stalone|
Sylvestor Stallone
[sanmayce@djudjeto SS_vs_enwiki]$
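In the listing above, each hit appears to be a fixed-width slice of the dump, so the same misspelling shows up several times with different neighbouring characters (`[`, `|`, `,` and so on), and the correct spelling is listed as well. If one wanted to count distinct misspellings rather than distinct slices, a post-processing step along these lines could be used. This is only a sketch in Python; the function name `distinct_misspellings` is hypothetical and is not part of the attached package.

```python
import string

TARGET = "Sylvester Stallone"


def distinct_misspellings(hits, target=TARGET):
    """Collapse raw hits into distinct misspellings.

    Strips punctuation and whitespace from both ends of each hit,
    then drops the correct spelling and any empty leftovers.
    """
    edge = string.punctuation + string.whitespace
    cleaned = {h.strip(edge) for h in hits}
    cleaned.discard(target)
    cleaned.discard("")
    return sorted(cleaned)


if __name__ == "__main__":
    # A few lines copied from the listing above, not the full set.
    sample = [
        "Salvestor Stallone",
        "=Silvester Stalone",
        "Silvester Stalone",
        "Silvestr Stallone]",
        "Sylvester Stallone",
        ">Sylvester Stalone",
        "Sylvester Stalone,",
    ]
    for name in distinct_misspellings(sample):
        print(name)
```

Stripping only at the edges keeps the inner space of the phrase intact, so a hit like "Sylvester Stalone," and a hit like "[Sylvester Stalone" both collapse to the same entry.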
In the near future, I envision DraFF becoming part of my QB64PE text-sidekick TriMasakari...


Attached Files
.7z   CPU_Benchmark_Fuzzy-Search-Wikipedia_2025.7z (Size: 88.13 MB / Downloads: 4)
.pdf   Fuzzy-Finding_DraFF_plus_Kazahana_sourcecode.pdf (Size: 15.81 MB / Downloads: 5)
"He learns not to learn and reverts to what the masses pass by."


Messages In This Thread
RE: Masakari - the abandoned text viewer - by Sanmayce - 01-06-2025, 03:17 AM


