Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Masakari - the abandoned text viewer
#3
Still didn't try making the QB64 wrapper of DraFF, wanted to clear the heavy fuzzy benchmark first by showcasing the showdown between DraFF & Kazahana - the two exhaustive fuzzy finderesses of mine, glad to share the complete package which will be the base for further functions in my QB64 GUI assistants.

In essence, the included benchmark addresses the problem with finding hard-to-find errors/misspellings which cannot be fixed with mere spell-check wordlists. The fuzzy searches are indispensable when phrases are the targets.

Now, the problematic question solved:
How many unique misspellings of "Sylvester Stallone" are in the English Wikipedia (from 2024-Oct) XML dump?

   

Code: (Select All)
[sanmayce@djudjeto SS_vs_enwiki]$ cat enwiki-20241001-pages-articles.xml.hits.unique
Salvestor Stallone
Silvester Stallone
=Silvester Stalone
Silvester Stalone
Silvestro Stallone
:Silvestr Stallone
Silvestr Stallone]
Sylvester Stallone
Sylvester Stalone
>Sylvester Stalone
[Sylvester Stalone
|Sylvester Stalone
Sylvester Stalone
Sylvester Stalone,
Sylvester Stalone.
Sylvester Stalone<
Sylvester Stalone]
Sylvester Stalone|
Sylvestor Stallone
[sanmayce@djudjeto SS_vs_enwiki]$
In the near future, I envision DraFF becoming part of my QB64PE text-sidekick TriMasakari...


Attached Files
.7z   CPU_Benchmark_Fuzzy-Search-Wikipedia_2025.7z (Size: 88.13 MB / Downloads: 116)
.pdf   Fuzzy-Finding_DraFF_plus_Kazahana_sourcecode.pdf (Size: 15.81 MB / Downloads: 127)
"He learns not to learn and reverts to what the masses pass by."
Reply


Messages In This Thread
RE: Masakari - the abandoned text viewer - by Sanmayce - 01-06-2025, 03:17 AM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Text Effects 2 2112 6 696 10-30-2025, 11:13 PM
Last Post: Unseen Machine
  Text Encryption-Decryption 2112 6 773 10-21-2025, 11:51 AM
Last Post: euklides
  Upside-Down Big Text SierraKen 2 704 02-22-2025, 01:52 AM
Last Post: SierraKen
  Exercise with picture and text Kernelpanic 10 2,392 06-14-2024, 10:00 PM
Last Post: SMcNeill
  Word (text) processor krovit 19 4,523 09-02-2023, 04:38 PM
Last Post: grymmjack

Forum Jump:


Users browsing this thread: 1 Guest(s)