11-12-2023, 05:07 AM
Hi all,
I'm looking for a sorting algorithm (see image).
Thanks in advance for your help.
The sorting algorithm needs to sort the columns so that the most similar ones end up together, as shown in the image above. The image
was made by hand for illustration purposes. The attached file is just a small sample, but the number of columns
will always be 112. The idea is to find around 20 to 25 columns with the highest number of matching *'s.
The output only needs to show column numbers. For example, if 15 columns are found to have 6 *'s in common, then
those 15 columns will be the output: "5,12,22,35,43,50... n".
The file I am processing often contains 500 or more lines, each with 112 columns. This seemed like it would be easy
enough, but the further I got, the more complicated it became. Any info would be greatly appreciated.
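To make the goal concrete, here is a rough Python sketch of one possible approach, not a definitive answer. It assumes each line of the file is a fixed-width string where every character is one column and a `*` marks a hit; the real file layout may differ. The names `column_row_sets` and `greedy_group` are made up for illustration. Finding the truly best group is a hard combinatorial problem, so this uses a greedy heuristic: start from each column in turn and keep adding the column that preserves the most shared `*` rows.

```python
def column_row_sets(lines, mark="*"):
    """Map each 1-based column number to the set of row indices holding `mark`.

    Assumption: one character per column (fixed-width text).
    """
    cols = {}
    for r, line in enumerate(lines):
        for c, ch in enumerate(line.rstrip("\n"), start=1):
            if ch == mark:
                cols.setdefault(c, set()).add(r)
    return cols


def greedy_group(cols, group_size):
    """Greedy heuristic: return (sorted column numbers, number of shared rows).

    Not guaranteed to find the optimal group.
    """
    best_group, best_rows = [], set()
    for seed in cols:
        group, common = [seed], set(cols[seed])
        while len(group) < group_size:
            candidates = [c for c in cols if c not in group]
            if not candidates:
                break
            # Pick the column that keeps the shared-row set as large as possible.
            nxt = max(candidates, key=lambda c: len(common & cols[c]))
            if not common & cols[nxt]:
                break  # no remaining column shares a marked row with the group
            group.append(nxt)
            common &= cols[nxt]
        # Prefer larger groups, then groups with more shared rows.
        if (len(group), len(common)) > (len(best_group), len(best_rows)):
            best_group, best_rows = sorted(group), common
    return best_group, len(best_rows)


if __name__ == "__main__":
    sample = ["**.*", "**.*", "*.**", ".*.*"]  # tiny made-up data, not the real file
    group, shared = greedy_group(column_row_sets(sample), group_size=3)
    print(",".join(map(str, group)), "share", shared, "rows")
```

For the real data, `lines` would come from reading the .txt file and `group_size` would be somewhere between 20 and 25. With 112 columns and around 500 lines, the greedy search should be cheap enough to run once per seed column.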
R1
Sample file: this is a .txt file which only has 25 lines of data.
https://app.box.com/s/38p52cfs6e6bbcbxsuhver7fxeb5dgcs
RSample