Skip to content
Snippets Groups Projects
Commit dad676c4 authored by Noordijk, Ben's avatar Noordijk, Ben
Browse files

Fixed duplicate species in ground_truth

parent 728eaa55
No related branches found
No related tags found
1 merge request!3Added data preparation, hyperparameter optimisation, benchmarking code and k-mer library visualisation
......@@ -52,9 +52,9 @@ def main():
'read id',
'species',
'species id'])
# Dirty fix because NZ_DS264585.1 and NZ_DS264586.1 belong to same species.
# Name is different due to scaffold so we have to change the species id
# Dirty fix because some species ID belong to same species
df.loc[df['species id'] == 'NZ_DS264585.1', 'species id'] = 'NZ_DS264586.1'
df.loc[df['species id'] == 'CP000143.2', 'species id'] = 'CP000144.2'
df.to_csv(args.out_path, index=False)
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment