If I download identifiers.csv from https://opendata.eol.org/dataset/identifiers-csv-gz, the file doesn’t have a csv header line. i.e. it starts with:
20571209,"http://mushroomobserver.org/name/show_name/1",113,5559,"Fungi"
20571277,"",114,4409,"Aceraceae"
20571278,"",114,37290,"Acer"
...
This in turn causes parsing issues with typical csv readers that expect a header in order to expose the fields by name.
Is this expected?
Thanks!
In addition, it appears to have some incorrectly formatted lines, e.g.
45126256,"NAM_lóbulo olfatorio (olfatory lobe)\"",433,1,"Animalia"
Note the \""
at the end of the second field, which throws off some parsers. They end up getting NAM_lóbulo olfatorio (olfatory lobe)\",433,1,Animalia"
as the value for the second field (and nothing for the following fields).
I ended up finding an alternate download location which fixes those issues (details here). But ideally the link above should be fixed as well.
Thanks for reporting, @davidebbo ! The keeper of that data export is investigating. The first word back may be in our general gitter room
1 Like
Great, didn’t know about the room. I’ll follow the discussion there, thanks!
For the record, for anyone who finds this thread later: https://opendata.eol.org/dataset/identifier-map is the up to date link. Sorry I didn’t notice you were trying the old one!
Oh I see. The old link came from OneZoom docs. I opened Point to the correct EoL download by davidebbo · Pull Request #617 · OneZoom/OZtree · GitHub to get it updated to this one.
Thanks, I think we can call this one solved.