-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
📕 Documentation: Dictionary.xml and DictionaryDescription.md of: eoAnalysisInstrument (inactive) #15
Comments
searching with dictionaries
verify that
then search:
Results are in
as
Each CTree (PMC document) is searched into |
Simple
will search all the HTML for "mass spectrom" and gives 50 characters either side |
Hello, I am working on how to migrate the article/instrument matches to Wikidata. The xml with the excerpts is fantastic, but my xml processing skills are still incipient. I remember having seen in the sprint a summary table with the PMC IDs in one column and counts for each term in another column. Would you know how I can obtain this summary file? EDIT: Even though I'm still not able to generate the full html table, I could draft some code to migrate to wikidata from the full table. The code is at https://github.com/caffiendFrog/elife2019/tree/master/wikidatamigration One of the pages edited: https://www.wikidata.org/wiki/Q44476657 |
This is wonderful Tiago
If you checkout oil186/
You will find fulldatatables.html which I think is what you want
…On Thu, 12 Sep 2019, 19:12 Tiago Lubiana, ***@***.***> wrote:
Hello,
I am working on how to migrate the article/instrument matches to Wikidata.
The xml with the excerpts is fantastic, but my xml processing skills are
still incipient. I remember having seen in the sprint a summary table with
the PMC IDs in one column and counts for each term in another column.
Would you know how I can obtain this summary file?
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#15?email_source=notifications&email_token=AAFTCS2SVNRIX3PULJHWQK3QJKBCJA5CNFSM4ITTX33KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD6SYUPQ#issuecomment-530942526>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAFTCS42G43ZTTEMIDH5LFDQJKBCJANCNFSM4ITTX33A>
.
|
Tiago,
can you send me your email (by email to peter.murray.rust AT gmail DOT com)
so I can connect you with others.
Manny - meet TIago who is in Sao Paulo. Tiago was part of our eLife sprint
and worked on the Instruments and how you put this data into Wikidata! So
his knowledge will be really valuable for missing Wikidata items.
TIago, Manny is in Brasilia and pulling together the CEVOpen project
management of extracting plants and their oils from the literature
On Thu, Sep 12, 2019 at 7:59 PM Peter Murray-Rust <
peter.murray.rust@googlemail.com> wrote:
… This is wonderful Tiago
If you checkout oil186/
You will find fulldatatables.html which I think is what you want
On Thu, 12 Sep 2019, 19:12 Tiago Lubiana, ***@***.***>
wrote:
> Hello,
>
> I am working on how to migrate the article/instrument matches to Wikidata.
>
> The xml with the excerpts is fantastic, but my xml processing skills are
> still incipient. I remember having seen in the sprint a summary table with
> the PMC IDs in one column and counts for each term in another column.
>
> Would you know how I can obtain this summary file?
>
> —
> You are receiving this because you authored the thread.
> Reply to this email directly, view it on GitHub
> <#15?email_source=notifications&email_token=AAFTCS2SVNRIX3PULJHWQK3QJKBCJA5CNFSM4ITTX33KYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD6SYUPQ#issuecomment-530942526>,
> or mute the thread
> <https://github.com/notifications/unsubscribe-auth/AAFTCS42G43ZTTEMIDH5LFDQJKBCJANCNFSM4ITTX33A>
> .
>
--
Peter Murray-Rust
Founder ContentMine.org
and
Reader Emeritus in Molecular Informatics
Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK
|
Created simple dictionary by hand (can be incremented later)
NOTE:
term
is used for searching (maybe with stemming).NOTE: these are probably not in Wikidata. Also
Clevenger
is not an instrument and should be removed.name
is descriptive.title
attribute ondictionary
must match filenameThe text was updated successfully, but these errors were encountered: