-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Data visualisation #40
Comments
Very keen on this. |
It's possible to do simple static diagrams via [R] packages like igraph. A major bonus, however, might be interactive/responsive graphics. I've tested out the networkD3 and chorddiag packages (both of which are based on D3.js run via the R2D3 package). See These should at least be sufficient to adapt for grouping and displaying sets articles based on coauthors, citations or topics. Ideally, eventually would love to use the bundle variant of a chord diagram (example1 or example2, tutorial. Initial tests of network for some covid authors: |
Great!
On Tue, Apr 14, 2020 at 1:30 PM Thomas Shafee ***@***.***> wrote:
It's possible to do simple static diagrams via [R] packages like igraph
<https://igraph.org/r/>.
A major bonus, however, might be interactive/responsive graphics. I've
tested out the networkD3 <https://christophergandrud.github.io/networkD3>
and chorddiag <https://github.com/mattflor/chorddiag> packages (both of
which are based on D3.js <https://d3js.org/> run via the R2D3 package
<https://rstudio.github.io/r2d3/articles/gallery.html>). See
Excellent. This could work for cooccurrences - e.g. in Counties or
diseases. I have just created (but not pushed) the first extraction of
biorxiv700 (695 papers) - due to coding bugs there are only 600.
These should at least be sufficient to adapt for grouping and displaying
sets articles based on coauthors, citations or topics.
Ideally, eventually would love to use the bundle variant of a chord
diagram (example1
***@***.***/hierarchical-edge-bundling> or example2
***@***.***/hierarchical-edge-bundling/2>, tutorial
<https://www.youtube.com/watch?v=ROflkF1CVhI>.
Initial tests of network for some covid authors:
Excellent.
How do you want to receive the data?
P.
[image: image]
… <https://user-images.githubusercontent.com/10216013/79225072-86853c80-7e9f-11ea-91cf-1e2c68071adf.png>
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#40 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAFTCS6JBNLRR3O64CUGH5LRMRJPJANCNFSM4MC73AVA>
.
--
Peter Murray-Rust
Founder ContentMine.org
and
Reader Emeritus in Molecular Informatics
Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK
|
I've now added the visualisation code to wikiPackageTesting.R in the #Visualisations section Data options:
|
Thanks @TS404
Well done. Can you give some screen shots?
That should be possible. Note there are usually many entries in a facet-cell. If you are just looking at bibliographic data we may manage things. There are multiple authors per article. How do we manage that? And what is a "citing" article? we don't have, and won't have , a citation graph.
Don't understand where these edges come from, and what a MisLin or MisNode.
That would be great. Presumably a questioon of getting this accepted by Wikidata-ns , but DanielM put millions of bibliographic references into Wikidata. (HEY! we should be adding QIDs for publications. That would be great!) Note also that I have not got pointers back to Biorxiv working properly. Are they putting preprints into Wikidata? |
Note that we haven't got a simple approach to bibliography. We can do JATS from EPMC . JATS is not always much fun as there can be authorstrings (i.e. all authors run together) and disambiguation (no ORCIDs). |
So it's possible to generate the input |
Data format and storageThe facet cell listing multiple items is fine (essentially I'll aim to turn it into a nested list in [R]). Similarly, ideally there should be a column listing all the authors of a publication (disambiguating to QIDs will be the greatest challenge) but as plaintext strings is fine as a backup. I've checked over at Wikidata's Wikiproject COVID-19 and it seems there are already a few hundred preprints already listed in wikidata, so it shouldn't be too controversial to add all the covid-relevant ones (and eventually others). VisualisationsVisualisations focusing on topics and publications has the clearest immediate public value to show where the main research threads are heading. Visualisations focusing on authors can demonstrate which authors are collaborative (and which are in silos) and in what roles and can help researchers to identify people to watch or contact for collaboration. I like the idea of separating first/middle/last if possible (like this query). I've done a bit more stress-testing of the code for networks of different sizes (e.g. see Anthony Fauci's co-author network below). Next step, I'll start tweaking it to make the nodes=publications and the links=topic_similarity.
|
Ok, so I've managed to get the concomitant co-topic graph working reasonably robustly! In order to make it interactive, I've built a simple shiny app. It works locally fine locally, but the version on shinyapps.io seems to still be having problems (I've left a query on stack overflow). Once I've managed to get it properly working online, next steps for the visualisation:
|
Well done!
I also get "Disconnected from the server" - is that the problem?
(Chrome)
…On Sun, Apr 19, 2020 at 10:54 AM Thomas Shafee ***@***.***> wrote:
Ok, so I've managed to get the concomitant *co-topic* graph working
reasonably robustly!
In order to make it interactive, I've built a simple shiny app. It works
locally fine locally, but the version on shinyapps.io seems to still be
having problems (I've left a query on stack overflow
<https://stackoverflow.com/questions/61301407/immediate-disconnect-from-server-in-shinyapps-local-working-no-errors-reported>
).
- Website: https://ts404.shinyapps.io/topicnetwork
- Code: https://github.com/TS404/TopicNetwork
Once I've managed to get it properly working online, next steps:
1. Take the topics graph from openVirus as the input
2. Present chord diagram as well
3. Improve the click actions (e.g. select node to list publications on
that topic, click multiple nodes to subset?)
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#40 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAFTCS4BN5CML7T6MTUTQLTRNLC5HANCNFSM4MC73AVA>
.
--
Peter Murray-Rust
Founder ContentMine.org
and
Reader Emeritus in Molecular Informatics
Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK
|
Ah... you seem to have got some suggestions on StackOverflow
On Sun, Apr 19, 2020 at 4:19 PM Peter Murray-Rust <
peter.murray.rust@googlemail.com> wrote:
… Well done!
I also get "Disconnected from the server" - is that the problem?
(Chrome)
On Sun, Apr 19, 2020 at 10:54 AM Thomas Shafee ***@***.***>
wrote:
> Ok, so I've managed to get the concomitant *co-topic* graph working
> reasonably robustly!
>
> In order to make it interactive, I've built a simple shiny app. It works
> locally fine locally, but the version on shinyapps.io seems to still be
> having problems (I've left a query on stack overflow
> <https://stackoverflow.com/questions/61301407/immediate-disconnect-from-server-in-shinyapps-local-working-no-errors-reported>
> ).
>
> - Website: https://ts404.shinyapps.io/topicnetwork
> - Code: https://github.com/TS404/TopicNetwork
>
> Once I've managed to get it properly working online, next steps:
>
> 1. Take the topics graph from openVirus as the input
> 2. Present chord diagram as well
> 3. Improve the click actions (e.g. select node to list publications
> on that topic, click multiple nodes to subset?)
>
> —
> You are receiving this because you modified the open/close state.
> Reply to this email directly, view it on GitHub
> <#40 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AAFTCS4BN5CML7T6MTUTQLTRNLC5HANCNFSM4MC73AVA>
> .
>
--
Peter Murray-Rust
Founder ContentMine.org
and
Reader Emeritus in Molecular Informatics
Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK
--
Peter Murray-Rust
Founder ContentMine.org
and
Reader Emeritus in Molecular Informatics
Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK
|
The CJ Yetman comment fixed it! Try https://ts404.shinyapps.io/topicnetwork now! I'll have to test why the fix works to avoid re-introducing it later, but v. useful for now. |
Updates to https://ts404.shinyapps.io/topicnetwork now enable it to report back the list of publications that are about a set of subjects. Currently picked based on checkboxes, but eventually I'd like it to be based on clicking the nodes. |
This is fantastic.
(Small comments. It's somewhat slow computationally. And it's not easy to
read the labels. But it shows new clusters. Excitin g.)
…On Mon, Apr 27, 2020 at 12:02 PM Thomas Shafee ***@***.***> wrote:
Updates to https://ts404.shinyapps.io/topicnetwork now enable it to
report back the list of publications that are about a set of subjects.
Currently picked based on checkboxes, but eventually I'd like it to be
based on clicking the nodes.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<#40 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAFTCS23LCIHOKIOQ6G43RTROVQ6FANCNFSM4MC73AVA>
.
--
Peter Murray-Rust
Founder ContentMine.org
and
Reader Emeritus in Molecular Informatics
Dept. Of Chemistry, University of Cambridge, CB2 1EW, UK
|
I'm a big fan of a good visualisation. I'm going to start thinking about some possible visualisations using [R] and Shinyapps. Any assistance and ideas welcomed on possible individual or combined visualisations for:
Some existing examples:
Obviously networks and multidimensional scaling projections could be useful. Also probably circos plots between themes?
The text was updated successfully, but these errors were encountered: