Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Render word docs using Google or Microsoft embedded renderers #186

Open
Mr0grog opened this issue Jan 19, 2018 · 2 comments
Open

Render word docs using Google or Microsoft embedded renderers #186

Mr0grog opened this issue Jan 19, 2018 · 2 comments

Comments

@Mr0grog
Copy link
Member

Mr0grog commented Jan 19, 2018

This is kind of related to #179: like PDF and other non-text file formats, we can’t diff MS Word documents. BUT! Both Google and Microsoft offer iframe-embeddable renderers for Word docs, so we could use that to display the contents of the file, even if we can’t diff it.

Google: https://docs.google.com/gview?url=https://edgi-versionista-archive.s3.amazonaws.com/versionista2/74286-6216580/version-14182260.doc&embedded=true

https://docs.google.com/gview?url={URL here}&embedded=true

Microsoft: https://view.officeapps.live.com/op/embed.aspx?src=https://edgi-versionista-archive.s3.amazonaws.com/versionista2/74286-6216580/version-14182260.doc

https://view.officeapps.live.com/op/embed.aspx?src={URL here}

We should see if these viewers work for Powerpoint and Excel files, too.

And of course we should also see if we can figure out a way to actually diff them, but this is an easy short term solution that’s better than displaying nothing at all.

@Mr0grog
Copy link
Member Author

Mr0grog commented Jan 19, 2018

@Mr0grog
Copy link
Member Author

Mr0grog commented Mar 16, 2018

For this, you’ll probably want to create a new view that renders a word document using one of the above methods. See SandboxedHtml for an example, although this view will hopefully be much simpler.

Then modify RawVersion.render() and SideBySideRawVersions.renderVersion() to use that view based on the media type of the version you are rendering.

Check out ChangeView. mediaTypeForVersion() to see how to determine the media type for a version object. (In the future, we hope have an actual media type field on version objects, but that’s not done yet — see edgi-govdata-archiving/web-monitoring-db#199)

@stale stale bot added stale and removed stale labels Jan 10, 2019
@Mr0grog Mr0grog added this to Icebox in Web Monitoring May 23, 2019
@stale stale bot added the stale label Jul 9, 2019
@stale stale bot closed this as completed Jul 16, 2019
Web Monitoring automation moved this from Icebox to Done! Jul 16, 2019
@Mr0grog Mr0grog reopened this Aug 1, 2019
Web Monitoring automation moved this from Done! to Ready Aug 1, 2019
@edgi-govdata-archiving edgi-govdata-archiving deleted a comment from stale bot Aug 1, 2019
@edgi-govdata-archiving edgi-govdata-archiving deleted a comment from stale bot Aug 1, 2019
@Mr0grog Mr0grog moved this from Ready to Discussion in Web Monitoring Oct 21, 2020
@Mr0grog Mr0grog moved this from Discussion to Icebox in Web Monitoring Oct 21, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Web Monitoring
  
Icebox
Development

No branches or pull requests

1 participant