Skip to content

Progress Beethoven Team #2

@unite-analytics

Description

@unite-analytics

Hello team Beethoven @ICT4SD/st-search-members ,

I am encouraged by seeing that you have learned how to use docker and install local version of spark and elasticsearch to get and index data from the common crawl.

Just with that knowledge you could already build an initial version of the search engine by crawling only necessary domains, and perhaps by excluding pages which do not contain at least a few of the keywords relating to the SDGs. Nice job!

Also, i noticed some people have started to look into Kibana, which you can very quickly use as the front end for searching. If you use Kibana for the elasticsearch indexes you have you can already complete a solution!

Also, if you can communicate the requirement of space and servers to Professor RP, he might be able to help you with resources in Amazon Web Services. Would you please discuss with RP?

I wanted to ask if you could take a little bit of time to share the items listed below on github so other team members could also help:

  • An Elastic Search docker container or a saved index which includes already some of your crawled documents.
  • An image or powerpoint slide on how the search front end could look like

These items above would be very helpful for Professor Roberto Gonzales and Carolina Vasquez in the University of Santiago in Chile who are working on a customized search front-end interface for this project. (They are in this github team. Please feel free to interact with them directly).

Finally, at the United Nations in New York, May 15 and 16, 2017 there will be a conference called the Science Technology and Innovation (STI) Forum. At this event there will be discussions about an online platform for STI. I hope from this project we can have a few prototypes of search engines which can be part of that online platform. See more details: https://sustainabledevelopment.un.org/TFM

I look forward to hearing from your next steps!

Best regards,
Jorge

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions