Python HTML JavaScript Tcl Java CSS Other
Clone or download
Permalink
Failed to load latest commit information.
WebContent/js/AjaxFranceLabs/locale Merge pull request #30 from saidobeid/master Jun 22, 2018
bin/common Fix Web job creation simplified UI Aug 13, 2018
datafari-cassandra JVMs heap size adjustment Aug 7, 2018
datafari-core Qf/Pf updated labels Aug 14, 2018
datafari-dependencies Added Cassandra host and port as env variables Aug 1, 2018
datafari-elk Added Kibana memory variable Jul 10, 2018
datafari-git-plugin Removed useless files Jun 28, 2018
datafari-handler Harmonize Solr version in jars and conf files Jul 27, 2018
datafari-jena Datafari 4.1 Jun 21, 2018
datafari-mcf-scripts Added simplified mcf ui Jun 22, 2018
datafari-mcf JVMs heap size adjustment Aug 7, 2018
datafari-realm Datafari 4.1 Jun 21, 2018
datafari-solr Fix exactContent and exactTitle issues Aug 14, 2018
datafari-tika Datafari 4.1 Jun 21, 2018
datafari-tomcat JVMs heap size adjustment Aug 7, 2018
datafari-updateprocessor Fix exactContent and exactTitle issues Aug 14, 2018
datafari-zookeeper Datafari 4.1 Jun 21, 2018
debian7 Update kibana ip in conf file Aug 8, 2018
elk/logs New UI part3 Jun 19, 2017
logs New UI part3 Jun 19, 2017
opensearch Refactor to valid multi module maven project Aug 12, 2016
windows Fix non blocker BackupManifoldCFConnectors exception during first start Mar 11, 2016
.drone.yml update drone config file Aug 2, 2018
.gitignore New UI part6 Jun 19, 2017
.travis.yml Change git depth in travis config file Jul 6, 2018
CHANGES.txt fix postgresql correct version number Jul 13, 2018
Dockerfile update Dockerfile Aug 8, 2018
LICENSE.txt Update LICENSE.txt Jul 29, 2015
LICENSES THIRD-PARTY.txt Added Tika as a module (current v1.14) Nov 24, 2016
README.txt update readme text Jul 13, 2018
pom.xml build by Docker Jul 27, 2018

README.txt

--------------------------- DATAFARI V. 4.1 ------------------------


NOTE: For the changes compared to DATAFARI V4.0.2, please check CHANGES.txt.

Datafari 4.1 is the perfect product for anyone who needs to search and analyze its corporate big data, based on the most advanced open source technologies.
Datafari 4.1 combines the Apache Solr, Cassandra, ManifoldCF products and ELK. It allows its users to search into file shares, cloud shares (dropbox, google drive), databases, but also emails and many more sources.

Available as community and enterprise edition, Datafari is different from the competition :
- Its open source license is not aggressive, as it uses the Apache v2 license: you are free to do whatever you want with it,
you just need to mention that you are using it.
- It combines three renowned Apache projects, namely Cassandra, Solr and ManifoldCF, which gives Datafari a long term vision.
- It leverages ELK the reference stack to analyze unstructured big data

The complete documentation (for users, admins and developers) is available here : https://datafari.atlassian.net/wiki/display/DATAFARI/Datafari

Requirements:

- Debian8 or higher Environment 64 bits (a Docker image is available if you are on Windows environment)
Recommended version is Debian 9
- Processor : 1GHZ and RAM : 8GB
- Ports 8080, 5432, 9200, 5601 are opened
- Debian environment : requires curl, debconf, unzip, sudo, libc6-dev, jq, lsof
- Java JDK 8

How to install and start Datafari :

You can build the Debian installer with the ant script Datafari/debian7/build.xml. You can download Debian installer and Docker image from www.datafari.com.

1) Install Datafari :
# dpkg -i datafari.deb
2) Start Datafari with a non root user:
# cd /opt/datafari/bin
# bash start-datafari.sh
3) Stop Datafari :
# cd /opt/datafari/bin
# bash stop-datafari.sh

- Search UI :
http://localhost:8080/Datafari/

- Admin UI :
http://localhost:8080/Datafari/admin

You can find video tutorials on how to install and start Datafari from the installer (Warning: the videos are for version 1.x) :
- Debian : https://www.youtube.com/watch?v=cekFICeTTTs


If you want to use the jcifs connector in ManifoldCF, download  jcifs-1.3.xx.jar from http://jcifs.samba.org/src/ to DATAFARI_SOURCE_DIR\mcf\mcf_home\connector-lib-proprietary
Then edit the file Datafari/mcf/mcf_home/connectors.xml and uncomment the line :
 <!--repositoryconnector name="Windows shares" class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/-->
 And restart Datafari

You have to configure your Repository connector and job to add documents to Datafari.
You can find a video tutorial on how to index local file share here (Warning: the videos are for version 1.x) :
https://www.youtube.com/watch?v=w0FtsvZO9SI
You can find documentation on how to create connectors and jobs here :
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html

For all the changes please check CHANGES.txt


--------------------------- DATAFARI V. 3.2.1 ------------------------


NOTE: For the major changes compared to DATAFARI V2.2, please check at the bottom of this page.

Datafari 3.2.1 is the perfect product for anyone who needs to search and analyze its corporate big data, based on the most advanced open source technologies.
Datafari 3.2.1 combines the Apache Solr, Cassandra, ManifoldCF products and ELK. It allows its users to search into file shares, cloud shares (dropbox, google drive), databases, but also emails and many more sources.

Available as community and enterprise edition, Datafari is different from the competition :
- Its open source license is not aggressive, as it uses the Apache v2 license: you are free to do whatever you want with it,
you just need to mention that you are using it.
- It combines three renowned Apache projects, namely Cassandra, Solr and ManifoldCF, which gives Datafari a long term vision.
- It leverages ELK the reference stack to analyze unstructured big data

The complete documentation (for users, admins and developers) is available here : https://datafari.atlassian.net/wiki/display/DATAFARI/Datafari

Requirements:

- Debian7 or higher Environment 64 bits (a Docker image is available if you are on Windows environment)
Recommended version is Debian 8
(if you are on Debian 7 you will need to add the testing repo in /etc/apt/sources.list)
- Processor : 1GHZ and RAM : 8GB
- Ports 8080, 5432, 9200, 5601 are opened
- Debian environment : requires curl, debconf, unzip, sudo, libc6-dev, jq, lsof

How to install and start Datafari :

You can build the Debian installer with the ant script Datafari/debian7/build.xml. You can download Debian installer and Docker image from www.datafari.com.

1) Install Datafari :
# dpkg -i datafari.deb
2) Start Datafari with a non root user:
# cd /opt/datafari/bin
# bash start-datafari.sh
3) Stop Datafari :
# cd /opt/datafari/bin
# bash stop-datafari.sh

- Search UI :
http://localhost:8080/Datafari/

- Admin UI :
http://localhost:8080/Datafari/admin

You can find video tutorials on how to install and start Datafari from the installer (Warning: the videos are for version 1.x) :
- Debian : https://www.youtube.com/watch?v=cekFICeTTTs


If you want to use the jcifs connector in ManifoldCF, download  jcifs-1.3.xx.jar from http://jcifs.samba.org/src/ to DATAFARI_SOURCE_DIR\mcf\mcf_home\connector-lib-proprietary
Then edit the file Datafari/mcf/mcf_home/connectors.xml and uncomment the line :
 <!--repositoryconnector name="Windows shares" class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/-->
 And restart Datafari

You have to configure your Repository connector and job to add documents to Datafari.
You can find a video tutorial on how to index local file share here (Warning: the videos are for version 1.x) :
https://www.youtube.com/watch?v=w0FtsvZO9SI
You can find documentation on how to create connectors and jobs here :
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html

Major changes compared to v3.1.0
- Tika updated to version 1.15
- Manifold CF updated to version 2.6
- Cassandra updated to version 3.10
- New UI
- New Advanced Search
- New languages :
  - German
  - Portuguese/Brazilian


--------------------------- DATAFARI V. 3.1.2 ------------------------


NOTE: For the major changes compared to DATAFARI V2.2, please check at the bottom of this page.

Datafari 3.1 is the perfect product for anyone who needs to search and analyze its corporate big data, based on the most advanced open source technologies.
Datafari 3.1 combines the Apache Solr, Cassandra, ManifoldCF products and ELK. It allows its users to search into file shares, cloud shares (dropbox, google drive), databases, but also emails and many more sources.

Available as community and enterprise edition, Datafari is different from the competition :
- Its open source license is not aggressive, as it uses the Apache v2 license: you are free to do whatever you want with it,
you just need to mention that you are using it.
- It combines three renowned Apache projects, namely Cassandra, Solr and ManifoldCF, which gives Datafari a long term vision.
- It leverages ELK the reference stack to analyze unstructured big data

The complete documentation (for users, admins and developers) is available here : https://datafari.atlassian.net/wiki/display/DATAFARI/Datafari

Requirements:

- Debian7 or higher Environment 64 bits (a Docker image is available if you are on Windows environment)
Recommended version is Debian 8
(if you are on Debian 7 you will need to add the testing repo in /etc/apt/sources.list)
- Processor : 1GHZ and RAM : 8GB
- Ports 8080, 5432, 9200, 5601 are opened
- Debian environment : requires curl, debconf, unzip, sudo, libc6-dev, jq, lsof

How to install and start Datafari :

You can build the Debian installer with the ant script Datafari/debian7/build.xml. You can download Debian installer and Docker image from www.datafari.com.

1) Install Datafari :
# dpkg -i datafari.deb
2) Start Datafari with a non root user:
# cd /opt/datafari/bin
# bash start-datafari.sh
3) Stop Datafari :
# cd /opt/datafari/bin
# bash stop-datafari.sh

- Search UI :
http://localhost:8080/Datafari/

- Admin UI :
http://localhost:8080/Datafari/admin

You can find video tutorials on how to install and start Datafari from the installer (Warning: the videos are for version 1.x) :
- Debian : https://www.youtube.com/watch?v=cekFICeTTTs


If you want to use the jcifs connector in ManifoldCF, download  jcifs-1.3.xx.jar from http://jcifs.samba.org/src/ to DATAFARI_SOURCE_DIR\mcf\mcf_home\connector-lib-proprietary
Then edit the file Datafari/mcf/mcf_home/connectors.xml and uncomment the line :
 <!--repositoryconnector name="Windows shares" class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/-->
 And restart Datafari

You have to configure your Repository connector and job to add documents to Datafari.
You can find a video tutorial on how to index local file share here (Warning: the videos are for version 1.x) :
https://www.youtube.com/watch?v=w0FtsvZO9SI
You can find documentation on how to create connectors and jobs here :
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html

To see major changes, see CHANGES.txt







--------------------------- DATAFARI V. 3.0.0 ------------------------


NOTE: For the major changes compared to DATAFARI V2.2, please check at the bottom of this page.

Datafari 3.0 is the perfect product for anyone who needs to search and analyze its corporate big data, based on the most advanced open source technologies.
Datafari 3.0 combines the Apache Solr, Cassandra, ManifoldCF products and ELK. It allows its users to search into file shares, cloud shares (dropbox, google drive), databases, but also emails and many more sources.

Available as community and enterprise edition, Datafari is different from the competition :
- Its open source license is not aggressive, as it uses the Apache v2 license: you are free to do whatever you want with it,
you just need to mention that you are using it.
- It combines three renowned Apache projects, namely Cassandra, Solr and ManifoldCF, which gives Datafari a long term vision.
- It leverages ELK the reference stack to analyze unstructured big data

The complete documentation (for users, admins and developers) is available here : https://datafari.atlassian.net/wiki/display/DATAFARI/Datafari

Requirements:

- Debian7 or higher Environment 64 bits (a Docker image is available if you are on Windows environment)
Recommended version is Debian 8
(if you are on Debian 7 you will need to add the testing repo in /etc/apt/sources.list)
- Processor : 1GHZ and RAM : 8GB
- Ports 8080, 5432, 9200, 5601 are opened
- Debian environment : requires curl, debconf, unzip, sudo, libc6-dev

How to install and start Datafari :

You can build the Debian installer with the ant script Datafari/debian7/build.xml. You can download Debian installer and Docker image from www.datafari.com.

1) Install Datafari :
# dpkg -i datafari.deb
2) Start Datafari with a non root user:
# cd /opt/datafari/bin
# bash start-datafari.sh
3) Stop Datafari :
# cd /opt/datafari/bin
# bash stop-datafari.sh

- Search UI :
http://localhost:8080/Datafari/

- Admin UI :
http://localhost:8080/Datafari/admin

You can find video tutorials on how to install and start Datafari from the installer (Warning: the videos are for version 1.x) :
- Debian : https://www.youtube.com/watch?v=cekFICeTTTs


If you want to use the jcifs connector in ManifoldCF, download  jcifs-1.3.xx.jar from http://jcifs.samba.org/src/ to DATAFARI_SOURCE_DIR\mcf\mcf_home\connector-lib-proprietary
Then edit the file Datafari/mcf/mcf_home/connectors.xml and uncomment the line :
 <!--repositoryconnector name="Windows shares" class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/-->
 And restart Datafari

You have to configure your Repository connector and job to add documents to Datafari.
You can find a video tutorial on how to index local file share here (Warning: the videos are for version 1.x) :
https://www.youtube.com/watch?v=w0FtsvZO9SI
You can find documentation on how to create connectors and jobs here :
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html

Major changes compared to v2.2
- Solrcloud on a single node activated by default
- Added query elevator admin fonctionnality
- Solr updated to version 5.5.1
- Postgres updated to version 9.5.3
- Usage of Tika embedded in MCF instead of Solr







--------------------------- DATAFARI V. 2.2 ------------------------

NOTE: For the major changes compared to DATAFARI V1.x, please check at the bottom of this page.

Datafari is the perfect product for anyone who needs to search within its corporate big data, based on the most advanced open source technologies.
Datafari 2.2 combines the Apache Solr, Cassandra and ManifoldCF products. It allows its users to search into file shares, cloud shares (dropbox, google drive), databases, but also emails and many more sources.

Available as community and enterprise edition, Datafari is different from the competition :
- Its open source license is not aggressive, as it uses the Apache v2 license: you are free to do whatever you want with it,
you just need to mention that you are using it.
- It combines three renowned Apache projects, namely Cassandra, Solr and ManifoldCF, which gives Datafari a long term vision.

Pre-Requirements:

- Debian Environment 64 bits (a Docker image is available if you are on Windows environment)
- Processor : 1GHZ and RAM : 2GB
- Ports 8080 and 5432 are opened
- Debian environment : requires curl, debconf, unzip, sudo, libc6-dev

How to install and start Datafari :

You can build the Debian installer with the ant script Datafari/debian7/build.xml. You can download Debian installer and Docker image from www.datafari.com.

1) Install Datafari :
# dpkg -i datafari.deb
2) Start Datafari with a non root user:
# cd /opt/datafari/bin
# bash start-datafari.sh
3) Stop Datafari :
# cd /opt/datafari/bin
# bash stop-datafari.sh

- Search UI :
http://localhost:8080/Datafari/

- Admin UI :
http://localhost:8080/Datafari/admin

You can find video tutorials on how to install and start Datafari from the installer :
- Debian : https://www.youtube.com/watch?v=cekFICeTTTs


If you want to use the jcifs connector in ManifoldCF, download  jcifs-1.3.xx.jar from http://jcifs.samba.org/src/ to DATAFARI_SOURCE_DIR\mcf\mcf_home\connector-lib-proprietary
Then edit the file Datafari/mcf/mcf_home/connectors.xml and uncomment the line :
 <!--repositoryconnector name="Windows shares" class="org.apache.manifoldcf.crawler.connectors.sharedrive.SharedDriveConnector"/-->
 And restart Datafari

You have to configure your Repository connector and job to add documents to Datafari.
You can find a video tutorial on how to index local file share here :
https://www.youtube.com/watch?v=w0FtsvZO9SI
You can find documentation on how to create connectors and jobs here :
http://manifoldcf.apache.org/release/trunk/en_US/end-user-documentation.html

Major changes compared to v1.0
- Integration of Apache Cassandra
- Proper user management including an admin UI
- Complete overhaul of the admin UI, using the great Devoops v2 template.
- Complete overhaul of the Ajaxfrancelabs search UI, with new widgets and a cool responsive design
- Migration to Apache Solr 5
- Admin UI to configure connection to an Active Directory
- Admin UI to manage promolinks
- Admin UI to boost Solr fields at search time
- Admin UI to configure the autocomplete
- Admin UI to configure the synonyms
- Migration of JDK to JVM version 1.8 u66
- Restructuring of the configuration files to facilitate update processes
- Bugfix for the alerts feature
- Added unit testing
- Added SKOS and OWL ontologies support through Apache Jena 3.0.1

Enjoy :-)