Skip to content

Latest commit

 

History

History
631 lines (590 loc) · 309 KB

File metadata and controls

631 lines (590 loc) · 309 KB

casestudy: bigdata ecosystem and databases

  • bigdata tools : apache hadoop and apache spark
  • databases : MySQL, SQLAlchemy, InfluxDB, Neo4j, MongoDB, RethinkDB, TinkerPop, PostgreSQL, CouchDB, HBase

goals of this article

  • casestudy to outline all available bigdata and database softwares
  • to explore architectures of bigdata tools(hadoop and spark)

Some papers influenced the birth and growth of Hadoop and big data processing. Some of these are:

Spark


some usefull tags to understand bigdata tools in simple words

| Big Data | Data Science | Data Analysis | Jobs and Careers in Data Science | Data Mining | Big Data Analysis | Big Data Analytics | Hadoop in Big Data | Big Data Companies | Masters in Data Science | Data Analytics | Big Data Careers | Data Scientists | Jobs and Careers in Data Science | Learning Data Science | Becoming a Data Scientist | Data Science Interviews | Facebook Data Science | Hadoop in Big Data | Apache Hadoop YARN | Yarn | Apache Mesos | Kubernetes | Microservices | Docker | DevOps | Containers (virtualization) | CoreOS | MapReduce | Apache Hadoop | Hortonworks | MapR Technologies | Hadoop Cluster | Computer Cluster | Cluster analysis | High Performance Computing | Parallel Computing | Compute Unified Device Architecture (CUDA) | GPU Computation | OpenCL | General-Purpose GPU | Graphics Processing Unit | Hadoop Distributed File System | Flume | Apache Kafka | RabbitMQ | MQ Telemetry Transport (MQTT) | eXtensible Messaging and Presence Protocol (XMPP) | Apache Hive | Apache Impala (incubating) | Apache Sqoop | Apache Pig | Apache Spark | Apache Storm | Stream Processing | Data Streams | Distributed Systems | Apache ZooKeeper | Databricks | HBase | Amazon Elastic MapReduce | Apache Pig | Hadoop Jobs | Hadoop Administration | Jobs and Careers in Data Science | Data Mining | Data Scientists | IBM Big Data and Analytics | Big Data Infrastructure | Data Engineering | Data Science Master's Programs | Business Intelligence | Tableau (product) | Data Visualization | D3.js (JavaScript library) | Business Intelligence Software | Data Warehousing | Amazon Redshift | Google BigQuery | Amazon RDS |Informatica (company) | QlikView | Microsoft Power BI | Business Analytics | Extract, Transform, Load (ETL) | Analytics | Web Analytics | Business Analytics | Google Analytics (product) | Google Analytics API | Predictive Analytics | Predictive Modeling | Mobile Analytics | Mobile App Analytics | Data Science Master's Programs | Masters in Data Science |

database tools

| Database Systems | SQL | MySQL | MySQL Performance | Database Administration | Database Theory | Database Design | Relational Databases | Graph Databases | OrientDB | Database Management Software | Neo4j | Titan (graph database) | Online Social Network Graphs | Facebook Graph API | Titan (graph database) | Oracle (company) | Oracle Database 12C | NoSQL | CouchDB | Cassandra (database) | Memcached | Redis | MS SQL Server (product) | MongoDB | Elasticsearch | Oracle Database | Database Management Software | Databases | PostgreSQL | PL/SQL | Oracle DBA | Couchbase | Riak | SQLAlchemy | InfluxDB | RethinkDB | Database Startups | PostgreSQL | HBase |


Top level projects
ASF logo
Commons
Incubator
Other projects
Attic
Licenses

Ref

big data

databases


Hadoop ref

  1. ^ "Hadoop Releases". apache.org. Apache Software Foundation. Retrieved 2014-12-06. 
  2. ^ a b "Welcome to Apache Hadoop!". hadoop.apache.org. Retrieved 2016-08-25. 
  3. ^ "What is the Hadoop Distributed File System (HDFS)?". ibm.com. IBM. Retrieved 2014-10-30. 
  4. ^ Malak, Michael (2014-09-19). "Data Locality: HPC vs. Hadoop vs. Spark". datascienceassn.org. Data Science Association. Retrieved 2014-10-30. 
  5. ^ "Characterization and Optimization of Memory-Resident MapReduce on HPC Systems" (pdf). IEEE. October 2014. 
  6. ^ "Resource (Apache Hadoop Main 2.5.1 API)". apache.org. Apache Software Foundation. 2014-09-12. Retrieved 2014-09-30. 
  7. ^ Murthy, Arun (2012-08-15). "Apache Hadoop YARN – Concepts and Applications". hortonworks.com. Hortonworks. Retrieved 2014-09-30. 
  8. ^ "Continuuity Raises $10 Million Series A Round to Ignite Big Data Application Development Within the Hadoop Ecosystem". finance.yahoo.com. Marketwired. 2012-11-14. Retrieved 2014-10-30. 
  9. ^ "Hadoop-related projects at". Hadoop.apache.org. Retrieved 2013-10-17. 
  10. ^ Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data. John Wiley & Sons. 2014-12-19. p. 300. ISBN 9781118876220. Retrieved 2015-01-29. 
  11. ^ "[nlpatumd] Adventures with Hadoop and Perl". Mail-archive.com. 2010-05-02. Retrieved 2013-04-05. 
  12. ^ Cutting, Mike; Cafarella, Ben; Lorica, Doug (2016-03-31). "The next 10 years of Apache Hadoop". O'Reilly Media. Retrieved 2017-10-12. 
  13. ^ Ghemawat, Sanjay; Gobioff, Howard; Leung, Shun-Tak. "The Google File System". 
  14. ^ Dean, Jeffrey; Ghemawat, Sanjay. "MapReduce: Simplified Data Processing on Large Clusters". 
  15. ^ Cutting, Doug (28 Jan 2006). "new mailing lists request: hadoop". issues.apache.org. The Lucene PMC has voted to split part of Nutch into a new sub-project named Hadoop 
  16. ^ Vance, Ashlee (2009-03-17). "Hadoop, a Free Software Program, Finds Uses Beyond Search". The New York Times. Archived from the original on August 30, 2011. Retrieved 2010-01-20. 
  17. ^ Cutting, Doug (30 March 2006). "[RESULT] VOTE: add Owen O'Malley as Hadoop committer". hadoop-common-dev (Mailing list). 
  18. ^ "archive.apache.org". 
  19. ^ "Apache Hadoop Project Members". 
  20. ^ "Google Research Publication: The Google File System". Retrieved 2016-03-09. 
  21. ^ "Google Research Publication: MapReduce". Retrieved 2016-03-09. 
  22. ^ "[INFRA-700] new mailing lists request: hadoop – ASF JIRA". Retrieved 2016-03-09. 
  23. ^ "[HADOOP-1] initial import of code from Nutch – ASF JIRA". Retrieved 2016-03-09. 
  24. ^ a b c d e f g h i j k White, Tom (2012). Hadoop: The Definitive Guide (3rd ed.). O'Reilly. ISBN 9781449328917. 
  25. ^ "[NUTCH-197] NullPointerException in TaskRunner if application jar does not have "lib" directory – ASF JIRA". Retrieved 2016-03-09. 
  26. ^ a b "From Spiders to Elephants: The History of Hadoop". Retrieved 2016-03-09. 
  27. ^ "Index of /dist/hadoop/core". Retrieved 2016-03-09. 
  28. ^ a b "Hadoop Summit 2009". Retrieved 2016-03-09. 
  29. ^ "Apache Hadoop Releases". Retrieved 2016-03-09. 
  30. ^ Gates, Alan (2011). Programming Pig. O'Reilly. p. 10. ISBN 978-1-4493-0264-1. 
  31. ^ "Yahoo! Launches World's Largest Hadoop Production Application". hadoopnew – Yahoo. Retrieved 2016-03-09. 
  32. ^ "RE: Hadoop summit / workshop at Yahoo!". Retrieved 2016-03-09. 
  33. ^ http://sortbenchmark.org/YahooHadoop.pdf
  34. ^ "Apache Hadoop Wins Terabyte Sort Benchmark". hadoopnew – Yahoo. Retrieved 2016-03-09. 
  35. ^ "Cloudera". Retrieved 2016-03-09. 
  36. ^ http://sortbenchmark.org/Yahoo2009.pdf
  37. ^ http://www.mollynix.com/images_content/01commdes/hadoopschedulepdf.pdf
  38. ^ a b c d e f g h i j k "Welcome to Apache™ Hadoop®!". Retrieved 2016-03-09. 
  39. ^ "MapR Technologies". Retrieved 2016-03-09. 
  40. ^ "Yahoo! Updates from Hadoop Summit 2010". Think Big Analytics. Retrieved April 25, 2016. Baldeschwieler announced that Yahoo has released a beta test of Hadoop Security, which uses Kerberos for authentication and allows colocation of business sensitive data within the same cluster. 
  41. ^ "Apache HBase – Apache HBase™ Home". Retrieved 2016-03-09. 
  42. ^ "Hadoop Summit 2010 – Agenda is available!". hadoopnew – Yahoo. Retrieved 2016-03-09. 
  43. ^ a b "Hadoop Summit 2010". Retrieved 2016-03-09. 
  44. ^ "Apache Hive TM". Retrieved 2016-03-09. 
  45. ^ "Welcome to Apache Pig!". Retrieved 2016-03-09. 
  46. ^ "Apache ZooKeeper – Home". Retrieved 2016-03-09. 
  47. ^ a b "Reality Check: Contributions to Apache Hadoop — Hortonworks". Retrieved 2016-03-09. 
  48. ^ "Apache Hadoop takes top prize at Media Guardian Innovation Awards". The Guardian. Retrieved 2016-03-09. 
  49. ^ a b Harris, Derrick. "The history of Hadoop: From 4 nodes to the future of data". Gigaom. Retrieved 2016-03-09. 
  50. ^ "Hadoop Summit 2011: June 29th, Santa Clara Convention Center". hadoopnew – Yahoo. Retrieved 2016-03-09. 
  51. ^ "Fifth Annual Hadoop Summit 2012 Kicks Off with Record Attendance – Hortonworks". Retrieved 2016-03-09. 
  52. ^ "Hadoop Summit 2013 Amsterdam – It's A Wrap! – Hortonworks". Retrieved 2016-03-09. 
  53. ^ "Hadoop at Yahoo!: More Than Ever Before". Retrieved 2016-03-09. 
  54. ^ "Hadoop Summit North America 2013 Draws Record Ecosystem Support". Business Wire. Retrieved 2016-03-09. 
  55. ^ "The Apache Software Foundation Announces Apache™ Spark™ as a Top-Level Project : The Apache Software Foundation Blog". Retrieved 2016-03-09. 
  56. ^ "Loved Hadoop Summit Europe 2014 – Hope you did too! – SAP HANA". Retrieved 2016-03-09. 
  57. ^ "Hadoop Summit 2014 – Big Data Keeps Getting Bigger". Pentaho. Retrieved 2016-03-09. 
  58. ^ "Hadoop Summit Europe 2015, 15th–16th April 2015". Lanyrd. Retrieved 2016-03-09. 
  59. ^ Chouraria, Harsh (21 October 2012). "MR2 and YARN Briefly Explained". cloudera.com. Cloudera. Retrieved 23 October 2013. 
  60. ^ "HDFS User Guide". Hadoop.apache.org. Retrieved 2014-09-04. 
  61. ^ "Running Hadoop on Ubuntu Linux System(Multi-Node Cluster)". 
  62. ^ "Running Hadoop on Ubuntu Linux (Single-Node Cluster)". Retrieved 6 June 2013. 
  63. ^ Evans, Chris (Oct 2013). "Big data storage: Hadoop storage basics". computerweekly.com. Computer Weekly. Retrieved 21 June 2016. HDFS is not a file system in the traditional sense and isn't usually directly mounted for a user to view 
  64. ^ deRoos, Dirk. "Managing Files with the Hadoop File System Commands". dummies.com. For Dummies. Retrieved 21 June 2016. 
  65. ^ "HDFS Architecture". Retrieved 1 September 2013. 
  66. ^ a b Pessach, Yaniv (2013). "Distributed Storage" (Distributed Storage: Concepts, Algorithms, and Implementations ed.). Amazon.com 
  67. ^ "Version 2.0 provides for manual failover and they are working on automatic failover:". Hadoop.apache.org. Retrieved 30 July 2013. 
  68. ^ "Improving MapReduce performance through data placement in heterogeneous Hadoop Clusters" (PDF). Eng.auburn.ed. April 2010. 
  69. ^ "Mounting HDFS". Retrieved 2016-08-05. 
  70. ^ Shafer, Jeffrey; Rixner, Scott; Cox, Alan. "The Hadoop Distributed Filesystem: Balancing Portability and Performance" (PDF). Rice University. Retrieved 2016-09-19. 
  71. ^ Mouzakitis, Evan. "How to Collect Hadoop Performance Metrics". Retrieved 2016-10-24. 
  72. ^ "HDFS Users Guide – Rack Awareness". Hadoop.apache.org. Retrieved 2013-10-17. 
  73. ^ "Cloud analytics: Do we really need to reinvent the storage stack?" (PDF). IBM. June 2009. 
  74. ^ "HADOOP-6330: Integrating IBM General Parallel File System implementation of Hadoop Filesystem interface". IBM. 2009-10-23. 
  75. ^ "HADOOP-6704: add support for Parascale filesystem". Parascale. 2010-04-14. 
  76. ^ "HDFS with CloudIQ Storage". Appistry,Inc. 2010-07-06. 
  77. ^ "High Availability Hadoop". HP. 2010-06-09. 
  78. ^ job Archived August 17, 2011, at the Wayback Machine.
  79. ^ "Refactor the scheduler out of the JobTracker". Hadoop Common. Apache Software Foundation. Retrieved 9 June 2012. 
  80. ^ Jones, M. Tim (6 December 2011). "Scheduling in Hadoop". ibm.com. IBM. Retrieved 20 November 2013. 
  81. ^ "Hadoop Fair Scheduler Design Document" (PDF).
  82. spark ref

    1. ^ "Spark Release 2.0.0". MLlib in R: SparkR now offers MLlib APIs [..] Python: PySpark now offers many more MLlib algorithms" 
    2. ^ a b c d Zaharia, Matei; Chowdhury, Mosharaf; Franklin, Michael J.; Shenker, Scott; Stoica, Ion. Spark: Cluster Computing with Working Sets (PDF). USENIX Workshop on Hot Topics in Cloud Computing (HotCloud). 
    3. ^ "Spark 2.2.0 Quick Start". apache.org. 2017-07-11. Retrieved 2017-10-19. we highly recommend you to switch to use Dataset, which has better performance than RDD 
    4. ^ "Spark 2.2.0 deprecation list". apache.org. 2017-07-11. Retrieved 2017-10-10. 
    5. ^ Damji, Jules (2016-07-14). "A Tale of Three Apache Spark APIs: RDDs, DataFrames, and Datasets: When to use them and why". databricks.com. Retrieved 2017-10-19. 
    6. ^ Chambers, Bill (2017-08-10). "11". Spark: The Definitive Guide ("Rough Cut" pre-print ed.). O'Reilly Media. virtually all Spark code you run, where DataFrames or Datasets, compiles down to an RDD 
    7. ^ Zaharia, Matei; Chowdhury, Mosharaf; Das, Tathagata; Dave, Ankur; Ma,, Justin; McCauley, Murphy; J., Michael; Shenker, Scott; Stoica, Ion. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing (PDF). USENIX Symp. Networked Systems Design and Implementation. 
    8. ^ Xin, Reynold; Rosen, Josh; Zaharia, Matei; Franklin, Michael; Shenker, Scott; Stoica, Ion (June 2013). "Shark: SQL and Rich Analytics at Scale" (PDF). 
    9. ^ Harris, Derrick (28 June 2014). "4 reasons why Spark could jolt Hadoop into hyperdrive". Gigaom. 
    10. ^ "Cluster Mode Overview - Spark 1.2.0 Documentation - Cluster Manager Types". apache.org. Apache Foundation. 2014-12-18. Retrieved 2015-01-18. 
    11. ^ Figure showing Spark in relation to other open-source Software projects including Hadoop
    12. ^ MapR ecosystem support matrix
    13. ^ Doan, DuyHai (2014-09-10). "Re: cassandra + spark / pyspark". Cassandra User (Mailing list). Retrieved 2014-11-21. 
    14. ^ https://github.com/dfdx/Spark.jl
    15. ^ https://spark.apache.org/releases/spark-release-1-3-0.html
    16. ^ "Applying the Lambda Architecture with Spark, Kafka, and Cassandra | Pluralsight". www.pluralsight.com. Retrieved 2016-11-20. 
    17. ^ Shapira, Gwen (29 August 2014). "Building Lambda Architecture with Spark Streaming". cloudera.com. Cloudera. Retrieved 17 June 2016. re-use the same aggregates we wrote for our batch application on a real-time data stream 
    18. ^ "Benchmarking Streaming Computation Engines: Storm, Flink and Spark Streaming" (PDF). IEEE. May 2016. 
    19. ^ Kharbanda, Arush (17 March 2015). "Getting Data into Spark Streaming". sigmoid.com. Sigmoid (Sunnyvale, California IT product company). Retrieved 7 July 2016. 
    20. ^ Zaharia, Matei (2016-07-28). "Structured Streaming In Apache Spark: A new high-level API for streaming". databricks.com. Retrieved 2017-10-19. 
    21. ^ Sparks, Evan; Talwalkar, Ameet (2013-08-06). "Spark Meetup: MLbase, Distributed Machine Learning with Spark". slideshare.net. Spark User Meetup, San Francisco, California. Retrieved 10 February 2014. 
    22. ^ "MLlib | Apache Spark". spark.apache.org. Retrieved 2016-01-18. 
    23. ^ Malak, Michael (14 June 2016). "Finding Graph Isomorphisms In GraphX And GraphFrames: Graph Processing vs. Graph Database". slideshare.net. sparksummit.org. Retrieved 11 July 2016. 
    24. ^ Malak, Michael (1 July 2016). Spark GraphX in Action. Manning. p. 89. ISBN 9781617292521. Pregel and its little sibling aggregateMessages() are the cornerstones of graph processing in GraphX. ... algorithms that require more flexibility for the terminating condition have to be implemented using aggregateMessages() 
    25. ^ Malak, Michael (14 June 2016). "Finding Graph Isomorphisms In GraphX And GraphFrames: Graph Processing vs. Graph Database". slideshare.net. sparksummit.org. Retrieved 11 July 2016. 
    26. ^ Malak, Michael (1 July 2016). Spark GraphX in Action. Manning. p. 9. ISBN 9781617292521. Giraph is limited to slow Hadoop Map/Reduce 
    27. ^ Gonzalez, Joseph; Xin, Reynold; Dave, Ankur; Crankshaw, Daniel; Franklin, Michael; Stoica, Ion (Oct 2014). "GraphX: Graph Processing in a Distributed Dataflow Framework" (PDF). 
    28. ^ "The Apache Software Foundation Announces Apache&#8482 Spark&#8482 as a Top-Level Project". apache.org. Apache Software Foundation. 27 February 2014. Retrieved 4 March 2014. 
    29. ^ Spark officially sets a new record in large-scale sorting
    30. ^ Open HUB Spark development activity
    31. ^ "The Apache Software Foundation Announces Apache&#8482 Spark&#8482 as a Top-Level Project". apache.org. Apache Software Foundation. 27 February 2014. Retrieved 4 March 2014. 
    32. ^ "NY gets new bootcamp for data scientists: It's free, but harder to get into than Harvard". Venture Beat. Retrieved 2016-02-21. 
    33. ^ "Spark News". apache.org. Retrieved 2017-03-30. 

    bigdata ref

    1. ^ "The World's Technological Capacity to Store, Communicate, and Compute Information". MartinHilbert.net. Retrieved 13 April 2016. 
    2. ^ boyd, dana; Crawford, Kate (September 21, 2011). "Six Provocations for Big Data". Social Science Research Network: A Decade in Internet Time: Symposium on the Dynamics of the Internet and Society. doi:10.2139/ssrn.1926431. 
    3. ^ a b c d e f g "Data, data everywhere". The Economist. 25 February 2010. Retrieved 9 December 2012. 
    4. ^ "Community cleverness required". Nature. 455 (7209): 1. 4 September 2008. doi:10.1038/455001a. PMID 18769385. 
    5. ^ Reichman, O.J.; Jones, M.B.; Schildhauer, M.P. (2011). "Challenges and Opportunities of Open Data in Ecology". Science. 331 (6018): 703–5. doi:10.1126/science.1197962. PMID 21311007. 
    6. ^ Hellerstein, Joe (9 November 2008). "Parallel Programming in the Age of Big Data". Gigaom Blog. 
    7. ^ Segaran, Toby; Hammerbacher, Jeff (2009). Beautiful Data: The Stories Behind Elegant Data Solutions. O'Reilly Media. p. 257. ISBN 978-0-596-15711-1. 
    8. ^ a b Hilbert, Martin; López, Priscila (2011). "The World's Technological Capacity to Store, Communicate, and Compute Information". Science. 332 (6025): 60–65. doi:10.1126/science.1200970. PMID 21310967. 
    9. ^ "IBM What is big data? – Bringing big data to the enterprise". www.ibm.com. Retrieved 2013-08-26. 
    10. ^ Reinsel, David; Gantz, John; Rydning, John (2017-04-13). "Data Age 2025: The Evolution of Data to Life-Critical" (PDF). seagate.com. Framingham, MA, US: International Data Corporation. Retrieved 2017-11-02. 
    11. ^ Oracle and FSN, "Mastering Big Data: CFO Strategies to Transform Insight into Opportunity", December 2012
    12. ^ Jacobs, A. (6 July 2009). "The Pathologies of Big Data". ACMQueue. 
    13. ^ Magoulas, Roger; Lorica, Ben (February 2009). "Introduction to Big Data". Release 2.0. Sebastopol CA: O'Reilly Media (11). 
    14. ^ John R. Mashey (25 April 1998). "Big Data ... and the Next Wave of InfraStress" (PDF). Slides from invited talk. Usenix. Retrieved 28 September 2016. 
    15. ^ Steve Lohr (1 February 2013). "The Origins of 'Big Data': An Etymological Detective Story". New York Times. Retrieved 28 September 2016. 
    16. ^ a b Snijders, C.; Matzat, U.; Reips, U.-D. (2012). "'Big Data': Big gaps of knowledge in the field of Internet". International Journal of Internet Science. 7: 1–5. 
    17. ^ Dedić, N.; Stanier, C. (2017). "Towards Differentiating Business Intelligence, Big Data, Data Analytics and Knowledge Discovery". 285. Berlin ; Heidelberg: Springer International Publishing. ISSN 1865-1356. OCLC 909580101. 
    18. ^ Everts, Sarah (2016). "Information Overload". Distillations. 2 (2): 26–33. Retrieved 17 February 2017. 
    19. ^ Ibrahim; Targio Hashem, Abaker; Yaqoob, Ibrar; Badrul Anuar, Nor; Mokhtar, Salimah; Gani, Abdullah; Ullah Khan, Samee (2015). "big data" on cloud computing: Review and open research issues". Information Systems. 47: 98–115. doi:10.1016/j.is.2014.07.006. 
    20. ^ Laney, Douglas. "3D Data Management: Controlling Data Volume, Velocity and Variety" (PDF). Gartner. Retrieved 6 February 2001. 
    21. ^ Beyer, Mark. "Gartner Says Solving 'Big Data' Challenge Involves More Than Just Managing Volumes of Data". Gartner. Archived from the original on 10 July 2011. Retrieved 13 July 2011. 
    22. ^ "Gartner IT Glossary > Big Data – From the Gartner IT Glossary: What is Big Data?". Gartner. Archived from the original on 2017-07-18. Retrieved 2017-07-18. Gartner IT Glossary > Big Data From the Gartner IT Glossary: What is Big Data? Big Data is high-volume, high-velocity and/or high-variety information assets that demand cost-effective, innovative forms of information processing that enable enhanced insight, decision making, and process automation. 
    23. ^ De Mauro, Andrea; Greco, Marco; Grimaldi, Michele (2016). "A Formal definition of Big Data based on its essential Features". Library Review. 65: 122–135. doi:10.1108/LR-06-2015-0061. 
    24. ^ "What is Big Data?". Villanova University. 
    25. ^ Grimes, Seth. "Big Data: Avoid 'Wanna V' Confusion". InformationWeek. Retrieved 5 January 2016. 
    26. ^ a b Hilbert, Martin. "Big Data for Development: A Review of Promises and Challenges. Development Policy Review". martinhilbert.net. Retrieved 2015-10-07. 
    27. ^ a b c DT&SC 7-3: What is Big Data?. YouTube. 12 August 2015. 
    28. ^ Mayer-Schönberger, V., & Cukier, K. (2013). Big data: a revolution that will transform how we live, work and think. London: John Murray.
    29. ^ "Digital Technology & Social Change". Canvas.instructure.com. Retrieved 8 October 2017. 
    30. ^ "avec focalisation sur Big Data & Analytique" (PDF). Bigdataparis.com. Retrieved 8 October 2017. 
    31. ^ a b Billings S.A. "Nonlinear System Identification: NARMAX Methods in the Time, Frequency, and Spatio-Temporal Domains". Wiley, 2013
    32. ^ "le Blog ANDSI  » DSI Big Data". Andsi.fr. Retrieved 8 October 2017. }
    33. ^ Les Echos (3 April 2013). "Les Echos – Big Data car Low-Density Data ? La faible densité en information comme facteur discriminant – Archives". Lesechos.fr. Retrieved 8 October 2017. }
    34. ^ Big Data's Fourth V
    35. ^ Wu, D., Liu. X., Hebert, S., Gentzsch, W., Terpenny, J. (2015). Performance Evaluation of Cloud-Based High Performance Computing for Finite Element Analysis. Proceedings of the ASME 2015 International Design Engineering Technical Conference & Computers and Information in Engineering Conference (IDETC/CIE2015), Boston, Massachusetts, U.S.
    36. ^ Wu, D.; Rosen, D.W.; Wang, L.; Schaefer, D. (2015). "Cloud-Based Design and Manufacturing: A New Paradigm in Digital Manufacturing and Design Innovation". Computer-Aided Design. 59 (1): 1–14. doi:10.1016/j.cad.2014.07.006. 
    37. ^ Lee, Jay; Bagheri, Behrad; Kao, Hung-An (2014). "Recent Advances and Trends of Cyber-Physical Systems and Big Data Analytics in Industrial Informatics". IEEE Int. Conference on Industrial Informatics (INDIN) 2014. 
    38. ^ Lee, Jay; Lapira, Edzel; Bagheri, Behrad; Kao, Hung-an. "Recent advances and trends in predictive manufacturing systems in big data environment". Manufacturing Letters. 1 (1): 38–41. doi:10.1016/j.mfglet.2013.09.005. 
    39. ^ "Survey: Biggest Databases Approach 30 Terabytes". Eweek.com. Retrieved 8 October 2017. 
    40. ^ "LexisNexis To Buy Seisint For $775 Million". Washington Post. Retrieved 15 July 2004. 
    41. ^ "LexisNexis Parent Set to Buy ChoicePoint". Washington Post. Retrieved 22 February 2008. 
    42. ^ "Quantcast Opens Exabyte-Ready File System". www.datanami.com. Retrieved 1 October 2012. 
    43. ^ Bertolucci, Jeff "Hadoop: From Experiment To Leading Big Data Platform", "Information Week", 2013. Retrieved on 14 November 2013.
    44. ^ Webster, John. "MapReduce: Simplified Data Processing on Large Clusters", "Search Storage", 2004. Retrieved on 25 March 2013.
    45. ^ "Big Data Solution Offering". MIKE2.0. Retrieved 8 December 2013. 
    46. ^ "Big Data Definition". MIKE2.0. Retrieved 9 March 2013. 
    47. ^ Boja, C; Pocovnicu, A; Bătăgan, L. (2012). "Distributed Parallel Architecture for Big Data". Informatica Economica. 16 (2): 116–127. 
    48. ^ "IMS_CPS — IMS Center". Imscenter.net. Retrieved 16 June 2016. 
    49. ^ "SOLVING KEY BUSINESS CHALLENGES WITH A BIG DATA LAKE" (PDF). Hcltech.com. August 2014. Retrieved 8 October 2017. 
    50. ^ "Method for testing the fault tolerance of MapReduce frameworks" (PDF). Computer Networks. 2015. 
    51. ^ a b Manyika, James; Chui, Michael; Bughin, Jaques; Brown, Brad; Dobbs, Richard; Roxburgh, Charles; Byers, Angela Hung (May 2011). "Big Data: The next frontier for innovation, competition, and productivity". McKinsey Global Institute. Retrieved January 16, 2016. 
    52. ^ "Future Directions in Tensor-Based Computation and Modeling" (PDF). May 2009. 
    53. ^ Lu, Haiping; Plataniotis, K.N.; Venetsanopoulos, A.N. (2011). "A Survey of Multilinear Subspace Learning for Tensor Data" (PDF). Pattern Recognition. 44 (7): 1540–1551. doi:10.1016/j.patcog.2011.01.004. 
    54. ^ Pllana, Sabri; Janciak, Ivan; Brezany, Peter; Wöhrer, Alexander. "A Survey of the State of the Art in Data Mining and Integration Query Languages". 2011 International Conference on Network-Based Information Systems (NBIS 2011). IEEE Computer Society. Retrieved 2 April 2016. 
    55. ^ "Characterization and Optimization of Memory-Resident MapReduce on HPC Systems" (PDF). IEEE. October 2014. 
    56. ^ L’Heureux, A.; Grolinger, K.; Elyamany, H. F.; Capretz, M. A. M. (2017). "Machine Learning With Big Data: Challenges and Approaches". IEEE Access. 5: 7776–7797. doi:10.1109/ACCESS.2017.2696365. ISSN 2169-3536. 
    57. ^ Monash, Curt (30 April 2009). "eBay's two enormous data warehouses". 
      Monash, Curt (6 October 2010). "eBay followup – Greenplum out, Teradata > 10 petabytes, Hadoop has some value, and more". 
    58. ^ "Resources on how Topological Data Analysis is used to analyze big data". Ayasdi. 
    59. ^ CNET News (1 April 2011). "Storage area networks need not apply". 
    60. ^ "How New Analytic Systems will Impact Storage". September 2011. 
    61. ^ "An Error Occurred Setting Your User Cookie". The Information Society. 30: 127–143. doi:10.1080/01972243.2013.873748. 
    62. ^ Rajpurohit, Anmol (11 July 2014). "Interview: Amy Gershkoff, Director of Customer Analytics & Insights, eBay on How to Design Custom In-House BI Tools". KDnuggets. Retrieved 2014-07-14. Dr. Amy Gershkoff: "Generally, I find that off-the-shelf business intelligence tools do not meet the needs of clients who want to derive custom insights from their data. Therefore, for medium-to-large organizations with access to strong technical talent, I usually recommend building custom, in-house solutions." 
    63. ^ "The Government and big data: Use, problems and potential". Computerworld. 21 March 2012. Retrieved 12 September 2016. 
    64. ^ "White Paper: Big Data for Development: Opportunities & Challenges (2012) – United Nations Global Pulse". Unglobalpulse.org. Retrieved 13 April 2016. 
    65. ^ "WEF (World Economic Forum), & Vital Wave Consulting. (2012). Big Data, Big Impact: New Possibilities for International Development". World Economic Forum. Retrieved 24 August 2012. 
    66. ^ a b c d "Big Data for Development: From Information- to Knowledge Societies". SSRN 2205145Freely accessible.  Missing or empty |url= (help)
    67. ^ "Elena Kvochko, Four Ways To talk About Big Data (Information Communication Technologies for Development Series)". worldbank.org. Retrieved 2012-05-30. 
    68. ^ "Daniele Medri: Big Data & Business: An on-going revolution". Statistics Views. 21 October 2013. 
    69. ^ Tobias Knobloch and Julia Manske (11 January 2016). "Responsible use of data". D+C, Development and Cooperation. 
    70. ^ Lee, Jay; Wu, F.; Zhao, W.; Ghaffari, M.; Liao, L (January 2013). "Prognostics and health management design for rotary machinery systems—Reviews, methodology and applications". Mechanical Systems and Signal Processing. 42 (1). 
    71. ^ "Tutorials". PHM Society. Retrieved 27 September 2016. 
    72. ^ "Prognostic and Health Management Technology for MOCVD Equipment". Industrial Technology Research Institute. Retrieved 27 September 2016. 
    73. ^ "Impending Challenges for the Use of Big Data". International Journal of Radiation Oncology*Biology*Physics. doi:10.1016/j.ijrobp.2015.10.060. 
    74. ^ O'Donoghue, John; Herbert, John (1 October 2012). "Data Management Within mHealth Environments: Patient Sensors, Mobile Devices, and Databases". Journal of Data and Information Quality. 4 (1): 5:1–5:20. doi:10.1145/2378016.2378021. Retrieved 16 June 2016 – via ACM Digital Library. 
    75. ^ Mirkes, E.M.; Coats, T.J.; Levesley, J.; Gorban, A.N. (2016). "Handling missing data in large healthcare dataset: A case study of unknown trauma outcomes". Computers in Biology and Medicine. 75: 203–216. doi:10.1016/j.compbiomed.2016.06.004. 
    76. ^ Murdoch, Travis B.; Detsky, Allan S. (2013-04-03). "The Inevitable Application of Big Data to Health Care". JAMA. 309 (13): 1351. doi:10.1001/jama.2013.393. ISSN 0098-7484. 
    77. ^ "Degrees in Big Data: Fad or Fast Track to Career Success". Forbes. Retrieved 2016-02-21. 
    78. ^ "NY gets new boot camp for data scientists: It's free but harder to get into than Harvard". Venture Beat. Retrieved 2016-02-21. 
    79. ^ Couldry, Nick; Turow, Joseph (2014). "Advertising, Big Data, and the Clearance of the Public Realm: Marketers' New Approaches to the Content Subsidy". International Journal of Communication. 8: 1710–1726. 
    80. ^ "Big data and analytics: C4 and Genius Digital". Ibc.org. Retrieved 8 October 2017. 
    81. ^ "QuiO Named Innovation Champion of the Accenture HealthTech Innovation Challenge". Businesswire.com. Retrieved 8 October 2017. 
    82. ^ "A Software Platform for Operational Technology Innovation" (PDF). Predix.com. Retrieved 8 October 2017. 
    83. ^ a b Solnik, Ray. "The Time Has Come: Analytics Delivers for IT Operations". Data Center Journal. Retrieved June 21, 2016. 
    84. ^ Kalil, Tom. "Big Data is a Big Deal". White House. Retrieved 26 September 2012. 
    85. ^ Executive Office of the President (March 2012). "Big Data Across the Federal Government" (PDF). White House. Retrieved 26 September 2012. 
    86. ^ Lampitt, Andrew. "The real story of how big data analytics helped Obama win". Infoworld. Retrieved 31 May 2014. 
    87. ^ Hoover, J. Nicholas. "Government's 10 Most Powerful Supercomputers". Information Week. UBM. Retrieved 26 September 2012. 
    88. ^ Bamford, James (15 March 2012). "The NSA Is Building the Country's Biggest Spy Center (Watch What You Say)". Wired Magazine. Retrieved 2013-03-18. 
    89. ^ "Groundbreaking Ceremony Held for $1.2 Billion Utah Data Center". National Security Agency Central Security Service. Retrieved 2013-03-18. 
    90. ^ Hill, Kashmir. "TBlueprints of NSA's Ridiculously Expensive Data Center in Utah Suggest It Holds Less Info Than Thought". Forbes. Retrieved 2013-10-31. 
    91. ^ "News: Live Mint". Are Indian companies making enough sense of Big Data?. Live Mint. 23 June 2014. Retrieved 2014-11-22. 
    92. ^ "Survey on Big Data Using Data Mining" (PDF). International Journal of Engineering Development and Research. 2015. Retrieved 14 September 2016. 
    93. ^ "Recent advances delivered by Mobile Cloud Computing and Internet of Things for Big Data applications: a survey". International Journal of Network Management. 11 March 2016. Retrieved 14 September 2016. 
    94. ^ Wingfield, Nick (12 March 2013). "Predicting Commutes More Accurately for Would-Be Home Buyers – NYTimes.com". Bits.blogs.nytimes.com. Retrieved 2013-07-21. 
    95. ^ "FICO® Falcon® Fraud Manager". Fico.com. Retrieved 2013-07-21. 
    96. ^ Alexandru, Dan. "Prof" (PDF). cds.cern.ch. CERN. Retrieved 24 March 2015. 
    97. ^ "LHC Brochure, English version. A presentation of the largest and the most powerful particle accelerator in the world, the Large Hadron Collider (LHC), which started up in 2008. Its role, characteristics, technologies, etc. are explained for the general public". CERN-Brochure-2010-006-Eng. LHC Brochure, English version. CERN. Retrieved 20 January 2013. 
    98. ^ "LHC Guide, English version. A collection of facts and figures about the Large Hadron Collider (LHC) in the form of questions and answers". CERN-Brochure-2008-001-Eng. LHC Guide, English version. CERN. Retrieved 20 January 2013. 
    99. ^ Brumfiel, Geoff (19 January 2011). "High-energy physics: Down the petabyte highway". Nature. 469. pp. 282–83. doi:10.1038/469282a. 
    100. ^ "IBM Research - Zurich" (PDF). Zurich.ibm.com. Retrieved 8 October 2017. 
    101. ^ "Future telescope array drives development of Exabyte processing". Ars Technica. Retrieved 15 April 2015. 
    102. ^ "Australia's bid for the Square Kilometre Array – an insider's perspective". The Conversation. 1 February 2012. Retrieved 27 September 2016. 
    103. ^ "Delort P., OECD ICCP Technology Foresight Forum, 2012" (PDF). Oecd.org. Retrieved 8 October 2017. 
    104. ^ "NASA – NASA Goddard Introduces the NASA Center for Climate Simulation". Nasa.gov. Retrieved 13 April 2016. 
    105. ^ Webster, Phil. "Supercomputing the Climate: NASA's Big Data Mission". CSC World. Computer Sciences Corporation. Retrieved 2013-01-18. 
    106. ^ "These six great neuroscience ideas could make the leap from lab to market". The Globe and Mail. 20 November 2014. Retrieved 1 October 2016. 
    107. ^ "DNAstack tackles massive, complex DNA datasets with Google Genomics". Google Cloud Platform. Retrieved 1 October 2016. 
    108. ^ "23andMe – Ancestry". 23andme.com. Retrieved 29 December 2016. 
    109. ^ a b Potenza, Alessandra (13 July 2016). "23andMe wants researchers to use its kits, in a bid to expand its collection of genetic data". The Verge. Retrieved 29 December 2016. 
    110. ^ "This Startup Will Sequence Your DNA, So You Can Contribute To Medical Research". Fast Company. 23 December 2016. Retrieved 29 December 2016. 
    111. ^ Seife, Charles. "23andMe Is Terrifying, but Not for the Reasons the FDA Thinks". Scientific American. Retrieved 29 December 2016. 
    112. ^ Zaleski, Andrew (22 June 2016). "This biotech start-up is betting your genes will yield the next wonder drug". CNBC. Retrieved 29 December 2016. 
    113. ^ Regalado, Antonio. "How 23andMe turned your DNA into a $1 billion drug discovery machine". MIT Technology Review. Retrieved 29 December 2016. 
    114. ^ "23andMe reports jump in requests for data in wake of Pfizer depression study | FierceBiotech". fiercebiotech.com. Retrieved 29 December 2016. 
    115. ^ Admire Moyo. "Data scientists predict Springbok defeat". www.itweb.co.za. Retrieved 12 December 2015. 
    116. ^ Regina Pazvakavambwa. "Predictive analytics, big data transform sports". www.itweb.co.za. Retrieved 12 December 2015. 
    117. ^ Rich Miller. "The Lessons of Moneyball for Big Data Analysis". www.datecenterknowledge.com. Retrieved 12 December 2015. 
    118. ^ Dave Ryan. "Sports: Where Big Data Finally Makes Sense". www.huffingtonpost.com. Retrieved 12 December 2015. 
    119. ^ Frank Bi. "How Formula One Teams Are Using Big Data To Get The Inside Edge". www.forbes.com. Retrieved 12 December 2015. 
    120. ^ Tay, Liz. "Inside eBay's 90PB data warehouse". ITNews. Retrieved 2016-02-12. 
    121. ^ Layton, Julia. "Amazon Technology". Money.howstuffworks.com. Retrieved 2013-03-05. 
    122. ^ "Scaling Facebook to 500 Million Users and Beyond". Facebook.com. Retrieved 2013-07-21. 
    123. ^ "Google Still Doing at Least 1 Trillion Searches Per Year". Search Engine Land. 16 January 2015. Retrieved 15 April 2015. 
    124. ^ Lamb, Charles. "Oracle NoSQL Database Exceeds 1 Million Mixed YCSB Ops/Sec". 
    125. ^ Siwach, Gautam; Esmailpour, Amir (March 2014). Encrypted Search & Cluster Formation in Big Data (PDF). ASEE 2014 Zone I Conference. University of Bridgeport, Bridgeport, Connecticut, US. 
    126. ^ "Obama Administration Unveils "Big Data" Initiative:Announces $200 Million In New R&D Investments" (PDF). The White House. 
    127. ^ "AMPLab at the University of California, Berkeley". Amplab.cs.berkeley.edu. Retrieved 2013-03-05. 
    128. ^ "NSF Leads Federal Efforts in Big Data". National Science Foundation (NSF). 29 March 2012. 
    129. ^ Timothy Hunter; Teodor Moldovan; Matei Zaharia; Justin Ma; Michael Franklin; Pieter Abbeel; Alexandre Bayen (October 2011). Scaling the Mobile Millennium System in the Cloud. 
    130. ^ David Patterson (5 December 2011). "Computer Scientists May Have What It Takes to Help Cure Cancer". The New York Times. 
    131. ^ "Secretary Chu Announces New Institute to Help Scientists Improve Massive Data Set Research on DOE Supercomputers". "energy.gov". 
    132. ^ "Governor Patrick announces new initiative to strengthen Massachusetts' position as a World leader in Big Data". Commonwealth of Massachusetts. 
    133. ^ "Big Data @ CSAIL". Bigdata.csail.mit.edu. 22 February 2013. Retrieved 2013-03-05. 
    134. ^ "Big Data Public Private Forum". Cordis.europa.eu. 1 September 2012. Retrieved 2013-03-05. 
    135. ^ "Alan Turing Institute to be set up to research big data". BBC News. 19 March 2014. Retrieved 2014-03-19. 
    136. ^ "Inspiration day at University of Waterloo, Stratford Campus". betakit.com/. Retrieved 2014-02-28. 
    137. ^ Lee, Jay; Lapira, Edzel; Bagheri, Behrad; Kao, Hung-An (2013). "Recent Advances and Trends in Predictive Manufacturing Systems in Big Data Environment". Manufacturing Letters. 1 (1): 38–41. doi:10.1016/j.mfglet.2013.09.005. 
    138. ^ a b c Reips, Ulf-Dietrich; Matzat, Uwe (2014). "Mining "Big Data" using Big Data Services". International Journal of Internet Science. 1 (1): 1–8. 
    139. ^ Preis, Tobias; Moat,, Helen Susannah; Stanley, H. Eugene; Bishop, Steven R. (2012). "Quantifying the Advantage of Looking Forward". Scientific Reports. 2: 350. doi:10.1038/srep00350. PMC 3320057Freely accessible. PMID 22482034. 
    140. ^ Marks, Paul (5 April 2012). "Online searches for future linked to economic success". New Scientist. Retrieved 9 April 2012. 
    141. ^ Johnston, Casey (6 April 2012). "Google Trends reveals clues about the mentality of richer nations". Ars Technica. Retrieved 9 April 2012. 
    142. ^ Tobias Preis (24 May 2012). "Supplementary Information: The Future Orientation Index is available for download" (PDF). Retrieved 2012-05-24. 
    143. ^ Philip Ball (26 April 2013). "Counting Google searches predicts market movements". Nature. Retrieved 9 August 2013. 
    144. ^ Tobias Preis, Helen Susannah Moat and H. Eugene Stanley (2013). "Quantifying Trading Behavior in Financial Markets Using Google Trends". Scientific Reports. 3: 1684. doi:10.1038/srep01684. PMC 3635219Freely accessible. PMID 23619126. 
    145. ^ Nick Bilton (26 April 2013). "Google Search Terms Can Predict Stock Market, Study Finds". New York Times. Retrieved 9 August 2013. 
    146. ^ Christopher Matthews (26 April 2013). "Trouble With Your Investment Portfolio? Google It!". TIME Magazine. Retrieved 9 August 2013. 
    147. ^ Philip Ball (26 April 2013). "Counting Google searches predicts market movements". Nature. Retrieved 9 August 2013. 
    148. ^ Bernhard Warner (25 April 2013). "'Big Data' Researchers Turn to Google to Beat the Markets". Bloomberg Businessweek. Retrieved 9 August 2013. 
    149. ^ Hamish McRae (28 April 2013). "Hamish McRae: Need a valuable handle on investor sentiment? Google it". The Independent. London. Retrieved 9 August 2013. 
    150. ^ Richard Waters (25 April 2013). "Google search proves to be new word in stock market prediction". Financial Times. Retrieved 9 August 2013. 
    151. ^ David Leinweber (26 April 2013). "Big Data Gets Bigger: Now Google Trends Can Predict The Market". Forbes. Retrieved 9 August 2013. 
    152. ^ Jason Palmer (25 April 2013). "Google searches predict market moves". BBC. Retrieved 9 August 2013. 
    153. ^ E. Sejdić, "Adapt current tools for use with big data," Nature, vol. vol. 507, no. 7492, pp. 306, Mar. 2014.
    154. ^ Stanford. "MMDS. Workshop on Algorithms for Modern Massive Data Sets".
    155. ^ Deepan Palguna; Vikas Joshi; Venkatesan Chakaravarthy; Ravi Kothari & L. V. Subramaniam (2015). Analysis of Sampling Algorithms for Twitter. International Joint Conference on Artificial Intelligence. 
    156. ^ Kimble, C.; Milolidakis, G. (2015). "Big Data and Business Intelligence: Debunking the Myths". Global Business and Organizational Excellence. 35 (1): 23–34. doi:10.1002/joe.21642. 
    157. ^ Chris Anderson (23 June 2008). "The End of Theory: The Data Deluge Makes the Scientific Method Obsolete". WIRED. 
    158. ^ Graham M. (9 March 2012). "Big data and the end of theory?". The Guardian. London. 
    159. ^ "Good Data Won't Guarantee Good Decisions. Harvard Business Review". Shah, Shvetank; Horne, Andrew; Capellá, Jaime;. HBR.org. Retrieved 8 September 2012. 
    160. ^ a b Big Data requires Big Visions for Big Change., Hilbert, M. (2014). London: TEDxUCL, x=independently organized TED talks
    161. ^ Jonathan Rauch (1 April 2002). "Seeing Around Corners". The Atlantic. 
    162. ^ Epstein, J. M., & Axtell, R. L. (1996). Growing Artificial Societies: Social Science from the Bottom Up. A Bradford Book.
    163. ^ "Delort P., Big data in Biosciences, Big Data Paris, 2012" (PDF). Bigdataparis.com. Retrieved 8 October 2017. 
    164. ^ "Next-generation genomics: an integrative approach" (PDF). nature. July 2010. Retrieved 18 October 2016. 
    165. ^ "BIG DATA IN BIOSCIENCES". ResearchGate. October 2015. Retrieved 18 October 2016. 
    166. ^ "Big data: are we making a big mistake?". Financial Times. 28 March 2014. Retrieved 20 October 2016. 
    167. ^ Ohm, Paul. "Don't Build a Database of Ruin". Harvard Business Review. 
    168. ^ Darwin Bond-Graham, Iron Cagebook – The Logical End of Facebook's Patents, Counterpunch.org, 2013.12.03
    169. ^ Darwin Bond-Graham, Inside the Tech industry’s Startup Conference, Counterpunch.org, 2013.09.11
    170. ^ Al-Rodhan, Nayef (2014-09-16). "The Social Contract 2.0: Big Data and the Need to Guarantee Privacy and Civil Liberties – Harvard International Review". Harvard International Review. Retrieved 2017-04-03. 
    171. ^ Barocas, Solon; Nissenbaum, Helen; Lane, Julia; Stodden, Victoria; Bender, Stefan; Nissenbaum, Helen (June 2014). Big Data’s End Run around Anonymity and Consent. Cambridge University Press. pp. 44–75. doi:10.1017/cbo9781107590205.004. ISBN 9781107067356. 
    172. ^ danah boyd (29 April 2010). "Privacy and Publicity in the Context of Big Data". WWW 2010 conference. Retrieved 2011-04-18. 
    173. ^ Jones, MB; Schildhauer, MP; Reichman, OJ; Bowers, S (2006). "The New Bioinformatics: Integrating Ecological Data from the Gene to the Biosphere" (PDF). Annual Review of Ecology, Evolution, and Systematics. 37 (1): 519–544. doi:10.1146/annurev.ecolsys.37.091305.110031. 
    174. ^ a b Boyd, D.; Crawford, K. (2012). "Critical Questions for Big Data". Information, Communication & Society. 15 (5): 662–679. doi:10.1080/1369118X.2012.678878. 
    175. ^ Failure to Launch: From Big Data to Big Decisions, Forte Wares.
    176. ^ a b Gregory Piatetsky (12 August 2014). "Interview: Michael Berthold, KNIME Founder, on Research, Creativity, Big Data, and Privacy, Part 2". KDnuggets. Retrieved 2014-08-13. 
    177. ^ Pelt, Mason. ""Big Data" is an over used buzzword and this Twitter bot proves it". siliconangle.com. SiliconANGLE. Retrieved 4 November 2015. 
    178. ^ a b Harford, Tim (28 March 2014). "Big data: are we making a big mistake?". Financial Times. Financial Times. Retrieved 2014-04-07. 
    179. ^ Ioannidis, J. P. A. (2005). "Why Most Published Research Findings Are False". PLoS Medicine. 2 (8): e124. doi:10.1371/journal.pmed.0020124. PMC 1182327Freely accessible. PMID 16060722. 
    180. ^ Lohr, Steve; Singer, Natasha (2016-11-10). "How Data Failed Us in Calling an Election". The New York Times. ISSN 0362-4331. Retrieved 2016-11-27. 
    181. ^ Markman, Jon. "Big Data And The 2016 Election". Forbes. Retrieved 2016-11-27.