Which are Useful Analysis / Data Mining / Data Science Tools?

Data collection: Java/Scala/Python

Data storage: MongoDB and MySQL, Dynamo, GitHub, RabbitMQ

Indexing: We index with Sphinx and are getting going with ElasticSearch/Lucene

Data modelling and exploratory data analysis: R, Python, Hadoop (with java/python/R) as necessary to do large scale data filtering, aggregation and clustering, SAS

Data visualization and analysis: Spotfire, D3 visualizations, Tableau

Running models: Java, Python

Web content mining: Octoparse, Scrapy, Kimono

Other Tools: SAS / Matlab / Microsoft Excel (All for analysis), OpenCV (image processing), NLTK (Natural Language Processing), Trifacta (Data cleaning & manipulation), Qlikview /Microstrategy / Pentaho / SiSense / Oracle BI / SAP BI etc (Business Intelligence), RapidMiner, SQL, KNIME, Hadoop, Spark, Weka (Data mining), Rattle (GUI for data mining)