Open-source and Commercial Tools for Data Science

Open-source Tools for Data Science

A.  Data Management Tools

1. SQL tools

  •   MySQL,
  • PostgreSQL,
  •  Microsoft SQL

2. NoSQL tools

  • MongoDB,
  • Hadoop,
  •  Ceph,
  • Elasticsearch,
  • CouchDB
  • Apache Casandra

B.  Data Integration and Transformation Tools

1.     Apache Airflow,

2.     Apache Kafka,

3.     Kubeflow,

4.     Apache Nifi,

5.     Spark SQQL

6.     Node RED

 

C.   Data visualization tools

1.     Hue,

2.   Kibana,

3.   Apache Superset

D. Model Deployment

1.     Apache Prediction IO,

2.     Seldon

3.     M leap

4.     TensorFlow services

5.     TensorFlow Lite

E.   Model Monitoring and Assessment tools,

1.     Model DB

2.     Prometheus

IBM Research Trusted AI

3.     AI Fairness Open-Source Tool kit

4.     Adversarial Robustness 360 Toolbox

5.     AI Explainability 360

F.   Code Asset Management Tools

1.     Git

2.     GitHub

3.     GitLab

4.     Bitbucket

G.  Data Asset Management

1.     Apache Atlas

2.     ODPi EGERIA

3.     Kaylo

 

H.   Development Tools

1.     Jupyter

2.     Jupyter Lab

3.     Apache Zeppelin

4.     R Studio

5.     Spyder

I.     Cluster Execution Tools

1.     Apache Spark

2.     Apache Flink

3.     Rise lab Ray

J.     Fully Integrated Tools

1.     KNIME

2.     Orange

Commercial Tools for Data Science

A.  Data management tools

1.     Oracle

2.     Microsoft SQL

3.     IBM DB2

B.  Data Integration and Transformation Tools

1.     Informatica

2.     IBM InfoSphere DataStage

3.     Talend

4.     IBM Watson Studio Desktop

C.   Data Visualization Tools

1.     Microsoft Power BI

2.     Tableau

3.     IBM Cognos Analytics

4.     IBM Watson Studio Desktop

D. Model Building Tools

1.     SPSS

2.     SAS

3.     IBM Watson Studio Desktop (cloud-based tool)

E.   Model Deployment Tool

1.     IBM SPSS Collaboration and Deployment Services

F.     Model Monitoring and Assessment Tool

·        No relevant Commercial Tool for this new discipline therefore you must use one of the open-source software available.

G.  Code Asset Management Tool

1.     Git

2.     GitHub

H. Data Asset Management

1.     Informatica

2.     IBM InfoSphere Information Governance Catalog

I.     Development Environment

1.     IBM Watson Studio Desktop

J.     Fully Integrated Development Environment

1.     Watson Studio with Watson Open Scale

2.     H2O ai

Comments

Popular posts from this blog

ዳታ ሳይንስ ምንድን ነው?

my trip to be a good software engineer.

የውሂብ አይነቶች (Data Types) Continued...