Home > Software > Scientific > Statistics > KNIME
Details
- Package ID
- knime
- Version
- 4.1.2
- Downloads
- 7707
- Website
- http://www.knime.org/knime
Summary
KNIME® Analytics Platform is the leading open solution for data-driven innovation, helping you discover the potential hidden in your data, mine for fresh insights, or predict new futures.
Description
KNIME Analytics Platform
KNIME Analytics Platform is the open source software for creating data science applications and services. Intuitive, open, and continuously integrating new developments, KNIME makes understanding data and designing data science workflows and reusable components accessible to everyone.
Build end to end data science workflows
Create visual workflows with an intuitive, drag and drop style graphical interface, without the need for coding.
Blend tools from different domains with KNIME native nodes in a single workflow, including scripting in R & Python, machine learning, or connectors to Apache Spark.
Choose from over 2000 modules (“nodes”) to build your workflow. Model each step of your analysis, control the flow of data, and ensure your work is always current.
Get up and running quickly. Select one of the hundreds of publicly available example workflows, or use the integrated workflow coach to guide you through building your workflow.
Blend data from any source
Open and combine simple text formats (CSV, PDF, XLS, JSON, XML, etc), unstructured data types (images, documents, networks, molecules, etc), or time series data.
Connect to a host of databases and data warehouses to integrate data from Oracle, Microsoft SQL, Apache Hive, and more. Load Avro, Parquet, or ORC files from HDFS, S3, or Azure.
Access and retrieve data from sources such as Twitter, AWS S3, Google Sheets, and Azure.
Shape your data
Derive statistics, including mean, quantiles, and standard deviation, or apply statistical tests to validate a hypothesis. Integrate dimension reduction, correlation analysis, and more into your workflows.
Aggregate, sort, filter, and join data either on your local machine, in-database, or in distributed big data environments.
Clean data through normalization, data type conversion, and missing value handling. Detect out of range values with outlier and anomaly detection algorithms.
Extract and select features (or construct new ones) to prepare your dataset for machine learning. Manipulate text, apply formulas on numerical data, and apply rules to filter out or mark samples.
Leverage Machine Learning and AI
Build machine learning models for classification, regression, dimension reduction, or clustering, using advanced algorithms including deep learning, tree-based methods, and logistic regression.
Optimize model performance with hyperparameter optimization, boosting, bagging, stacking, or building complex ensembles.
Validate models by applying performance metrics including Accuracy, R², AUC, and ROC. Perform cross validation to guarantee model stability.
Make predictions using validated models directly, or with industry leading PMML, including on Apache Spark.
Discover and share insights
Visualize data with classic (bar chart, scatter plot) as well as advanced charts (parallel coordinates, sunburst, network graph) and customize them to your needs.
Display summary statistics about columns in a KNIME table and filter out anything that’s irrelevant.
Export reports as PDF, Powerpoint, or other formats for presenting results to stakeholders.
Store processed data or analytics results in many common file formats or databases.
Scale execution with demands
Build workflow prototypes to explore various analysis approaches. Inspect and save intermediate results to ensure fast feedback and efficient discovery of new, creative solutions.
Scale workflow performance through in-memory streaming and multi threaded data processing.
Exercise the power of in-database processing or distributed computing on Apache Spark to further increase computation performance.
KNIME License Terms and Conditions
Please see the GPL v3, and the Additional Permissions according to Sec. 7