Our Open Source ETL platform provides an integrated suite of components to help enterprises extract value from their data. The Vanilla ETL platform addresses some of the key challenges in the data ETL value chain and processes. Some of the different modules of the ETL Platform include :
- Vanilla ETL Management : Vanilla BIGateway
- Data Workflow Management : Vanilla Workflow
- Master Data Management : Vanilla Architect
- Scheduler
Vanilla ETL platform has support for SQL relational databases as well as non-SQL databases for Big Data such as Hadoop through the use of connectors.
Vanilla ETL has been integrated and certified with the Hortonworks Data Platform (HDP), Cloudera and Databricks.
Customers have the ultimate flexibility with Vanilla ETL since many of the components are based on open source layers and accessible from one platform.
3 Steps
Why use Vanilla ETL ?
A comprehensive Suite of modules
Vanilla ETL has a range of modules that support data extraction, data transformations jobs and checks, all from one integrated platform.
Reduced Development Cycle
The time from prototype development to production and maintenance is greatly reduced with Vanilla ETL platform’s components which are focused on improving productivity.
Full Data Integration
Vanilla ETL provides the connectors for Hadoop, SQL environments and files systems and tools for transformations between the storage places (ETL, Workflow and Master Data Management).
Vanilla ETL is a Certified Technology Partner
The Vanilla ETL Suite is certified by some of the biggest references of Big Data and alternative data storage technologies players.
Cloudera Certification
The Vanilla ETL platform is Cloudera CDH 5 certified !
Vanilla ETL is built on the power of Hadoop and Big Data platforms to allow integration and use of data in all circumstances.
"With its Apache™ Hadoop® core, Cloudera Enterprise 5 enables customers to store, process and analyze all their data with perfect fidelity, for a variety of different workloads, and manage their business hub making it economically viable and technically feasible" says Tim Stevens , Vice President, Business and Corporate Development at Cloudera .
The Cloudera technology certification program assures users of the platform Vanilla ETL that integration between BI platform Open Source Hadoop components and is fully functional and operative. The Cloudera Certified technologies have been tested and validated, and are able to use supported APIs, while complying with Cloudera development guidelines for integration with Hadoop.
Hortonworks Certification
The Hortonworks Certified Technology Program reviews and certifies technologies for architectural best practices, validated against a comprehensive suite of integration test cases, benchmarked for scale under varied workloads and comprehensively documented.
Hortonworks is a leading commercial vendor of Apache Hadoop, the open source platform for storing, managing and analyzing Big Data. Hortonworks Data Platform, our distribution of Apache Hadoop, provides an open and stable foundation for enterprises and a growing ecosystem to build and deploy Big Data solutions.
Hortonworks is the trusted source for information on Hadoop, and together with the Apache community, Hortonworks is making Hadoop an enterprise data platform. Hortonworks provides unmatched technical support, training and certification programs for enterprises, systems integrators and technology vendors.
Databricks certification
The Databricks company, founders of Spark, reviews and certifies technologies for architectural best practices, validated against a comprehensive suite of integration test cases, benchmarked for scale under varied workloads and comprehensively documented.
Databricks was founded out of the UC Berkeley AMPLab by the creators of Apache Spark. They’ve been working for the past six years on cutting-edge systems to extract value from Big Data. They believe that Big Data is a huge opportunity that is still largely untapped, and they’re working to revolutionize what you can do with it..
Apache Spark is 100% open source. Spark has one of the largest open source communities in Big Data, with over 400 contributors from 100+ organizations. Databricks works closely with the community to maintain this momentum.