Pentaho BA Server also scales well, on a quite standard load balancing scheme. Bottom line of data integration scalability is limited to developers ingenuity on data processing compartmentalization so processing parallelization and remote processing become profitable for clustering. One has only to enable the jobs and transformation to take advantage of PDI's clustering abilities, and that might be tricky but easy nonetheless. Pentaho Data Integration scalation is a breeze: just setup the machines, configure the slaves and master and that is it. At its relatively low prices, when sided with comparable competition, the most valuable features are the data integration and the results delivery platform. It all those plugins are not enough, there are means to develop you own plugin either coding in Java (mostly for PDI) or, for the BA Server, with point-and-click ease with Sparkl, a BA Server plugin for easy development and packing of new BA Server plugins (but some need of JavaScript, CSS and HTML is needed.)Īny company is able to design and delivery a deep and embrancing BI strategy with Pentaho. There are two plugins marketplaces, one for PDI and onde for BA Server, with a good supply of diverse features. The suite's plugin architecture deserves a special remark: Both PDI and BA Server are built to be easily extended with plugins. It supports background processing, results bursting by e-mail, load balacing (through native Java Webserver - like Tomcat - load balancing features), integration with corporate directories services as MS Active Directory and LDAP directories, with account management and lots of bell and whistles. It is built on a scalable, auditable platform able to deliver from dashboards and reports to OLAP and custom-made features. Then there is the Pentaho BA Server, built to be the linchpin on BI delivery for enterprises. Lastly, PDI is easier to use and achieves more with less effort than those other products. I have never worked with anything else, like Informatica's PowerCenter or Microsoft's SSIS but I have always taken the opportunity to inquire who has. Not only is it powerful, but it is also easy to use. It's able to scale from a single desktop computer to lots of nodes, on premises or in the cloud. Pentaho Data Integration's (PDI, former Kettle) features and resources are virtually unbeatable as it can handle everything from the smallest Excel files to the most complex and demanding data loads. Pentaho is a suite with five main products: Pentaho Data Integration for ETL, Pentaho Business Analytics Server for results delivery and development clients Report Designer, Metadata Editor and Schema Workbench.