Pentaho Brings Business Intelligence to Hadoop

By Paul Shread

Open source BI vendor Pentaho says it has made BI and data integration easier for Hadoop users -- but the improvements aren't available to users of the free community edition of Pentaho.

Open source business intelligence company Pentaho unveiled BI and data integration tools for Hadoop this week, but they aren't available to users of the free community edition of Pentaho.

The new offerings, unveiled at Hadoop World this week, make Hadoop easier to use for companies trying to solve Big Data challenges, Pentaho says. Hadoop's open source distributed application framework holds promise for making sense of the vast amounts of data stored in enterprises, and Pentaho claims it has solved the biggest challenges for Hadoop users by lowering the technical learning curve, reducing the need for specialized staff and supplying the development and deployment applications for data integration and business intelligence that Hadoop has lacked.

Pentaho Data Integration (PDI) for Hadoop offers a zero-programming graphical design environment so organizations can easily manage how data is moved into and out of Hadoop, execute and schedule Hadoop tasks in the context of existing ETL and BI workflows, and design and execute massively scalable ETL jobs in Hadoop using more than 200 out-of-the-box ETL steps.

Pentaho also promises easy integration with cloud deployments in Amazon Elastic MapReduce, Cloudera Distribution for Hadoop (CDH) and Apache Hadoop.

The Pentaho BI Suite for Hadoop includes PDI for Hadoop. Users can run production, operational and batch reports against the full set of data in Hadoop through Hive, Hadoop's data warehouse infrastructure, and can run ad hoc reports against that data with zero knowledge of Hadoop or SQL, the company says. Pentaho also says users can spin off high-performance data marts in minutes for interactive analysis and dashboarding using Pentaho Agile BI.

"Pentaho just lowered the onramp to Big Data analytics by making it easier and more affordable for companies to get up and running with Hadoop," Shawn Rogers, research vice president for business intelligence at analyst firm Enterprise Management Associates, said in a statement. "It's an essential tool set addition for senior level architects and others at larger organizations with Big Data initiatives, or even for a DBA or ETL guy trying to get into Hadoop."

Pentaho describes the new offerings as "a collaborative effort from both Pentaho Corporation and the Pentaho community," and said the tools were put through "an ex