Developing dimensional data warehouses, data marts, and analytical, reporting, and machine-generated-data databases, smaller than 500 terabytes, powered by Infobright.

Musnikow Consulting

    1 Crest Circle
    Lexington, MA 02421

We specialize in helping our clients design and implement dimensional data warehouses and data marts using the Ralph Kimball star schema approach on MySQL databases using Infobright, which includes its query optimizer and Brighthouse storage engine.

We also migrate data warehouse and data mart presentation areas and reporting and analytical databases to Infobright from row-oriented databases, as well as creating Infobright databases of machine-generated data.

We recommend using an agile, iterative approach to development. This provides useful functionality quickly, helps obtain periodic business user feedback, and helps keep the project from going off track.

We follow Ralph Kimball's 25 Best Practices for Big Data.

Our services

  • Proof of concept or pilot of dimensional data warehouse or data mart on Infobright.
  • Migration of data warehouse or data mart presentation area or analytical or reporting database to Infobright from a row-oriented database, such as Oracle, IBM DB2, Microsoft SQL Server, and Sybase ASE.
  • Creation of an Infobright database of machine-generated data.
  • Leading, or assisting in, the design and implementation of a dimensional data warehouse or data mart on Infobright.

Our fees

Our services are available on a contingency fee, fee per hour, fee per service, or customized basis. Click on Fees for more information.

Why a columnar database?

The June 2, 2012 Information Management slideshow said "Column-oriented ... database ... is conducive to performing many selective queries rapidly, a benchmark of big data analytics."

Boris Evelson, VP and Principal Analyst of Forrester Research, wrote May 30, 2011, "fitting BI into a row-oriented RDBMS is often like putting a square peg into a round hole."

David Menninger, VP and Research Director of Ventana Research, wrote March 3, 2011: "Fundamentally, columnar database technology offers two primary benefits increased speed and reduced storage requirements. We repeatedly emphasize the importance of speed to end users."

Doug Henschen reported in InformationWeek, January 27, 2011, "the performance efficiencies of columnar querying are unmatched by any row-oriented database."

The Bloor Research November 2010 white paper "What's Cool about Columns [and how extend their benefits]" concluded, on page 15:

"Columns provide better performance at a lower cost with a smaller footprint: it is difficult to understand why any company seriously interested in query performance would not consider a column-based solution.

Using columns instead of rows means that you get greatly reduced I/O because you only read the columns referenced by the query. This means that you get dramatically improved performance. The use of compression improves I/O rates (and performance) still further."

Seth Grimes reported in InformationWeek, November 16, 2010, "Column-oriented stores can accelerate many analytical queries, dramatically reduce disk I/O, and slash storage requirement via aggressive compression."

Intelligent Enterprise reported February 22, 2010, "Column-store databases are typically faster than conventional, row-oriented databases (such as Oracle, IBM DB2 and Microsoft SQL Server) in analytic applications because they can query specific data attributes in columns -- such as zip codes, product stock numbers or transaction totals -- while skipping all the other data, row-by-row, that might not be relevant to a query."

Information Management Online reported February 15, 2010, "IDC ... predicts five-year horizon will shift data warehousing to columnar...."

Infobright "Column vs Row" 3 minute web cast.

Why Infobright?

Infobright is greatly accelerated by the application of rough set mathematics and patent-pending compression algorithms, which are finely tuned to the column data type and automatically self-adjusted for each 65,536-value data  pack.

We can generally use the Infobright Community Edition, which is available at no cost, for data warehouse and data mart presentation areas and reporting, analytical, and machine-generated-data databases. In addition, we can often use this free edition of Infobright for complete data warehouses and data marts by using the MyISAM storage engine for the staging area.

For clients who prefer a support subscription, faster loading, distributed loading, and/or the ability to use the Brighthouse storage engine for the staging area, we can use the Infobright Enterprise Edition, which is available at an annual subscription cost per terabyte.

For clients who prefer an "integrated hardware/software solution ... [scalable] to 144 terabytes of data in a single appliance node", we can use the Infopliance.

The "Infobright Community Edition Technology White Paper August 2009" download first convinced us that Infobright deserved careful evaluation.

Lee Brown, CTO and co-founder of HasOffers said about Infobright, May 14, 2013, “We were able to deploy the solution in a few weeks and improve query performance 10-fold almost immediately.”

April 2, 2013: "Infobright Customer J. Craig Venter Institute (JCVI) Recognized by Computerworld as a 2013 Computerworld Honors Laureate

Eric Lubow, CTO and co-founder of SimpleReach, said, February 25, 2013, Infobright "delivers fast, ad-hoc query speed and performance; it's easy to use, and affordable -- encompassing everything we were looking for...."

InformationWeek, October 31, 2012, reported, "Sears is consolidating its databases to MySQL, InfoBright, and Teradata--EMC Greenplum, Microsoft SQL Server, and Oracle (including four Exadata boxes) are on their way out....", October 24, 2012, saw 'A "90% saving" for public sector IT with OpenITGov,' including Infobright.

The Wall Street Journal, July 31, 2012, reported, "Loyalty Lane Deploys Infobright for Faster Point of Sale Data Analysis."

In a February 29, 2012 video, Andrei Dunca, co-founder and CTO of LiveRail, discusses how LiveRail achieved the following.

"Near real-time analysis and reporting -- for both scheduled and ad-hoc analytics
Simple administration, requiring no indexes or tuning
25:1 compression rate, resulting in more data stored on fewer servers
Easy integration of Infobright and Hadoop
Lower TCO due to licensing and hardware savings"

Announcement at Mobile World Congress, Barcelona, Spain, February 28, 2012: "8 of Top 10 Telecommunications Service Providers Use Infobright to Capitalize on Big Data"

"Sybase IQ Users: Top 5 Reasons to Switch to Infobright for Analytics" whitepaper.

Doug Henschen wrote in InformationWeek, October 18, 2011, "Infobright Cuts DBA Labor And Query Times."

Doug Henschen reported in InformationWeek, June 23, 2011, Infobright's "Rough Query for Data Mining ... can speed queries by as much as 20 times...."

RotorNews reported June 10, 2011:
"Using IEE ... Bell Helicopter was able to ... speed analytic queries on executive dashboards by up to 500 percent, achieve 10:1 compression for improved resource utilization, and scale up to deliver vital business intelligence to thousands of end users."Also, "98% improvement for SAP data report results."

David Hahn, SVP Product Management of AdSafe Media, said April 26, 2011, "Infobright provided the query performance, scalability and cost-effectiveness we needed to accommodate our tremendous business growth."

Mike Makowski, CTO of MX Force, reported January 25, 2011: "ICE (Infobright Community Edition) ... combines a columnar database with innovative compression and self-tuning capabilities that eliminate the need to create indexes, partition data or do any manual intervention to achieve fast response for queries and reports. ... ICE ... presented us with several key benefits:

  1. Deployment speed...
  2. Affordability...
  3. Simplicity and flexibility...."

Yves de Montcheuil, Vice President of Marketing of Talend, reported November 30, 2010, "Oracle is losing business to Infobright."

The Bloor Research November 2010 white paper "What's Cool about Columns [and how extend their benefits]" concluded, on page 15:
"However, despite all of these comments, the use of columns is not a panacea. In fact, Infobright demonstrates this very clearly by its improvement on the fundamental architecture of column-based relational databases, which will provide better compression, at least in some instances, and improved query performance through extending the columnar paradigm."

The Associated Press reported November 9, 2010, Infobright "Enables Car Seat Manufacturer to Slash Query Response Times and Significantly Reduce Data Administration Requirements."

November 4, 2010.

The "World Leader in Genomic Research, J. Craig Venter Institute, Selects Infobright..." download reports, "Queries run 10 times faster, and the data compression is 17:1."

InfoWorld selected Infobright one of the ten best open source operating platforms and middleware, August 25, 2010. "Infobright Community Edition (ICE) is a self-tuning, columnar database based on MySQL that is easy to use, offers superfast data loading, high levels of data compression, and excellent query performance. In lieu of indexing, Infobright populates a "Knowledge Grid" of metadata at load time to speed query responses."

Retail Technology reported July 28, 2010, "JC Decaux ... advertising agency cuts costs and reporting time by 50%. ... They ... loaded ... 4 million records into Infobright’s solution in less than two minutes from MySQL tables.... Queries which had taken more than 15 minutes using MySQL only took a couple of seconds using Infobright. ... users are very happy because they have the information they need within the timescales and can meet their deadlines."

Intelligent Enterprise Editors' Choice Awards 2010 noted February 11, 2010: "InfoBright developed unique compression and query approaches before taking its analytical database engine open source. InfoBright has gained marketshare and mindshare since releasing the community edition of its MySQL-based DBMS."

Infobright Community Edition (ICE) supports up to 500 database users and can take advantage of 32 cores and 128 GB in a server without using massively parallel processing (MPP) systems,

The 2010 "Migration Guide: MySQL/MyISAM to Infobright" download says, "due to Infobright’s extremely high query performance levels on large volumes of data, one should consider removing all aggregate, reporting and summary tables that may be in the data model as they are unnecessary."

Infobright Community Edition (ICE) "is designed to run high-performance ad hoc, complex queries on very large data sets without the need of manual tuning, data partitioning, or index creation."

Infobright "Understanding Knowledge Nodes" 3 minute web cast.

Open Source Business Consulting summarizes Infobright benefits:

"Just Load and Go

  • Infobright is simple to implement and manage and requires very little administration.
  • Infobright is self-managing.  There is no need to create or manage indexes or partition data.
  • Infobright is compatible with major Business Intelligence tools such as Jaspersoft, Actuate/BIRT, Cognos, Business Objects, Microstrategy, Pentaho and others.

High performance and scalability

  • Infobright loads data extremely fast - up to 280GB/hour.
  • Infobright's columary approach results in fast response times for complex analytic queries.
  • As [your] database [grows], the query and load performance remains constant.
  • Infobright scales up to 50TB of data.

Low Cost

  • The cost of Infobright is very low compared to closed source, proprietary solutions.
  • Using Infobright eliminates the need for complex hardware infrastructure.
  • Infobright runs on low cost, industry standard servers.  A single server can scale to support 50TB of data.
  • Infobright's industry-leading data compression (10:1 up to 40:1) significantly reduces the amount of storage required.
  • Infobright provides lower operational cost due to its ease of management."
Go to Top