Carnival of Data Quality #1 – Make Data Quality Interesting


This post was written by Vincent McBurney, blogger at ITToolbox and consultant for BearingPoint.

A Carnival is a monthly publication of links to interesting articles on a topic – in this case data quality. It travels to different blog homes each month. We have some great DQ blogs for the first issue and at the bottom of the post we announce the host of issue #2.

Here are the DQ posts for issue #1. Please visit the blogs highlighted and leave comments to show your support.

An Amazon Post that’s not about the Kindle

Daragh O’Brien is busy in Data Quality. Why are there so many DQ gurus in Ireland? He is part of the The International Association for Information and Data Quality (IAIDQ) and writes the entertaining IQ Trainwrecks blog. He has submitted a great article from there called oops Amazon did it again… about how a search for a Spiderman wristwatch on Amazon returns something unexpected. Say hello to your friendly neighborhood Spiderman.

DQ for Business

Stephen Buck brings data quality solutions to small businesses. He got busy in November, he started the Address Quality blog and posted a long and interesting first entry Why do we insist on capturing unvalidated data??? His company DuoShare has interesting business applications – you can use it in QuickBooks and you can open a shop on their website to use the data quality validation on customer shipping addresses.

SOX SUX – compliance time bombs

You cannot talk about data quality for long without talking about Sarbannes-Oxley, and developing a facial tick. A post from Leon on the SOX first network called Thinking and sending about the emails being compliance time bombs when people put in content that shouldn’t be there.

Why Data Entry Errors are Inevitable

I received some submissions to the Carnival that had nothing to do with DQ and had to reject them. I decided this one should be included as it’s perfect for all the data entry folks out there: The Blasted Crossing brings us Uncluttering your mind with tips on focusing and handling distraction.

Death, Taxes and Lost Data

From the ask the CareerCounselor blog comes the post Three Things are Certain: Death, Taxes, and Lost Data which I thought would be about the episode of Star Trek Next Gen where Data triggers an evacuation of the ship and pisses off with the Enterprise, but it turned out to be full of lost data haikus!

Which DQ Leader joined the Carnival?

I didn’t get any posts from data quality leaders Informatica, DataFlux, Business Objects or Oracle. Whoops, did I call Oracle a data quality leader? I did get introduced to Data Governance & Data Quality Insider from Trillium’s Steve Sarsfield with the post Winners and Losers of Data Quality: Nominations. I’m guessing HP is not a Trillium customer because he lays into them! Posts like this and IQ Trainwrecks will expose companies with DQ problems and encourage better governance.

Join the DQ Aggregator

Beth from Confessions of a database geek submitted a few great articles but I had room for just two. In Update: New Information Quality Aggregator she builds an RSS aggregator of notable data quality blogs out there (and there will be some more from Carnival #1), you can put this feed on your own blog sidebar to promote data quality blogs. She also submitted >Stepping up to Responsibility for Data Migration which is a great post for data integration developers with six tips for better data quality in data migration projects.

Open Source DQ

At Darren Hartford’s Developer Blog you have someone who believes in metadata. From his title I can tell it’s a blog for developers by Darren Hartford. In Data Improvement – Addresses Darren avoids CASS and DQ software fees by standardizing US addresses using the open source Pentaho Kettle tool and USPS Publication 28. I wonder whether this code can be used by the KETTLE community.

Data Quality Gobbledygook

And finally we come to my own submission for the Carnival – Data Quality Gobbledygook, a post that has had better feedback then just about any blog post I’ve done in two years! It’s the first blog post that has been forwarded internally in my company to our information management community with a positive review from my boss.

With great feedback and lots of conversations with data quality bloggers this has been a very good month for me. I look forward to visiting the next Carnival…
Announcing Carnival of Data Quality #2 – Happy New Year
The issue #2 Carnival has a home at the IQ Trainwrecks blog for the last Friday of December. Now that issue #1 has been completed you can use the DQ Carnival submission form for blog posts for issue #2.

The suggested theme of issue #2 is Happy New Year!

We encourage posts looking back at 2007 – highlights, lowlights and major DQ events. Posts looking forward to 2008. What are your predictions, resolutions and trends in DQ for 2008? If you are a DQ vendor what product enhancements do you have planned? If you are a customer what do you want to do with your data in 2008?

Since we are reviewing 2007 and looking forward to 2008 we welcome discussions of things that impact the DQ world such as BI and Enterprise 2.0. You can submit your blog post at any time before December 26.

  1. December 5th, 2007 at 04:57 | #1

    Vincent – thanks for organising this Blog Carnival. I just have one question… why do you consider tool vendors to be “DQ Leaders”? Surely the people who should lead data quality/information quality (you say potato, I say pah-tah-to) should be the Information customers… either the end customer of a business or the various ‘internal’ customers?

    Being lead by a vendor in my experience can result in your IQ practices being the ones that the vendor’s tool supports – not necessarily the one that is best suited to your organisation… A real case of “if all you have is a hammer, everything looks like a nail”.

    I look forward to hosting the December Carnival.

  2. January 7th, 2008 at 18:41 | #2

    Thanks for the plug, however I’d like to resist categorizing us as providing solutions only to small business. We do have a solution for QuickBooks, but we also do address validation and presort services for businesses of all sizes including electric and water utilities. Usually the appropriate questions to ask is how many addresses do you have, when do you need them back, what quality do you desire, and how much do you want to pay.

  1. No trackbacks yet.