Earlier this year, I previewed some upcoming projects. Today, I can add one to the list (and scratch out one of the “TBD” placeholders on the publications page while I’m at it):
I’m very pleased to have paired up with Brett Goldstein to write a new book. The working title is Making Analytics Work: Case by Case. The book will be a source of practical advice for CxOs, managers, and team leads who have been tasked with building an internal analytics practice.
To do this, we’re going to interview people in key roles, to learn from their experiences. Maybe you’d like to contribute, as well?
Our post on today’s Strata Blog has more details on the book, and how to contact us.
I’ve just released a collection of UDFs (user-defined functions) for Apache Pig.
Please check out charcuterie for details.
Bad Data Handbook co-author Adam Laiacano recently posted a recap of his 2012 and preview of 2013. I thought this was a great way to round up several projects and details, and it made me realize that I’d done a poor job of announcing things here. So, Adam, allow me to borrow your idea and present my own recap-and-preview.
Truth be told, I rarely place any significance on the new year — if something needs to change, better to do it now than wait for some arbitrary date — but several projects wrapped late last year, leaving me some time at year-end to reflect and plot my next course.
2012 was quite a busy year! Here’s a quick roundup of things that were announced elsewhere, but not here:
- new book: shortly after Parallel R landed in late 2011, I started working on a new title. Bad Data Handbook landed late 2012.
- text-mining fun with
@ChicagoCDO and a team of civic-minded data folk. We even paired up on a Strata talk (“Text-mining Your City”) to share what we’d learned.
- speaking engagements, at local meetups and larger conferences
2013 looks like it will be even more fun (and busy):
- (another) new book: shortly after Bad Data Handbook landed (are you seeing a theme here?), I laid the foundation for a book on time series analysis. Joining me in this adventure is none other than noted R expert and
xts author Jeff Ryan.
- more writing: I plan to release a set of short papers that have been sitting in the pipeline. Some of them will pair me up with Ken Gleason, with whom I co-wrote a chapter in Bad Data Handbook.
- software: continuing my themes of text mining and writing tools for data anlysis, I plan to release some utilities for Apache Pig in the near future. Stay tuned for this and other project updates.
- speaking engagements: I’m already lining up some travel for the coming months. Perhaps I’ll visit your town? Time will tell …
- research & collaboration: I’m exploring some new subjects and avenues. Details forthcoming.
What’s most exciting about the future are the things I haven’t mentioned here, because I don’t yet know about them! Drop a line if there’s something I should know about. I’d be especially interested to hear about new opportunities to collaborate, new projects, and talks.
This site has been quiet but there’s a lot going on behind the scenes. Items of note:
- Parallel R has landed! A great thanks to all who made it possible. Happy reading.
- I’m planning some software updates, and forqlift is in the top slot.
- There’s another fun project brewing … more details soon.
As promised, I have an announcement:
It’s a book!
Well, more like, a book-to-be. I’ve signed on with the fine folks at O’Reilly to publish Parallel R. It’s all about giving R, everyone’s preferred open-source data analysis tool, a parallel boost. If you’re doing large-scale work with R, then likely you’ll want to read this book. Especially if you’d like to blend R and Hadoop.
This will not be a solo venture: my partner in crime will be none other than Stephen Weston. Even if you don’t know him by name (and really, you should), there’s a good chance you know his work: he wrote the R packages
Look forward to more announcements over time.