SUBSCRIBE

Public Health Ontarioโ€™s CIO diagnoses how messy big data can be

Most CIOs get asked questions about how quickly a new software program will be deployed or what it will cost to outsource parts of their IT infrastructure. For Jim Tom, itโ€™s more like this: How long before we can expect to see a decline in the rates of sexually transmitted infections?

The IT executive at

Jim Tom
Public Health Ontario

Big data has been touted as a way for firms in financial services, retail and other sectors to get a better handle on their customerโ€™s behaviour and, ideally, increase the volume of business they get from them. Tom pointed out that the situation is a lot messier in an organization like Public Health Ontario, an arms-length provincial agency which is mandated to test and assess potential risks across a wide array of problems.

โ€œMostly the health system talks about health conditions. Once you get sick, we kind of know what to do,โ€ he explained. โ€œPublic Health is about input โ€” itโ€™s about getting at prevention, at control, what happens before you get the condition. As your grandmothers all told you, that ounce of prevention thing really is important.โ€

Itโ€™s also a particularly complex task, because health can be adversely affected by so many different factors. These include genetics, environmental conditions, socio-economic conditions such as income and more. The link between cause and effect, Tom said, can be considerably long.

โ€œI would say that (in the case of marketing analytics), thereโ€™s a fairly good idea of what the difference between the signals versus the noise,โ€ he said. โ€œWhen yourโ€™e tracking what a customerโ€™s doing on your Web site, you know what theyโ€™re doing. Thereโ€™s not a lot of noise there. When youโ€™re trying to track what causes someoneโ€™s cancer, thereโ€™s a lot of noise.โ€

That doesnโ€™t mean Public Health Ontario isnโ€™t forging ahead anyway. Tom said the agency recently launched Think right data, not big data

Although proponents of big data sometimes paint a picture of harnessing unstructured information and turning it into vital knowledge, Toms said the path at Public Health is much more back and forth. The agency conducts more than four million tests a year for communicable and infectious diseases, and multiple tests can be applied to each sample.

โ€œThere is an iterative process where we implement data transformations and present summary totals and test cases for verification, then go back and revise. The ETL process is not so straightforward as to say, we have a set of business rules and then we just apply them against a set of transactions,โ€ he said. โ€œIn some cases we had to go all the way back to the lab testing and the raw data and say, โ€˜Whatโ€™s going on here?โ€™โ€

Public Health Ontario is conducting similar analysis in areas like HIV testing. Over time, Tom said the agency may be doing big data work that resembles more of what happens in other sectors, in terms of assessing what marketing techniques work to change behaviours around health.

โ€œThe old days when you could give everyone a penicillin shot are kind of over,โ€ he said.

Tech Jobs

Categories