Pacharapol Withayasakpunt
Tue 11 August 2020

AI should help early in digitization of public health (e.g. ICD-10)

I have been thinking about what health statisticians would do with ICD-10 and related health data. I have seen many talks on speech recognition and on digitizing patients' health documents, but I think we can go much further than that.

This is partly because I have heard about OpenAI's GPT-3.

Giving GPT-3 a Turing Test

I’ve been playing around with OpenAI’s new GPT-3 language model. When I got beta access, the first thing I wondered was, how human is GPT-3? How close is it to ...

The current system is not ideal

Currently, as a physician in a small hospital in Thailand, I am using HosXP, and there are several problems:

  • Diagnoses with ICD-10 in outpatients are not accurate and sometimes misleading
    • I am not sure it is always possible to differentiate between new and old patients. It is quite accurate for some diseases (e.g. MI, stroke), but not all (e.g. TB: which compartment of the SIR model does the patient belong to?).
  • Health care workers (HCW) might need both attention and training to use the program accurately, but can't that be easily fixed with AI?
  • Not to mention that HosXP is usually detached from several other systems, e.g. Synapse (X-ray) or INFINITT (X-ray)
  • What about the possibility of Patient Health Record syncing, from private hospitals (including paper files)?
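
To make the "new vs. old patient" problem concrete, here is a minimal sketch in Python. It is only an illustration under my own assumptions, not how HosXP works: the function name `is_new_episode`, the `(date, code)` visit format, and the 365-day lookback window are all hypothetical, and a real rule would differ per disease.

```python
from datetime import date, timedelta


def is_new_episode(prior_visits, code, visit_date, lookback_days=365):
    """Return True if `code` was NOT recorded for this patient
    within `lookback_days` before `visit_date`.

    prior_visits: list of (date, icd10_code) tuples (hypothetical format).
    lookback_days: assumed window; in practice it should depend on the disease.
    """
    window_start = visit_date - timedelta(days=lookback_days)
    return not any(
        c == code and window_start <= d < visit_date
        for d, c in prior_visits
    )


# Made-up history for one patient
history = [
    (date(2020, 1, 10), "A15.0"),  # TB code recorded 7 months ago
    (date(2019, 3, 2), "I21.9"),   # MI code recorded more than a year ago
]

today = date(2020, 8, 11)
tb_is_new = is_new_episode(history, "A15.0", today)
mi_is_new = is_new_episode(history, "I21.9", today)
print(tb_is_new, mi_is_new)
```

A fixed window like this is probably reasonable for an acute event such as MI, but it says nothing about whether a TB patient is still on treatment, cured, or relapsed. That is exactly the kind of information that is missing if only the ICD-10 code is stored.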

AI can help with more data collection, and beyond ICD-10

  • Not all data has to be keyed in; some can come from analysis. This should be done early on, so that input data that is inaccurate or insufficient can be corrected at the source. (Garbage in, garbage out.)
  • Also, data cleaning is not fun.
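
As a small example of catching bad input early, a diagnosis code could be checked at the moment it is typed, rather than during data cleaning months later. The sketch below is an assumption of mine, not an existing HosXP feature: `check_icd10_format` is a hypothetical helper, and the pattern only checks the general shape of a code (one letter, two digits, optionally a dot and one or two digits). It does not check that the code actually exists in ICD-10.

```python
import re

# Simplified shape of an ICD-10 code: letter, two digits,
# then optionally a dot followed by one or two digits.
# This is an assumed pattern for illustration, not an official definition.
ICD10_PATTERN = re.compile(r"[A-Z][0-9]{2}(\.[0-9]{1,2})?")


def check_icd10_format(raw):
    """Return (normalized_code, None) if the input looks like an ICD-10 code,
    otherwise (None, error_message) so the user can fix it immediately."""
    code = raw.strip().upper()
    if ICD10_PATTERN.fullmatch(code):
        return code, None
    return None, f"'{raw}' does not look like an ICD-10 code"


ok = check_icd10_format(" i21.9 ")
bad = check_icd10_format("21.9")
print(ok)
print(bad)
```

A format check is the cheapest possible guard. Checking whether the code is plausible for the patient (age, sex, other findings) is where analysis or AI would have to come in.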

Warning systems and computer-guided recommendations can only be made if the computer can make sense of the data


  • I don't think handwriting recognition is reliable enough. It should always be verified by a human (the writer himself), or avoided in the first place (type in, rather than write).
  • Hand drawings shouldn't be sacrificed. Instead, they should be scanned and interpreted with computer vision.


First and foremost, I am talking about accuracy of data and early warning. I don't mean to infringe on anyone's privacy. Of course, privacy should be carefully thought about, including opt-in / opt-out.