top of page

The NLP Solution for De-identification of Clinical Notes from EHR Systems on Azure Databricks


NLP Solution for De-identification of Clinical Notes from EHR Systems on Azure Databricks

Melax Tech is working in collaboration with a leading medical university to de-identify millions of clinical notes from their EHR systems using our de-identification solution. Our de-identification pipeline is equipped with machine learning and deep learning models, context rules, and specific resources tailored to recognize PHI entities from clinical notes, such as physician name lists. The solution also offers several post-processing options, including patient-level date shifting, synthetic value replacement, and other advanced transforms essential for complete de-identification.


Our team has successfully implemented Melax Tech's solution on Databricks clusters. This solution offers efficient and parallel processing of clinical notes, enabling us to work with large volumes of clinical documents without compromising performance. If you're interested in learning more about Melax Tech's NLP solution for de-identification of clinical notes, feel free to request a demo today!


bottom of page