How Can Federal Agencies Become AI Ready?

Home
/
Blog
/
AI
AI
How Can Federal Agencies Become AI Ready?
- Zendata
  April 24, 2024

Content

Example H2

Example H3

Our Newsletter

Get Our Resources Delivered Straight To Your Inbox

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

We respect your privacy. Learn more here.

Introduction

Market research shows that the AI market has doubled since 2021 and is expected to grow to USD 2 Trillion by 2030. Both public and private sector businesses stand to gain many benefits from effective AI implementation.

The difficulty lies in effectively implementing it.

This article will focus on a particular use case from the Department of Labor and outline how to prepare data for AI implementation, how to mitigate bias and promote fairness and how to maintain transparency and accountability in AI systems.

We’ll also discuss the requirements for deploying AI in US Governmental departments in compliance with Executive Order 13960 which mandates that agencies must “reliably assess, test, and monitor AI’s impacts on the public, mitigate the risks of algorithmic discrimination, and provide the public with transparency into how the government uses AI.”

The Use Case

The DoL is currently implementing an AI system for Claims Document Processing which allows them to “identify if physicians notes contain causal language by training custom natural language processing models.”

This NLP will cover document classification and sentence-level causal passage detection. This means it will need to accurately classify the notes into sections and then determine which clause or phrase is explicitly presented as influencing another.

Preparing Your Data for AI Implementation

For the Department of Labor, preparing their data for AI deployment will require great attention to detail. high standards of data quality, consistent data structure and interoperability.

Data Structure and Integration

Efficient data integration capabilities and interoperability are essential for AI systems, particularly when large volumes of data from various sources are involved. This is crucial for ensuring that data flows efficiently between systems and that these systems can manage complex data workflows.

Data Structure and Format: Organise the data into a format that NLP models can process effectively. In our use case, this means structuring each physician's note into distinct sections (e.g., symptoms, diagnosis, treatment), which aids in precise data analysis.
Integration Capabilities: Employ advanced techniques to integrate disparate data sources. This could involve using APIs to pull data from different databases or employing middleware to ensure data consistency across platforms. For the training of NLP models in our example, integrating electronic health records (EHRs) with laboratory results and prescription data could provide a comprehensive view of each patient's history. This broader context enables the AI to more accurately detect causal language and effectively process claims.

Annotation and Labelling

To train models for specific tasks such as classifying documents or detecting specific sentences, detailed annotation and labelling of training data are imperative.

Annotation Guidelines: Develop clear guidelines for annotating documents, which will help in creating a high-quality training set for the AI models. For example, identifying and marking causal language in physician’s notes requires precise guidelines to ensure consistency across annotations.
Labelling Tools: Use labelling tools that can handle the complexity of the documents processed. These tools should support the annotators in marking up text accurately and efficiently.

Contact Us For More Information

            If you’d like to understand more about Zendata’s solutions and how we can help you, please reach out to the
            team today.
        

Start Your Free Trial

Ensuring Data Quality

High-quality data is crucial for training reliable AI models. Regular data cleaning and validation processes are necessary to maintain accuracy and relevancy.

Data Cleaning: This involves correcting inaccuracies and inconsistencies in the data. For our use case, ensuring that physicians' notes are free from errors such as misspellings or incorrect medical terminology is vital. Such errors could significantly distort the AI’s analysis and outputs.
Validation Processes: Establish validation rules to maintain data integrity and relevance. For instance, validation can ensure that timestamps and patient identifiers in the dataset are consistent and correct. This step is crucial to avoid training models on flawed data, which could lead to incorrect interpretations of causal relationships in clinical notes.

This solid foundation not only supports the specific aims of the AI system, like enhancing the precision in processing claims based on physicians' notes but also upholds the overarching goals of fairness and transparency in the deployment of governmental AI solutions.

Ensuring Fairness in AI Applications

For Government operations, like our use case, ensuring fairness in AI systems is critical to maintaining public trust and meeting rigorous regulatory standards. This is especially significant in functions like claims processing, where AI-driven decisions directly affect individuals' lives.

Bias Detection and Mitigation

Detecting and mitigating bias early is vital for developing AI systems that make equitable decisions.

Bias Audits: Conduct thorough audits of AI models to detect biases. For example, these audits could analyse the model's performance across different demographic groups to identify disparities in how decisions are made.
Diverse Training Data: Ensure that the training data includes a wide range of scenarios and patient demographics. This diversity helps the AI model learn from a broad spectrum of cases, reducing the likelihood of biased outcomes. For example, including data from various geographic regions and socioeconomic backgrounds can help the model understand diverse linguistic uses in physicians' notes, which may indicate causal relationships differently.

Implementing Fairness-Enhancing Techniques

There are several techniques the DoL could use to enhance the fairness of their models.

Pre-processing Data:
- Data Resampling: Adjust data samples to ensure balanced representation, crucial for demographic parity in claims processing. For example, oversampling minority classes or undersampling overrepresented ones can help balance the dataset.
- Feature Selection: Remove or modify features that might lead to biased decisions, focusing only on those that are relevant and non-discriminatory. For example, excluding irrelevant personal information such as race or gender unless legally necessary.
In-processing Techniques:
- Adversarial Debiasing: Modify the training process to include an adversary that predicts sensitive attributes from the model’s decisions. This method aims to make the predictions fairer by stopping the model from using these attributes to make decisions.
- Regularisation Techniques: Introduce constraints during training to minimise bias, such as fairness constraints that reduce disparity in error rates between demographic groups.
Post-processing Techniques:
- Equalised Odds Post-processing: After training, adjust the model's outputs to equalise error rates across different groups. This ensures that no particular group is unfairly disadvantaged by the AI’s decisions.
- Threshold Adjustments: Modify decision thresholds for different groups based on performance metrics to compensate for any bias detected in the model's predictions.
Bias Mitigation Algorithms:
- Reweighing: Adjust the weights of training instances before the learning process begins to promote fairness. For instance, changing the weights can help the model pay more attention to underrepresented data points.
- Fairness-aware Learning Algorithms: Use algorithms specifically designed to optimise both accuracy and fairness. These can often be tuned to prioritise fairness according to legal and ethical standards.

For US Government agencies, AI development and deployment must be fair (free from bias) and transparent - especially when dealing with critical functions like claims processing. If a Government agency cannot meet the safeguards outlined in EO 13960 then they “must cease using the AI system…”

Small Language Models (SLMs)

Small Language Models (SLMs) are compact AI systems specifically engineered to process and understand human language using significantly less computational resources than larger models. They are particularly beneficial in scenarios like government operations where processing efficiency and decision accuracy need to be balanced with constraints on resources.

For tasks such as analysing physicians' notes to identify causal language, SLMs could streamline operations and contribute to reduced bias and enhanced data handling.

How Small Language Models Can Mitigate Bias

SLMs can inherently contribute to reducing bias in AI applications through several built-in advantages and strategies:

Focused Training Datasets: SLMs require less data to train effectively. This allows for the careful curation and selection of training datasets that are balanced and diverse, thus minimising the initial bias entering the model.
Model Transparency: Due to their smaller size and simpler structures, SLMs are often more interpretable than larger models. This transparency makes it easier to understand and audit model decisions, facilitating quicker identification and rectification of biased outcomes.
Incremental Learning: SLMs can be updated incrementally without extensive retraining. This feature allows for continuous improvement of the model as more diverse and balanced data becomes available or as biases are detected and need to be addressed in ongoing operations.

How Small Language Models Enhance Pre-processing

The pre-processing requirements for SLMs also support enhanced data integrity and quality that reduces errors and biases:

Selective Feature Use: SLMs' ability to operate effectively with fewer features encourages the selection of only the most relevant and unbiased features during preprocessing. This reduces the risk of model overfitting and bias that can occur when too many irrelevant or noisy variables are included.
Advanced Data Cleaning Techniques: Given the lower volumes of data used, more detailed data cleaning processes can be deployed for SLMs. This process helps to maintain the quality and consistency of the data and reduces the likelihood of bias and errors in model training and outputs.
Efficient Data Anonymisation: SLMs' reduced data needs make it feasible to apply more thorough anonymisation techniques, enhancing privacy protection and reducing bias by stripping unnecessary personal information that could lead to biased decisions.

By leveraging these characteristics, Small Language Models can significantly contribute to bias mitigation and improved preprocessing in AI-driven tasks within government operations.Their implementation allows for efficient, effective, fair and transparent decisions - maintaining high public sector standards.

Maintaining Transparency and Accountability in AI Systems

Like all areas of Government, the application of AI needs to be transparent and accountable. For the Department of Labor, where decisions can significantly affect individual livelihoods, the AI systems used must be not only effective but also perceivable as fair and just by the public.

Implementing Model Explainability

For something to be transparent, it needs to be explainable and with AI, this can be tricky to accomplish. There are two ways you can achieve this:

Explainability Techniques: Techniques such as Local Interpretable Model-agnostic Explanations (LIME) or SHapley Additive exPlanations (SHAP) can be used to illustrate how the NLP model makes decisions. For example, in our use case, these techniques could reveal why certain phrases in a physician's note were identified as causal, helping reviewers understand the model's reasoning.
Documentation and Communication: Maintain detailed documentation of how AI models operate. This includes not only the technical descriptions but also the rationale behind model choices, which should be communicated clearly to all stakeholders. This ensures that when the AI system flags a claim based on a physician’s note, the process and reasoning are fully transparent to the claimant and can be scrutinised if necessary.

Data and AI Governance

Data and AI Governance processes help to ensure accountability in AI systems. This can encompass:

Audit Trails: Create comprehensive audit trails that record decisions made by the AI system. For our use case, this would include logging all instances where the NLP model identified causal language and the corresponding outcomes. Audit trails help track decisions back to their source, crucial for addressing any disputes or grievances.
Ongoing Monitoring and Reviews: Implement regular reviews of the AI system to ensure it continues to operate as intended and adheres to ethical guidelines. This involves periodic checking of the system’s performance and fairness metrics, especially important in dynamic fields such as labour claims where societal norms and legal standards may evolve.
Stakeholder Engagement: Regularly engage with stakeholders, including policymakers, technologists and the public, to gather feedback on the AI system’s performance and impact. This feedback loop can provide insights that might not be evident from internal reviews alone.

Requirements for Deploying AI in US Governmental Departments

Deploying AI within US governmental departments, such as the Department of Labor, entails meeting specific regulatory and operational requirements to ensure both effectiveness and compliance with federal standards.

Compliance with Federal AI Standards

The deployment of AI in federal settings must adhere to a set of established federal guidelines that dictate how AI should be developed, deployed and monitored:

Ethical Guidelines: Federal guidelines often emphasise ethical AI use, mandating that AI systems operate transparently, fairly and without bias. In our use case, this means the NLP system must be capable of demonstrating that its process for detecting causal language does not unfairly disadvantage any group of claimants.
Privacy Regulations: Adherence to privacy laws such as the Health Insurance Portability and Accountability Act (HIPAA) in handling health-related information is crucial. The NLP model must ensure that all personal data used during the processing of claims is handled and stored in a manner that respects patient confidentiality and complies with all applicable laws.
Accessibility Standards: AI systems should be accessible, ensuring that all individuals, regardless of disability, can interact with or be affected by the system equitably. This includes providing means for individuals to question or appeal AI-driven decisions that affect them.

Continuous Improvement and Oversight

To maintain compliance and efficacy, continuous improvement and oversight mechanisms must be embedded throughout the lifecycle of the AI system:

Regular Assessments: Regularly evaluate the AI system to ensure it continues to meet ethical, legal and operational standards. For instance, the Department of Labor could conduct bi-annual reviews of the NLP system to assess its accuracy and fairness in identifying causal language.
Oversight Committees: Establish oversight committees to monitor AI deployments. These committees would be responsible for reviewing all aspects of AI operations, from data handling and model training to deployment and post-deployment activities. They would make sure that the AI systems align with both internal policies and external legal requirements.

This structured approach to AI deployment helps in building a system that is not only technologically advanced but also ethically sound and publicly accountable.

While we may have examined a particular use case for this article, the ideas discussed apply to all Governmental departments that are looking to deploy AI, particularly in sensitive areas such as claims processing. For effective AI implementation, departments need to take a detailed and structured approach to data readiness, fairness, transparency and adherence to regulatory standards.

Our examination of using NLPs to identify causal language in physicians' notes highlights the complexity of developing AI systems that are technically proficient, ethically sound and compliant with federal guidelines.

The steps outlined in this article—from ensuring high-quality data preparation to mitigating bias and maintaining rigorous transparency and accountability mechanisms—are vital for any AI initiative within government settings. Not only do these steps meet the requirements of federal regulations but they also help to build public trust - a crucial element when AI decisions impact individual rights and livelihoods.

Our Newsletter

Get Our Resources Delivered Straight To Your Inbox

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

We respect your privacy. Learn more here.

Balancing Privacy and Fairness In Machine Learning