Human-AI teaming for water quality documentation analysis with large language models
| dc.contributor.author | Conrad, S., author | |
| dc.contributor.author | Rodriguez, J., author | |
| dc.contributor.author | Vizarreta Luna, G., author | |
| dc.date.accessioned | 2026-05-20T18:13:25Z | |
| dc.date.issued | 2025-06 | |
| dc.description | Presented at the 13th World Congress on Water Resources and Environment (EWRA 2025) in Palermo, Italy on 24-28 June 2025. | |
| dc.description.abstract | Water quality management in urban systems depends on extensive documentation, including laboratory reports, field observations, regulatory guidance, and operational records. Manual analysis of these textual sources is time-intensive, inconsistent across reviewers, and often limited by workforce capacity. This study evaluates the use of large language models (LLMs) as part of a human-AI teaming workflow for identifying water quality concerns in documentation. Twenty water quality field and test documents were manually coded for concerns such as corrosion, pollutants, and chemical imbalances, then analyzed using OpenAI's GPT-4o, GPT-4o-mini, and o1-mini models. The models were tested using chunked document inputs and compared under open-text and localized prompting strategies that incorporated standard operating procedures and utility-specific water quality definitions. Performance was assessed using percent agreement, accuracy, precision, recall, and Fleiss’ Kappa. Results showed that localized chunked data improved model accuracy, agreement across iterations, and true positive identification compared with open-text prompting. The findings suggest that LLM-assisted document analysis can reduce interpretive workload, support more consistent review, and help water managers access previously underexamined textual data. Human-AI teaming offers a promising approach for improving the efficiency and reliability of water quality documentation analysis in urban water management. | |
| dc.format.medium | born digital | |
| dc.format.medium | proceedings (reports) | |
| dc.identifier.uri | https://hdl.handle.net/10217/244562 | |
| dc.language | English | |
| dc.language.iso | eng | |
| dc.publisher | Colorado State University. Libraries | |
| dc.relation.ispartof | Publications | |
| dc.rights | Copyright and other restrictions may apply. User is responsible for compliance with all applicable laws. For information about copyright law, please see https://libguides.colostate.edu/copyright. | |
| dc.subject | water quality management | |
| dc.subject | large language models | |
| dc.subject | human-AI teaming | |
| dc.subject | document analysis | |
| dc.subject | qualitative coding | |
| dc.subject | localized prompting | |
| dc.subject | urban water systems | |
| dc.subject | AI-assisted content analysis | |
| dc.title | Human-AI teaming for water quality documentation analysis with large language models | |
| dc.type | Text |
