TY - JOUR
T1 - Towards scalable and reliable coding of semantic property norms
T2 - ChatGPT vs. an improved AC-PLT
AU - Ramos, Diego
AU - Moreno, Sebastián
AU - Canessa, Enrique
AU - Chaigneau, Sergio E.
N1 - Publisher Copyright:
© The Psychonomic Society, Inc. 2025.
PY - 2025/11
Y1 - 2025/11
N2 - When using the Property Listing Task (PLT) to collect semantic content for a set of concepts (Concept Property Norms, CPNs), coding raw properties into standardized labels poses significant challenges. In this work, we address these challenges by enhancing the Assisted Coding for Property Listing Task (AC-PLT) framework, which facilitates the coding process. The current work conducts an ablation study to optimize AC-PLT by evaluating combinations of text cleaning, embedding models (e.g., Word2Vec, E5, LaBSE), and classification methods (e.g., kNN, SVM, XGBoost). Results show that normalization with the E5 embedding model and kNN classification achieves the highest accuracy, with top-1 test accuracies of 0.523 for CPN27 and 0.608 for CPN120 datasets, outperforming the original AC-PLT baseline. Comparisons with ChatGPT (fine-tuned and one-shot) reveal AC-PLT’s superior stability and cost-effectiveness, despite ChatGPT’s competitive performance in some cases. The improved AC-PLT framework offers a scalable, efficient solution to manual coding challenges, reducing variability and time constraints. Future work will explore its role as a recommender system for human coders, further enhancing its practical utility in cognitive psychology and psycholinguistics research.
AB - When using the Property Listing Task (PLT) to collect semantic content for a set of concepts (Concept Property Norms, CPNs), coding raw properties into standardized labels poses significant challenges. In this work, we address these challenges by enhancing the Assisted Coding for Property Listing Task (AC-PLT) framework, which facilitates the coding process. The current work conducts an ablation study to optimize AC-PLT by evaluating combinations of text cleaning, embedding models (e.g., Word2Vec, E5, LaBSE), and classification methods (e.g., kNN, SVM, XGBoost). Results show that normalization with the E5 embedding model and kNN classification achieves the highest accuracy, with top-1 test accuracies of 0.523 for CPN27 and 0.608 for CPN120 datasets, outperforming the original AC-PLT baseline. Comparisons with ChatGPT (fine-tuned and one-shot) reveal AC-PLT’s superior stability and cost-effectiveness, despite ChatGPT’s competitive performance in some cases. The improved AC-PLT framework offers a scalable, efficient solution to manual coding challenges, reducing variability and time constraints. Future work will explore its role as a recommender system for human coders, further enhancing its practical utility in cognitive psychology and psycholinguistics research.
KW - Ablation study
KW - Codification process
KW - Large language models
KW - Machine learning framework
KW - Property listing task
KW - Semantic memory
UR - https://www.scopus.com/pages/publications/105017931282
U2 - 10.3758/s13428-025-02838-5
DO - 10.3758/s13428-025-02838-5
M3 - Article
C2 - 41053389
AN - SCOPUS:105017931282
SN - 1554-351X
VL - 57
JO - Behavior Research Methods
JF - Behavior Research Methods
IS - 11
M1 - 302
ER -