Resumo

Título do Artigo

SUSTAINABLE PROJECT MANAGEMENT FOR THE FOURTH INDUSTRIAL REVOLUTION: SYSTEMATIC REVIEW ON DATA QUALITY EVALUATION OF DATA USED IN COGNITIVE COMPUTING AND AI
Abrir Arquivo
Ver apresentação do trabalho

Palavras Chave

Data quality
Artificial Intelligence
Data Representativeness

Área

Tecnologia da Informação

Tema

Sistemas de Informações e Impactos Organizacionais da TI

Autores

Nome
1 - Marco Aurélio Mazzei
UNIVERSIDADE PAULISTA (UNIP) - Pós-graduação em Administração
2 - Arnaldo Luiz Ryngelblum
UNIVERSIDADE PAULISTA (UNIP) - Programa de Pós-Graduação de Administração
3 - Mauro Luiz Martens
UNIVERSIDADE PAULISTA (UNIP) - Indianópolis

Reumo

Effective use of AI tools depends on our understanding of how the quality and representativeness of data used in AI training and Cognitive Computing can impact algorithm development and how reliable they can be. A systematic review of literature was done to understand how the propositions to use AI to support sustainability in project management evaluates data quality in its technological, linguistic, and sociocultural dimensions and in a new front, propose implementation of guards against improper feedback generated synthetically by Larga Language Models.
When proposing the use of AI to support sustainability in project management, attention should be paid to the quality of data and feedback used in algorithm training. Projects that propose such use must work out how to obtain data quality in all its dimensions. Understand how research and the production of academic knowledge discuss and evaluate the quality and representativeness of the data used in the training of AI algorithms and if they are considering strategies to improve their quality and correct human training to train algorithms.
To assess data quality considerations on the articles, we were able to establish a framework based on ongoing discussions by grouping the types of problem sources into four categories: raw data, synthetic data generated, linguistic, and sociocultural type errors. Such a framework was developed considering articles in the areas of artificial intelligence and cognitive computing development that are already facing the results of these types of errors and alerting to the importance of training to improve technology results.
The academic community already understands raw data quality as a prerequisite for Industry 4.0 technologies and explicitly mentions it in all fourteen articles. As for the discussion of linguistic aspects of data quality, eight articles list this as an important factor. Of the set of articles, only four discussed the sociocultural aspects of data collection and use and none discussed the issue of synthetic data generated and its consequences with the popularization of large language models leading generative artificial intelligence to the masses
There is a risk in automating decision-making processes based on algorithms trained with poor data quality or with a lack of sample representativeness. This risk is already on the research radar However, there is also a new risk arising from using textually biased data to train algorithms that will absorb this bias and hide bad decisions or make them invisible. With the advancement of technology, new risks arise with the use of the Large Language Model tools to do the training replacing the human work in giving feedback on the results of the algorithm being trained.
Aljawder, A., & Al-Karaghouli, W. (2022). The adoption of technology management principles and artificial intelligence for a sustainable lean construction industry in the case of Bahrain. Journal of Decision Systems, 1–30. https://doi.org/10.1080/12460125.2022.2075529 Aria, M., & Cuccurullo, C. (2017). bibliometrix: An R-tool for comprehensive science mapping analysis. Journal of Informetrics, 11(4), 959–975. https://doi.org/10.1016/j.joi.2017.08.007 Baas, J., Schotten, M., Plume, A., Côté, G., & Karimi, R. (2020). Scopus as a curated, high-quality bibliometric data source for academic researc