Preview

Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS)

Advanced search

Automating the Process of Responding to a Tax Request Using Weak Supervision

https://doi.org/10.15514/ISPRAS-2024-36(3)-14

Abstract

This paper examines the application of weak observation techniques to automate tax claim processing in the banking sector. Interest in process automation using machine learning and artificial intelligence techniques in the financial sector has grown significantly in recent years, driven by the desire to improve efficiency, accuracy and customer service. Previous research in financial process automation has often relied on traditional machine learning approaches that require large amounts of well-analyzed data. However, in the banking industry, and especially in specific tasks such as processing tax authority claims, data annotation faces significant challenges due to the need for highly skilled professionals and privacy issues. Our work therefore aims to fill the research gap by applying weak observation, a technique that allows the use of imprecise, inconsistent or incomplete data to train models. This is particularly relevant for the banking sector, where data become outdated quickly and often have limited access due to regulatory constraints. Methodologically, to implement the idea of weak supervision, we used the Snorkel framework to create a training dataset using markup functions developed together with Bank Point experts. This allowed us to significantly reduce the dependence on the time-consuming process of manual data markup and to use large volumes of unlabeled documents. The results of the study showed that weak control approaches can significantly improve the efficiency of tax claims processing by creating models capable of classifying and interpreting different types of tax documents with high accuracy. In addition, the application of weak control can accommodate the need for constant updating of data and legislation, making it preferable for the dynamically changing environment of the financial sector. Using weak supervision to automate responses to tax claims not only improves the quality of data processing, but also helps to reduce the workload of specialists, improving the overall efficiency of financial operations. These results could have an impact on the future application of machine learning in the financial sector, given the importance of innovative approaches in data-constrained environments.

About the Authors

Artur Dmitrievich SOSNOVIKOV
Institute for System Programming of the Russian Academy of Sciences
Russian Federation

Graduate student at the Institute of System Programming since 2023. Research interests: machine learning methods, weakly supervised learning.



Denis Yurievich TURDAKOV
Institute for System Programming of the Russian Academy of Sciences
Russian Federation

Cand. Sci (Phys.-Math.), Head of Department at ISP RAS, associate professor of the Department of System Programming at Moscow State University. Research interests: natural language processing, information extraction, big data analysis, social network analysis.



References

1. Treleaven, Philip, and Bogdan Batrinca. "Algorithmic regulation: automating financial compliance monitoring and regulation using AI and blockchain." Journal of Financial Transformation 45 (2017): 14 - 21.

2. Cartea, Álvaro, Sebastian Jaimungal, and José Penalva. Algorithmic and high-frequency trading. Cambridge University Press, 2015.

3. Wilson, H. James, and Paul R. Daugherty. "Collaborative intelligence: Humans and AI are joining forces." Harvard Business Review 96.4 (2018): 114-123.

4. Horowitz, Michael C., et al. Artificial intelligence and international security. Center for a New American Security., 2022.

5. Winter, Karolin, and Stefanie Rinderle-Ma. "Detecting constraints and their relations from regulatory documents using nlp techniques." On the Move to Meaningful Internet Systems. OTM 2018 Conferences: Confederated International Conferences: CoopIS, C&TC, and ODBASE 2018, Valletta, Malta, October 22-26, 2018, Proceedings, Part I. Springer International Publishing, 2018.

6. Alexander J Ratner, Stephen H Bach, Henry Ehrenberg, Jason Fries, Sen Wu, and Christopher Ré. Snorkel: Rapid training data creation with weak supervision. In VLDB, 2017

7. Shin, Changho, et al. "Universalizing weak supervision." arXiv preprint arXiv:2112.03865 (2021).

8. Ratner, Alexander J., et al. "Data programming: Creating large training sets, quickly." Advances in neural information processing systems 29 (2016).

9. Bach, S. H., He, B., Ratner, A., & Ré, C. (2017, July). Learning the structure of generative models without labeled data. In International Conference on Machine Learning (pp. 273-282). PMLR.


Review

For citations:


SOSNOVIKOV A.D., TURDAKOV D.Yu. Automating the Process of Responding to a Tax Request Using Weak Supervision. Proceedings of the Institute for System Programming of the RAS (Proceedings of ISP RAS). 2024;36(3):203-212. (In Russ.) https://doi.org/10.15514/ISPRAS-2024-36(3)-14



Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2079-8156 (Print)
ISSN 2220-6426 (Online)