References

Aalen, O. (1978). Nonparametric inference for a family of counting processes. The Annals of Statistics, 6(4), 701–726. https://doi.org/10.1214/aos/1176344247
Aas, K., Czado, C., Frigessi, A., & Bakken, H. (2009). Pair-copula constructions of multiple dependence. Insurance: Mathematics and Economics, 44(2), 182–198. https://doi.org/10.1016/j.insmatheco.2007.02.001
Aas, K., Jullum, M., & Løland, A. (2021). Explaining individual predictions when features are dependent: More accurate approximations to Shapley values. Artificial Intelligence, 298, 103502. https://doi.org/10.1016/j.artint.2021.103502
Abadi, M., Chu, A., Goodfellow, I., McMahan, H. B., Mironov, I., Talwar, K., & Zhang, L. (2016). Deep learning with differential privacy. Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security (CCS), 308–318. https://doi.org/10.1145/2976749.2978318
Abnar, S., & Zuidema, W. (2020). Quantifying attention flow in transformers. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), 4190–4197. https://doi.org/10.18653/v1/2020.acl-main.385
Acemoglu, D., Carvalho, V. M., Ozdaglar, A., & Tahbaz-Salehi, A. (2012). The network origins of aggregate fluctuations. Econometrica, 80(5), 1977–2016. https://doi.org/10.3982/ECTA9623
Acemoglu, D., Ozdaglar, A., & Tahbaz-Salehi, A. (2015). Systemic risk and stability in financial networks. American Economic Review, 105(2), 564–608. https://doi.org/10.1257/aer.20130456
Acharya, V. V., Berger, A. N., & Roman, R. A. (2018). Lending implications of u.s. Bank stress tests: Costs or benefits? Journal of Financial Intermediation, 34, 58–90. https://doi.org/10.1016/j.jfi.2018.01.004
Acharya, V. V., Berner, R., Engle, R. F., Jung, H., Stroebel, J., Zeng, X., & Zhao, Y. (2023). Climate stress testing. Annual Review of Financial Economics, 15, 291–326. https://doi.org/10.1146/annurev-financial-110921-101555
Acharya, V. V., Engle, R. F., & Pierret, D. (2014). Testing macroprudential stress tests: The risk of regulatory risk weights. Journal of Monetary Economics, 65, 36–53. https://doi.org/10.1016/j.jmoneco.2014.04.014
Acharya, V. V., Schnabl, P., & Suarez, G. (2013). Securitization without risk transfer. Journal of Financial Economics, 107(3), 515–536. https://doi.org/10.1016/j.jfineco.2012.09.004
Acquisti, A., Brandimarte, L., & Loewenstein, G. (2015). Privacy and human behavior in the age of information. Science, 347(6221), 509–514. https://doi.org/10.1126/science.aaa1465
Acquisti, A., Taylor, C., & Wagman, L. (2016). The economics of privacy. Journal of Economic Literature, 54(2), 442–492. https://doi.org/10.1257/jel.54.2.442
Adams, P., Guttman-Kenney, B., Hayes, L., Hunt, S., Laibson, D., & Stewart, N. (2022). Do nudges reduce borrowing and consumer confusion in the credit card market? Economica, 89(S1), S178–S199. https://doi.org/10.1111/ecca.12427
Adams, W., Einav, L., & Levin, J. (2009a). Liquidity constraints and imperfect information in subprime lending. American Economic Review, 99(1), 49–84. https://doi.org/10.1257/aer.99.1.49
Adams, W., Einav, L., & Levin, J. (2009b). Liquidity constraints and imperfect information in subprime lending. American Economic Review, 99(1), 49–84. https://doi.org/10.1257/aer.99.1.49
Agarwal, A., Beygelzimer, A., Dudı́k, M., Langford, J., & Wallach, H. (2018). A reductions approach to fair classification. Proceedings of the 35th International Conference on Machine Learning (ICML), 60–69.
Agarwal, S., Alok, S., Ghosh, P., & Gupta, S. (2020). Financial inclusion and alternate credit scoring for the millennials: Role of big data and machine learning in fintech. SSRN Working Paper, (3507827). https://doi.org/10.2139/ssrn.3507827
Agarwal, S., Amromin, G., Ben-David, I., Chomsisengphet, S., Piskorski, T., & Seru, A. (2017). Policy intervention in debt renegotiation: Evidence from the Home Affordable Modification Program. Journal of Political Economy, 125(3), 654–712. https://doi.org/10.1086/691701
Agarwal, S., Chomsisengphet, S., Liu, C., Song, C., & Souleles, N. S. (2018). Benefits of relationship banking: Evidence from consumer credit markets. Journal of Monetary Economics, 96, 16–32. https://doi.org/10.1016/j.jmoneco.2018.02.005
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J. (2015). Regulating consumer financial products: Evidence from credit cards. The Quarterly Journal of Economics, 130(1), 111–164. https://doi.org/10.1093/qje/qju037
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J. (2018a). Do banks pass through credit expansions to consumers who want to borrow? The Quarterly Journal of Economics, 133(1), 129–190. https://doi.org/10.1093/qje/qjx027
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J. (2018b). Do banks pass through credit expansions to consumers who want to borrow? The Quarterly Journal of Economics, 133(1), 129–190. https://doi.org/10.1093/qje/qjx027
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J. (2018c). Do banks pass through credit expansions to consumers who want to borrow? Quarterly Journal of Economics, 133(1), 129–190. https://doi.org/10.1093/qje/qjx027
Agarwal, S., & Hauswald, R. (2010). Distance and private information in lending. Review of Financial Studies, 23(7), 2757–2788. https://doi.org/10.1093/rfs/hhq001
Agarwal, S., Qian, W., Yeung, B. Y., & Zou, X. (2019). Mobile wallet and entrepreneurial growth. AEA Papers and Proceedings, 109, 48–53. https://doi.org/10.1257/pandp.20191010
Agarwal, V., & Taffler, R. (2008). Comparing the performance of market-based and accounting-based bankruptcy prediction models. Journal of Banking & Finance, 32(8), 1541–1551. https://doi.org/10.1016/j.jbankfin.2007.07.014
Aguiar, M., & Gopinath, G. (2006). Defaultable debt, interest rates and the current account. Journal of International Economics, 69(1), 64–83. https://doi.org/10.1016/j.jinteco.2005.05.005
Aker, J. C., & Mbiti, I. M. (2010). Mobile phones and economic development in Africa. Journal of Economic Perspectives, 24(3), 207–232. https://doi.org/10.1257/jep.24.3.207
Akerlof, G. A. (1970). The market for “lemons”: Quality uncertainty and the market mechanism. The Quarterly Journal of Economics, 84(3), 488–500. https://doi.org/10.2307/1879431
Akidau, T., Bradshaw, R., Chambers, C., Chernyak, S., Fernández-Moctezuma, R. J., Lax, R., McVeety, S., Mills, D., Perry, F., Schmidt, E., & Whittle, S. (2015). The dataflow model: A practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing. Proceedings of the VLDB Endowment, 8(12), 1792–1803. https://doi.org/10.14778/2824032.2824076
Allen, F., & Gale, D. (2000). Financial contagion. Journal of Political Economy, 108(1), 1–33. https://doi.org/10.1086/262109
Allen, J., Clark, R., & Houde, J.-F. (2014). The effect of mergers in search markets: Evidence from the Canadian mortgage industry. American Economic Review, 104(10), 3365–3396. https://doi.org/10.1257/aer.104.10.3365
Allen, J., Clark, R., & Houde, J.-F. (2019). Search frictions and market power in negotiated-price markets. Journal of Political Economy, 127(4), 1550–1598. https://doi.org/10.1086/701684
Allison, P. D. (1982). Discrete-time methods for the analysis of event histories. Sociological Methodology, 13, 61–98. https://doi.org/10.2307/270718
Alpaydin, E. (1999). Combined 5×2 CV F test for comparing supervised classification learning algorithms. Neural Computation, 11(8), 1885–1892. https://doi.org/10.1162/089976699300016007
Altman, E. I. (1968). Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. The Journal of Finance, 23(4), 589–609. https://doi.org/10.2307/2978933
Altman, E. I. (2000). Predicting financial distress of companies: Revisiting the Z-score and ZETA models. Stern School of Business, New York University Working Paper.
Altman, E. I. (2005). An emerging market credit scoring system for corporate bonds. Emerging Markets Review, 6(4), 311–323. https://doi.org/10.1016/j.ememar.2005.09.007
Altman, E. I., Brady, B., Resti, A., & Sironi, A. (2005). The link between default and recovery rates: Theory, empirical evidence, and implications. The Journal of Business, 78(6), 2203–2228. https://doi.org/10.1086/497044
Altman, E. I., Haldeman, R. G., & Narayanan, P. (1977a). ZETA analysis: A new model to identify bankruptcy risk of corporations. Journal of Banking & Finance, 1(1), 29–54. https://doi.org/10.1016/0378-4266(77)90017-6
Altman, E. I., Haldeman, R. G., & Narayanan, P. (1977b). ZETA analysis: A new model to identify bankruptcy risk of corporations. Journal of Banking & Finance, 1(1), 29–54. https://doi.org/10.1016/0378-4266(77)90017-6
Altman, E. I., Iwanicz-Drozdowska, M., Laitinen, E. K., & Suvas, A. (2017). Financial distress prediction in an international context: A review and empirical analysis of Altman’s Z-score model. Journal of International Financial Management & Accounting, 28(2), 131–171. https://doi.org/10.1111/jifm.12053
Altman, E. I., & Sabato, G. (2007). Modelling credit risk for SMEs: Evidence from the US market. Abacus, 43(3), 332–357. https://doi.org/10.1111/j.1467-6281.2007.00234.x
Altmann, A., Toloşi, L., Sander, O., & Lengauer, T. (2010). Permutation importance: A corrected feature importance measure. Bioinformatics, 26(10), 1340–1347. https://doi.org/10.1093/bioinformatics/btq134
Alvarez-Melis, D., & Jaakkola, T. S. (2018). On the robustness of interpretability methods.
Ambrose, B. W., & LaCour-Little, M. (2001). Prepayment risk in adjustable rate mortgages subject to initial year discounts: Some new evidence. Real Estate Economics, 29(2), 305–327. https://doi.org/10.1111/1080-8620.00012
Amershi, S., Begel, A., Bird, C., DeLine, R., Gall, H., Kamar, E., Nagappan, N., Nushi, B., & Zimmermann, T. (2019). Software engineering for machine learning: A case study. IEEE/ACM International Conference on Software Engineering (ICSE-SEIP), 291–300. https://doi.org/10.1109/ICSE-SEIP.2019.00042
An, X., Cordell, L., Smith, L., & Wang, K. (2022). Racial and ethnic disparities in mortgage lending: New evidence from Expanded HMDA data. Federal Reserve Bank of Philadelphia Working Paper, (22-02). https://www.philadelphiafed.org/the-economy/banking-and-financial-markets/racial-and-ethnic-disparities-in-mortgage-lending
Andersen, P. K., & Gill, R. D. (1982). Cox’s regression model for counting processes: A large sample study. The Annals of Statistics, 10(4), 1100–1120. https://doi.org/10.1214/aos/1176345976
Anderson, R. (2007). The credit scoring toolkit: Theory and practice for retail credit risk management and decision automation.
Anderson, T. W. (1951). Classification by multivariate analysis. Psychometrika, 16(1), 31–50. https://doi.org/10.1007/BF02313425
Andrews, I., Stock, J. H., & Sun, L. (2019). Weak instruments in instrumental variables regression: Theory and practice. Annual Review of Economics, 11, 727–753. https://doi.org/10.1146/annurev-economics-080218-025643
Angelino, E., Larus-Stone, N., Alabi, D., Seltzer, M., & Rudin, C. (2018). Learning certifiably optimal rule lists for categorical data. Journal of Machine Learning Research, 18, 1–78.
Angelopoulos, A. N., & Bates, S. (2023). Conformal prediction: A gentle introduction. Foundations and Trends in Machine Learning, 16(4), 494–591. https://doi.org/10.1561/2200000101
Angelopoulos, A. N., Bates, S., Jordan, M., & Malik, J. (2021). Uncertainty sets for image classifiers using conformal prediction. International Conference on Learning Representations (ICLR).
Angrist, J. D., Imbens, G. W., & Rubin, D. B. (1996). Identification of causal effects using instrumental variables. Journal of the American Statistical Association, 91(434), 444–455. https://doi.org/10.2307/2291629
Ansari, A. F., Stella, L., Turkmen, C., Zhang, X., Mercado, P., Shen, H., Shchur, O., Rangapuram, S. S., Arango, S. P., Kapoor, S., Zschiegner, J., Maddix, D. C., Wang, H., Mahoney, M. W., Torkkola, K., Wilson, A. G., Bohlke-Schneider, M., & Wang, Y. (2024). Chronos: Learning the language of time series. Transactions on Machine Learning Research; arXiv:2403.07815. https://arxiv.org/abs/2403.07815
Antweiler, W., & Frank, M. Z. (2004). Is all that talk just noise? The information content of internet stock message boards. The Journal of Finance, 59(3), 1259–1294. https://doi.org/10.1111/j.1540-6261.2004.00662.x
Apley, D. W., & Zhu, J. (2020). Visualizing the effects of predictor variables in black box supervised learning models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 82(4), 1059–1086. https://doi.org/10.1111/rssb.12377
Araci, D. (2019). FinBERT: Financial sentiment analysis with pre-trained language models. arXiv:1908.10063.
Arellano, C. (2008). Default risk and income fluctuations in emerging economies. American Economic Review, 98(3), 690–712. https://doi.org/10.1257/aer.98.3.690
Argyle, B. S., Nadauld, T. D., & Palmer, C. J. (2020). Monthly payment targeting and the demand for maturity. Review of Financial Studies, 33(11), 5416–5462. https://doi.org/10.1093/rfs/hhaa004
Aridor, G., Che, Y.-K., & Salz, T. (2024). The effect of privacy regulation on the data industry: Empirical evidence from GDPR. RAND Journal of Economics, 55(4), 503–530. https://doi.org/10.1111/1756-2171.12586
Arik, S. Ö., & Pfister, T. (2021). TabNet: Attentive interpretable tabular learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35, 6679–6687.
Arkhangelsky, D., Athey, S., Hirshberg, D. A., Imbens, G. W., & Wager, S. (2021). Synthetic difference-in-differences. American Economic Review, 111(12), 4088–4118. https://doi.org/10.1257/aer.20190159
Arlot, S., & Celisse, A. (2010). A survey of cross-validation procedures for model selection. Statistics Surveys, 4, 40–79. https://doi.org/10.1214/09-SS054
Armbrust, M., Das, T., Torres, J., Yavuz, B., Zhu, S., Xin, R., Ghodsi, A., Stoica, I., & Zaharia, M. (2018). Structured streaming: A declarative API for real-time applications in Apache Spark. Proceedings of the 2018 ACM International Conference on Management of Data (SIGMOD), 601–613. https://doi.org/10.1145/3183713.3190664
Arnold, D., Dobbie, W., & Yang, C. S. (2018). Racial bias in bail decisions. The Quarterly Journal of Economics, 133(4), 1885–1932. https://doi.org/10.1093/qje/qjy012
Aronszajn, N. (1950). Theory of reproducing kernels. Transactions of the American Mathematical Society, 68(3), 337–404. https://doi.org/10.2307/1990404
Arrieta, A. B., Dı́az-Rodrı́guez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., Garcı́a, S., Gil-López, S., Molina, D., Benjamins, R., Chatila, R., & Herrera, F. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58, 82–115. https://doi.org/10.1016/j.inffus.2019.12.012
Ascarza, E. (2018). Retention futility: Targeting high-risk customers might be ineffective. Journal of Marketing Research, 55(1), 80–98. https://doi.org/10.1509/jmr.16.0163
Asian Development Bank. (2022a). Fintech policy tool kit for regulators and policy makers in Asia and the Pacific. Asian Development Bank. https://www.adb.org/publications/fintech-policy-tool-kit-regulators-policy-makers-asia-pacific
Asian Development Bank. (2022b). Viet nam financial sector report: Deepening financial inclusion. Asian Development Bank. https://www.adb.org/countries/viet-nam/main
Asian Development Bank. (2023). Digital financial inclusion in Southeast Asia. Asian Development Bank. https://www.adb.org/publications/digital-financial-inclusion-southeast-asia
Assefa, S. A., Dervovic, D., Mahfouz, M., Tillman, R. E., Reddy, P., & Veloso, M. (2020). Generating synthetic data in finance: Opportunities, challenges and pitfalls. Proceedings of the First ACM International Conference on AI in Finance. https://doi.org/10.1145/3383455.3422554
Athey, S., & Imbens, G. (2016). Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences, 113(27), 7353–7360. https://doi.org/10.1073/pnas.1510489113
Athey, S., Tibshirani, J., & Wager, S. (2019). Generalized random forests. The Annals of Statistics, 47(2), 1148–1178. https://doi.org/10.1214/18-AOS1709
Athey, S., & Wager, S. (2021). Policy learning with observational data. Econometrica, 89(1), 133–161. https://doi.org/10.3982/ECTA15732
Atiya, A. F. (2001). Bankruptcy prediction for credit risk using neural networks: A survey and new results. IEEE Transactions on Neural Networks, 12(4), 929–935. https://doi.org/10.1109/72.935101
Avery, R. B., Brevoort, K. P., & Canner, G. B. (2007). The 2006 HMDA data. Federal Reserve Bulletin, 93, A73–A109.
Avery, R. B., Brevoort, K. P., & Canner, G. B. (2009b). Credit scoring and its effects on the availability and affordability of credit. Journal of Consumer Affairs, 43(3), 516–537. https://doi.org/10.1111/j.1745-6606.2009.01151.x
Avery, R. B., Brevoort, K. P., & Canner, G. B. (2009a). Credit scoring and its effects on the availability and affordability of credit. Journal of Consumer Affairs, 43(3), 516–537. https://doi.org/10.1111/j.1745-6606.2009.01151.x
Avery, R. B., Calem, P. S., Canner, G. B., & Bostic, R. W. (2003). An overview of consumer data and credit reporting. Federal Reserve Bulletin, 89, 47–73.
Azizpour, S., Giesecke, K., & Schwenkler, G. (2018). Exploring the sources of default clustering. Journal of Financial Economics, 129(1), 154–183. https://doi.org/10.1016/j.jfineco.2018.04.008
Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer normalization. arXiv Preprint arXiv:1607.06450.
Babaev, D., Ovsov, N., Kireev, I., Ivanova, M., Gusev, G., Nazarov, I., & Tuzhilin, A. (2022). CoLES: Contrastive learning for event sequences with self-supervision. https://doi.org/10.1145/3514221.3526129
Babina, T., Bahaj, S. A., Buchak, G., De Marco, F., Foulis, A. K., Gornall, W., Mazzola, F., & Yu, T. (2024). Customer data access and fintech entry: Early evidence from open banking. National Bureau of Economic Research Working Paper, (32089). https://doi.org/10.3386/w32089
Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.-R., & Samek, W. (2015). On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLOS ONE, 10(7), e0130140. https://doi.org/10.1371/journal.pone.0130140
Baesens, B., Van Gestel, T., Stepanova, M., Van den Poel, D., & Vanthienen, J. (2005). Neural network survival analysis for personal loan data. Journal of the Operational Research Society, 56(9), 1089–1098. https://doi.org/10.1057/palgrave.jors.2601990
Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J., & Vanthienen, J. (2003). Benchmarking state-of-the-art classification algorithms for credit scoring. Journal of the Operational Research Society, 54(6), 627–635. https://doi.org/10.1057/palgrave.jors.2601545
Baghai, R. P., Servaes, H., & Tamayo, A. (2014). Have rating agencies become more conservative? Implications for capital structure and debt pricing. Journal of Finance, 69(5), 1961–2005. https://doi.org/10.1111/jofi.12153
Bai, S., Kolter, J. Z., & Koltun, V. (2018). An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv Preprint arXiv:1803.01271.
Bai, X., Tsiatis, A. A., & O’Brien, S. M. (2013). Doubly robust estimators of treatment-specific survival distributions in observational studies with stratified sampling. Biometrics, 69(4), 830–839. https://doi.org/10.1111/biom.12076
Bai, Y., Kadavath, S., Kundu, S., Askell, A., Kernion, J., Jones, A., Chen, A., Goldie, A., Mirhoseini, A., McKinnon, C., Chen, C., Olsson, C., Olah, C., Hernandez, D., Drain, D., Ganguli, D., Li, D., Tran-Johnson, E., Perez, E., … Kaplan, J. (2022). Constitutional AI: Harmlessness from AI feedback. arXiv:2212.08073.
Baker, S. R. (2018). Debt and the response to household income shocks: Validation and application of linked financial account data. Journal of Political Economy, 126(4), 1504–1557. https://doi.org/10.1086/698106
Baker, S. R., Bloom, N., & Davis, S. J. (2016). Measuring economic policy uncertainty. The Quarterly Journal of Economics, 131(4), 1593–1636. https://doi.org/10.1093/qje/qjw024
Baldauf, M., Garlappi, L., & Yannelis, C. (2020). Does climate change affect real estate prices? Only if you believe in it. Review of Financial Studies, 33(3), 1256–1295. https://doi.org/10.1093/rfs/hhz073
Balyuk, T., & Davydenko, S. A. (2024). Reintermediation in FinTech: Evidence from online lending. Journal of Financial and Quantitative Analysis, 59(5), 1997–2037. https://doi.org/10.1017/S0022109023000789
Bamber, D. (1975). The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. Journal of Mathematical Psychology, 12(4), 387–415. https://doi.org/10.1016/0022-2496(75)90001-2
Banasik, J., & Crook, J. (2007). Reject inference, augmentation, and sample selection. European Journal of Operational Research, 183(3), 1582–1594. https://doi.org/10.1016/j.ejor.2006.06.072
Banasik, J., Crook, J. N., & Thomas, L. C. (1999a). Not if but when will borrowers default. Journal of the Operational Research Society, 50(12), 1185–1190. https://doi.org/10.1057/palgrave.jors.2600851
Banasik, J., Crook, J. N., & Thomas, L. C. (1999b). Not if but when will borrowers default. Journal of the Operational Research Society, 50(12), 1185–1190. https://doi.org/10.1057/palgrave.jors.2600851
Banasik, J., Crook, J. N., & Thomas, L. C. (2003). Sample selection bias in credit scoring models. Journal of the Operational Research Society, 54(8), 822–832. https://doi.org/10.1057/palgrave.jors.2601578
Banco Central do Brasil. (2013). Circular no. 3.648: IRB approach for credit risk capital requirement. Banco Central do Brasil. https://www.bcb.gov.br/
Banco Central do Brasil. (2020). Joint resolution no. 1: Implementation of Open Finance in brazil. Banco Central do Brasil; Conselho Monetário Nacional. https://www.bcb.gov.br/estabilidadefinanceira/openfinance
Bangia, A., Diebold, F. X., Kronimus, A., Schagen, C., & Schuermann, T. (2002). Ratings migration and the business cycle, with application to credit portfolio stress testing. Journal of Banking & Finance, 26(2–3), 445–474. https://doi.org/10.1016/S0378-4266(01)00229-1
Bank for International Settlements. (2020). Financial stability considerations in emerging market economies: BIS papers no. 113. Bank for International Settlements. https://www.bis.org/publ/bppdf/bispap113.htm
Bank for International Settlements. (2022a). Big tech regulation: In search of a new framework (FSI Occasional Paper 20). Bank for International Settlements. https://www.bis.org/fsi/fsipapers20.htm
Bank for International Settlements. (2022b). Credit markets in emerging market economies: Evolution and policy challenges (BIS Papers 125). Bank for International Settlements. https://www.bis.org/publ/bppdf/bispap125.htm
Bank for International Settlements. (2023a). Big tech regulation: In search of a new framework (BIS Papers 141). Bank for International Settlements. https://www.bis.org/publ/bppdf/bispap141.htm
Bank for International Settlements. (2023b). Financial stability risks from non-bank financial intermediation in emerging market economies. BIS Papers. https://www.bis.org/
Bank for International Settlements, Financial Stability Institute. (2024). Regulating AI in the financial sector: Recent developments and main challenges (FSI insights no. 63). Bank for International Settlements.
Bank of England. (2022). Stress testing the UK banking system: Key elements of the 2022 annual cyclical scenario. Bank of England. https://www.bankofengland.co.uk/stress-testing/2022/key-elements-of-the-2022-annual-cyclical-scenario
Barber, R. F., Candès, E. J., Ramdas, A., & Tibshirani, R. J. (2021). Predictive inference with the jackknife+. The Annals of Statistics, 49(1), 486–507. https://doi.org/10.1214/20-AOS1965
Barboni, G., Cárdenas, J. C., & De Roux, N. (2026). Behavioral messages and debt repayment. Review of Finance, rfag015.
Barboza, F., Kimura, H., & Altman, E. (2017). Machine learning models and bankruptcy prediction. Expert Systems with Applications, 83, 405–417. https://doi.org/10.1016/j.eswa.2017.04.006
Bardoscia, M., Barucca, P., Battiston, S., Caccioli, F., Cimini, G., Garlaschelli, D., Saracco, F., Squartini, T., & Caldarelli, G. (2021). The physics of financial networks. Nature Reviews Physics, 3(7), 490–507. https://doi.org/10.1038/s42254-021-00322-5
Barocas, S., & Selbst, A. D. (2016). Big data’s disparate impact. California Law Review, 104(3), 671–732.
Barocas, S., Selbst, A. D., & Raghavan, M. (2020). The hidden assumptions behind counterfactual explanations and principal reasons. Proceedings of the ACM Conference on Fairness, Accountability, and Transparency, 80–89. https://doi.org/10.1145/3351095.3372830
Barron, A. R. (1993). Universal approximation bounds for superpositions of a sigmoidal function. IEEE Transactions on Information Theory, 39(3), 930–945. https://doi.org/10.1109/18.256500
Barrot, J.-N., & Sauvagnat, J. (2016). Input specificity and the propagation of idiosyncratic shocks in production networks. Quarterly Journal of Economics, 131(3), 1543–1592. https://doi.org/10.1093/qje/qjw018
Bartlett, P. L., & Mendelson, S. (2002). Rademacher and Gaussian complexities: Risk bounds and structural results. Journal of Machine Learning Research, 3, 463–482.
Bartlett, R., Morse, A., Stanton, R., & Wallace, N. (2022b). Consumer-lending discrimination in the FinTech era. Journal of Financial Economics, 143(1), 30–56. https://doi.org/10.1016/j.jfineco.2021.05.047
Bartlett, R., Morse, A., Stanton, R., & Wallace, N. (2022a). Consumer-lending discrimination in the FinTech era. Journal of Financial Economics, 143(1), 30–56. https://doi.org/10.1016/j.jfineco.2021.05.047
Basel Committee on Banking Supervision. (2005b). An explanatory note on the basel II IRB risk weight functions. Bank for International Settlements. https://www.bis.org/bcbs/irbriskweight.htm
Basel Committee on Banking Supervision. (2005c). An explanatory note on the basel II IRB risk weight functions. Bank for International Settlements. https://www.bis.org/bcbs/irbriskweight.htm
Basel Committee on Banking Supervision. (2005a). An explanatory note on the basel II IRB risk weight functions. Bank for International Settlements. https://www.bis.org/bcbs/irbriskweight.htm
Basel Committee on Banking Supervision. (2005d). An explanatory note on the basel II IRB risk weight functions. Bank for International Settlements. https://www.bis.org/bcbs/irbriskweight.pdf
Basel Committee on Banking Supervision. (2005e). Studies on the validation of internal rating systems (Working Paper 14). Bank for International Settlements.
Basel Committee on Banking Supervision. (2006). International convergence of capital measurement and capital standards: A revised framework, comprehensive version [Technical Report]. https://www.bis.org/publ/bcbs128.htm
Basel Committee on Banking Supervision. (2010). Sound practices for backtesting counterparty credit risk models. Bank for International Settlements. https://www.bis.org/publ/bcbs185.htm
Basel Committee on Banking Supervision. (2013). Principles for effective risk data aggregation and risk reporting (BCBS 239). Bank for International Settlements. https://www.bis.org/publ/bcbs239.htm
Basel Committee on Banking Supervision. (2015). Guidance on credit risk and accounting for expected credit losses (BCBS 350). Bank for International Settlements. https://www.bis.org/bcbs/publ/d350.htm
Basel Committee on Banking Supervision. (2016). Guidelines on the supervisory review and evaluation process and pillar 2 capital (BCBS 355). Bank for International Settlements. https://www.bis.org/bcbs/publ/d355.htm
Basel Committee on Banking Supervision. (2017a). Basel III: Finalising post-crisis reforms [Technical Report]. https://www.bis.org/bcbs/publ/d424.htm
Basel Committee on Banking Supervision. (2017b). Guidelines on credit risk and accounting for expected credit losses (BCBS Guidance d350). Bank for International Settlements. https://www.bis.org/bcbs/publ/d350.htm
Basel Committee on Banking Supervision. (2021). Principles for the effective management of third-party risks (revisions in the context of AI use). Bank for International Settlements.
Bastos, J. A. (2010). Forecasting bank loans loss-given-default. Journal of Banking & Finance, 34(10), 2510–2517. https://doi.org/10.1016/j.jbankfin.2010.04.011
Batista, G. E. A. P. A., Prati, R. C., & Monard, M. C. (2004). A study of the behavior of several methods for balancing machine learning training data. ACM SIGKDD Explorations Newsletter, 6(1), 20–29. https://doi.org/10.1145/1007730.1007735
Battiston, S., Puliga, M., Kaushik, R., Tasca, P., & Caldarelli, G. (2012). DebtRank: Too central to fail? Financial networks, the FED and systemic risk. Scientific Reports, 2, 541. https://doi.org/10.1038/srep00541
Baum, L. E., Petrie, T., Soules, G., & Weiss, N. (1970). A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. The Annals of Mathematical Statistics, 41(1), 164–171. https://doi.org/10.1214/aoms/1177697196
Bayer, P., Ferreira, F., & Ross, S. L. (2018). What drives racial and ethnic differences in high-cost mortgages? The role of high-risk lenders. The Review of Financial Studies, 31(1), 175–205. https://doi.org/10.1093/rfs/hhx035
Bazarbash, M. (2019). FinTech in financial inclusion: Machine learning applications in assessing credit risk [IMF Working Paper]. (WP/19/109).
Bazot, G. (2018). Financial consumption and the cost of finance: Measuring financial efficiency in europe (1950–2007). Journal of the European Economic Association, 16(1), 123–160. https://doi.org/10.1093/jeea/jvx008
Beaver, W. H. (1966). Financial ratios as predictors of failure. Journal of Accounting Research, 4, 71–111. https://doi.org/10.2307/2490171
Becker, B., & Milbourn, T. (2011). How did increased competition affect credit ratings? Journal of Financial Economics, 101(3), 493–514. https://doi.org/10.1016/j.jfineco.2011.03.012
Begenau, J., Farboodi, M., & Veldkamp, L. (2018). Big data in finance and the growth of large firms. Journal of Monetary Economics, 97, 71–87. https://doi.org/10.1016/j.jmoneco.2018.05.013
Begley, J., Ming, J., & Watts, S. (1996). Bankruptcy classification errors in the 1980s: An empirical analysis of Altman’s and Ohlson’s models. Review of Accounting Studies, 1(4), 267–284. https://doi.org/10.1007/BF00570833
Begley, T. A., & Purnanandam, A. (2017). Design of financial securities: Empirical evidence from private-label RMBS deals. Review of Financial Studies, 30(1), 120–161. https://doi.org/10.1093/rfs/hhw068
Begley, T. A., & Purnanandam, A. (2021). Color and credit: Race, regulation, and the quality of financial services. Journal of Financial Economics, 141(1), 48–65. https://doi.org/10.1016/j.jfineco.2021.02.013
Behn, M., Haselmann, R., & Vig, V. (2022). The limits of model-based regulation. The Journal of Finance, 77(3), 1635–1684. https://doi.org/10.1111/jofi.13124
Belkin, M., Hsu, D., Ma, S., & Mandal, S. (2019). Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proceedings of the National Academy of Sciences, 116(32), 15849–15854. https://doi.org/10.1073/pnas.1903070116
Belloni, A., Chernozhukov, V., & Hansen, C. (2014). Inference on treatment effects after selection among high-dimensional controls. The Review of Economic Studies, 81(2), 608–650. https://doi.org/10.1093/restud/rdt044
Bellotti, T., & Crook, J. (2009b). Support vector machines for credit scoring and discovery of significant features. Expert Systems with Applications, 36(2), 3302–3308. https://doi.org/10.1016/j.eswa.2008.01.005
Bellotti, T., & Crook, J. (2009a). Support vector machines for credit scoring and discovery of significant features. Expert Systems with Applications, 36(2), 3302–3308. https://doi.org/10.1016/j.eswa.2008.01.005
Bellotti, T., & Crook, J. (2013). Forecasting and stress testing credit card default using dynamic models. International Journal of Forecasting, 29(4), 563–574. https://doi.org/10.1016/j.ijforecast.2013.04.003
Benavoli, A., Corani, G., Demsar, J., & Zaffalon, M. (2017). Time for a change: A tutorial for comparing multiple classifiers through Bayesian analysis. Journal of Machine Learning Research, 18(77), 1–36.
Bengio, Y., Simard, P., & Frasconi, P. (1994). Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks, 5(2), 157–166. https://doi.org/10.1109/72.279181
Benjamini, Y., & Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodological), 57(1), 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
Benmelech, E., & Dlugosz, J. (2009). The alchemy of CDO credit ratings. Journal of Monetary Economics, 56(5), 617–634. https://doi.org/10.1016/j.jmoneco.2009.04.007
Berg, T., Burg, V., Gombović, A., & Puri, M. (2020). On the rise of FinTechs: Credit scoring using digital footprints. The Review of Financial Studies, 33(7), 2845–2897. https://doi.org/10.1093/rfs/hhz099
Berg, T., Puri, M., & Rocholl, J. (2020). Loan officer incentives, internal rating models, and default rates. Review of Finance, 24(3), 529–578. https://doi.org/10.1093/rof/rfz018
Berger, A. N., Miller, N. H., Petersen, M. A., Rajan, R. G., & Stein, J. C. (2005). Does function follow organizational form? Evidence from the lending practices of large and small banks. Journal of Financial Economics, 76(2), 237–269. https://doi.org/10.1016/j.jfineco.2004.06.003
Berger, A. N., & Udell, G. F. (2002). Small business credit availability and relationship lending: The importance of bank organisational structure. Economic Journal, 112(477), F32–F53. https://doi.org/10.1111/1468-0297.00682
Berger, D. W., Milbradt, K., Tourre, F., & Vavra, J. (2021). Mortgage prepayment and path-dependent effects of monetary policy. American Economic Review, 111(9), 2829–2878. https://doi.org/10.1257/aer.20181857
Bergmeir, C., & Benı́tez, J. M. (2012). On the use of cross-validation for time series predictor evaluation. Information Sciences, 191, 192–213. https://doi.org/10.1016/j.ins.2011.12.028
Bergmeir, C., Hyndman, R. J., & Koo, B. (2018). A note on the validity of cross-validation for evaluating autoregressive time series prediction. Computational Statistics & Data Analysis, 120, 70–83. https://doi.org/10.1016/j.csda.2017.11.003
Berka, P. (1999). PKDD’99 discovery challenge financial dataset. University of Economics, Prague.
Berkson, J. (1944). Application of the logistic function to bio-assay. Journal of the American Statistical Association, 39(227), 357–365. https://doi.org/10.1080/01621459.1944.10500699
Berkson, J., & Gage, R. P. (1952). Survival curve for cancer patients following treatment. Journal of the American Statistical Association, 47(259), 501–515. https://doi.org/10.2307/2281318
Bernstein, A., Gustafson, M. T., & Lewis, R. (2019). Disaster on the horizon: The price effect of sea level rise. Journal of Financial Economics, 134(2), 253–272. https://doi.org/10.1016/j.jfineco.2019.03.013
Bertomeu, J., Cheynel, E., Floyd, E., & Pan, W. (2021). Using machine learning to detect misstatements. Review of Accounting Studies, 26(2), 468–519. https://doi.org/10.1007/s11142-020-09563-8
Bertrand, M., Duflo, E., & Mullainathan, S. (2004). How much should we trust differences-in-differences estimates? The Quarterly Journal of Economics, 119(1), 249–275. https://doi.org/10.1162/003355304772839588
Bertrand, M., & Morse, A. (2011). Information disclosure, cognitive biases, and payday borrowing. Journal of Finance, 66(6), 1865–1893. https://doi.org/10.1111/j.1540-6261.2011.01698.x
Bertsimas, D., & Dunn, J. (2017). Optimal classification trees. Machine Learning, 106(7), 1039–1082. https://doi.org/10.1007/s10994-017-5633-9
Beutel, A., Chen, J., Zhao, Z., & Chi, E. H. (2017). Data decisions and theoretical implications when adversarially learning fair representations. FAT/ML Workshop at KDD.
Bharadwaj, P., Jack, W., & Suri, T. (2021). Fintech and household resilience to shocks: Evidence from digital loans in Kenya. Journal of Development Economics, 153, 102697. https://doi.org/10.1016/j.jdeveco.2021.102697
Bharath, S. T., & Shumway, T. (2008). Forecasting default with the Merton distance to default model. Review of Financial Studies, 21(3), 1339–1369. https://doi.org/10.1093/rfs/hhn044
Bhatt, U., Weller, A., & Moura, J. M. F. (2020). Evaluating and aggregating feature-based model explanations. Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI), 3016–3022.
Bhutta, N., & Hizmo, A. (2021). Do minorities pay more for mortgages? The Review of Financial Studies, 34(2), 763–789. https://doi.org/10.1093/rfs/hhaa047
Bhutta, N., Hizmo, A., & Ringo, D. (2022). How much does racial bias affect mortgage lending? Evidence from human and algorithmic credit decisions. Finance and Economics Discussion Series, (2022-067). https://doi.org/10.17016/FEDS.2022.067
Bhutta, N., Skiba, P. M., & Tobacman, J. (2015b). Payday loan choices and consequences. Journal of Money, Credit and Banking, 47(2-3), 223–260. https://doi.org/10.1111/jmcb.12175
Bhutta, N., Skiba, P. M., & Tobacman, J. (2015a). Payday loan choices and consequences. Journal of Money, Credit and Banking, 47(2-3), 223–260. https://doi.org/10.1111/jmcb.12175
Bia, M., Huber, M., & Lafférs, L. (2024). Double machine learning for sample selection models. Journal of Business and Economic Statistics, 42(3), 958–969. https://doi.org/10.1080/07350015.2023.2271071
Biamonte, J., Wittek, P., Pancotti, N., Rebentrost, P., Wiebe, N., & Lloyd, S. (2017). Quantum machine learning. Nature, 549(7671), 195–202. https://doi.org/10.1038/nature23474
Biau, G., & Scornet, E. (2016). A random forest guided tour. TEST, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7
Bica, I., Alaa, A. M., Jordon, J., & Schaar, M. van der. (2020). Estimating counterfactual treatment outcomes over time through adversarially balanced representations. International Conference on Learning Representations (ICLR).
Bickel, P. J., & Levina, E. (2004). Some theory for Fisher’s linear discriminant function, “naive Bayes,” and some alternatives when there are many more variables than observations. Bernoulli, 10(6), 989–1010. https://doi.org/10.3150/bj/1106314847
Bickel, S., Brückner, M., & Scheffer, T. (2009). Discriminative learning under covariate shift. Journal of Machine Learning Research, 10, 2137–2155.
Bierman, H., & Hausman, W. H. (1970). The credit granting decision. Management Science, 16(8), B519–B532. https://doi.org/10.1287/mnsc.16.8.B519
Bifet, A., & Gavalda, R. (2007). Learning from time-changing data with adaptive windowing. Proceedings of the 2007 SIAM International Conference on Data Mining (SDM), 443–448. https://doi.org/10.1137/1.9781611972771.42
Billingsley, P. (1995). Probability and measure (3rd ed.). Wiley.
Björkegren, D., & Grissen, D. (2020). Behavior revealed in mobile phone usage predicts credit repayment. The World Bank Economic Review, 34(3), 618–634. https://doi.org/10.1093/wber/lhz006
Black, F., & Cox, J. C. (1976). Valuing corporate securities: Some effects of bond indenture provisions. The Journal of Finance, 31(2), 351–367. https://doi.org/10.2307/2326607
Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–654. https://doi.org/10.1086/260062
Blanche, P., Dartigues, J.-F., & Jacqmin-Gadda, H. (2013). Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Statistics in Medicine, 32(30), 5381–5397. https://doi.org/10.1002/sim.5958
Blattner, L., & Nelson, S. (2022). How costly is noise? Data and disparities in consumer credit. SSRN Electronic Journal.
Bleier, A., Goldfarb, A., & Tucker, C. (2020). Consumer privacy and the future of data-based innovation and marketing. International Journal of Research in Marketing, 37(3), 466–480. https://doi.org/10.1016/j.ijresmar.2020.03.006
Blinder, A. S. (1973). Wage discrimination: Reduced form and structural estimates. The Journal of Human Resources, 8(4), 436–455. https://doi.org/10.2307/144855
Blume, M. E., Lim, F., & MacKinlay, A. C. (1998). The declining credit quality of U.S. Corporate debt: Myth or reality? Journal of Finance, 53(4), 1389–1413. https://doi.org/10.1111/0022-1082.00057
Blumenstock, J., Cadamuro, G., & On, R. (2015). Predicting poverty and wealth from mobile phone metadata. Science, 350(6264), 1073–1076. https://doi.org/10.1126/science.aac4420
Blundell, R., & Powell, J. L. (2003). Endogeneity in nonparametric and semiparametric regression models. Advances in Economics and Econometrics: Theory and Applications, Eighth World Congress, 2, 312–357.
Board of Governors of the Federal Reserve System. (2007). Report to the congress on credit scoring and its effects on the availability and affordability of credit. Federal Reserve. https://www.federalreserve.gov/boarddocs/rptcongress/creditscore/
Board of Governors of the Federal Reserve System. (2011). Supervisory guidance on model risk management (SR 11-7). Federal Reserve. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
Board of Governors of the Federal Reserve System. (2015a). Federal reserve supervisory assessment of capital planning and positions for large and noncomplex firms (SR 15-19). Federal Reserve. https://www.federalreserve.gov/supervisionreg/srletters/sr1519.htm
Board of Governors of the Federal Reserve System. (2015b). Federal reserve supervisory assessment of capital planning and positions for LISCC firms and large and complex firms (SR 15-18). Federal Reserve. https://www.federalreserve.gov/supervisionreg/srletters/sr1518.htm
Board of Governors of the Federal Reserve System. (2023). 2023 supervisory stress test results. Federal Reserve. https://www.federalreserve.gov/publications/2023-june-dodd-frank-act-stress-test.htm
Board of Governors of the Federal Reserve System and Federal Deposit Insurance Corporation and Office of the Comptroller of the Currency. (2023). Interagency guidance on third-party relationships: Risk management. 88 Federal Register 37920. https://www.federalregister.gov/documents/2023/06/09/2023-12340/
Board of Governors of the Federal Reserve System and Office of the Comptroller of the Currency. (2011a). SR 11-7: Guidance on model risk management. Federal Reserve System. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
Board of Governors of the Federal Reserve System and Office of the Comptroller of the Currency. (2011b). Supervisory guidance on model risk management (SR 11-7 / OCC 2011-12). Federal Reserve Supervision and Regulation Letter SR 11-7.
Board of Governors of the Federal Reserve System and Office of the Comptroller of the Currency. (2011c). Supervisory guidance on model risk management (SR 11-7).
Board of Governors of the Federal Reserve System, & Office of the Comptroller of the Currency. (2011). Supervisory guidance on model risk management (SR 11-7 / OCC 2011-12) (SR 11-7). Board of Governors of the Federal Reserve System. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
Bodnaruk, A., Loughran, T., & McDonald, B. (2015). Using 10-K text to gauge financial constraints. Journal of Financial and Quantitative Analysis, 50(4), 623–646. https://doi.org/10.1017/S0022109015000411
Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5, 135–146. https://doi.org/10.1162/tacl_a_00051
Bolton, P., & Kacperczyk, M. (2021). Do investors care about carbon risk? Journal of Financial Economics, 142(2), 517–549. https://doi.org/10.1016/j.jfineco.2021.05.008
Bolton, R. J., & Hand, D. J. (2002). Statistical fraud detection: A review. Statistical Science, 17(3), 235–249. https://doi.org/10.1214/ss/1042727940
Bonacich, P. (1972). Factoring and weighting approaches to status scores and clique identification. Journal of Mathematical Sociology, 2(1), 113–120. https://doi.org/10.1080/0022250X.1972.9989806
Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H. B., Patel, S., Ramage, D., Segal, A., & Seth, K. (2017). Practical secure aggregation for privacy-preserving machine learning. Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security (CCS), 1175–1191. https://doi.org/10.1145/3133956.3133982
Bonsall, S. B., Koharki, K., & Neamtiu, M. (2017). The disciplining effect of credit default swap trading on the quality of credit rating agencies. Journal of Accounting and Economics, 63(2–3), 182–208. https://doi.org/10.1016/j.jacceco.2016.12.002
Bonvini, M., & Kennedy, E. H. (2022). Sensitivity analysis via the proportion of unmeasured confounding. Journal of the American Statistical Association, 117(539), 1540–1550. https://doi.org/10.1080/01621459.2020.1864382
Boot, A. W. A. (2000). Relationship banking: What do we know? Journal of Financial Intermediation, 9(1), 7–25. https://doi.org/10.1006/jfin.2000.0282
Boot, A., Hoffmann, P., Laeven, L., & Ratnovski, L. (2021). Fintech: What’s old, what’s new? Journal of Financial Stability, 53, 100836. https://doi.org/10.1016/j.jfs.2020.100836
Borisov, V., Leemann, T., Seßler, K., Haug, J., Pawelczyk, M., & Kasneci, G. (2024). Deep neural networks and tabular data: A survey. IEEE Transactions on Neural Networks and Learning Systems, 35(6), 7499–7519. https://doi.org/10.1109/TNNLS.2022.3229161
Borri, N., & Verdelhan, A. (2023). Sovereign risk premia and global macroeconomic conditions. Journal of Financial Economics, 147(1), 172–197. https://doi.org/10.1016/j.jfineco.2022.10.001
Borusyak, K., Jaravel, X., & Spiess, J. (2024). Revisiting event-study designs: Robust and efficient estimation. Review of Economic Studies, 91(6), 3253–3285. https://doi.org/10.1093/restud/rdae007
Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992). A training algorithm for optimal margin classifiers. 144–152. https://doi.org/10.1145/130385.130401
Bowen, D., & Ungar, L. (2020). Generalized SHAP: Generating multiple types of explanations in machine learning. arXiv Preprint arXiv:2006.07155.
Boyd, S., & Vandenberghe, L. (2004). Convex optimization. Cambridge University Press. https://doi.org/10.1017/CBO9780511804441
Bracke, P., Datta, A., Jung, C., & Sen, S. (2019). Machine learning explainability in finance: An application to default risk analysis. Bank of England Staff Working Paper, (816).
Braun, M., & Schweidel, D. A. (2011). Modeling customer lifetimes with multiple causes of churn. Marketing Science, 30(5), 881–902. https://doi.org/10.1287/mksc.1110.0665
Breck, E., Cai, S., Nielsen, E., Salib, M., & Sculley, D. (2017). The ML test score: A rubric for ML production readiness and technical debt reduction. IEEE International Conference on Big Data, 1123–1132. https://doi.org/10.1109/BigData.2017.8258038
Breeden, J. L. (2007a). Modeling data with multiple time dimensions. Computational Statistics & Data Analysis, 51(9), 4761–4785. https://doi.org/10.1016/j.csda.2007.01.023
Breeden, J. L. (2007b). Modeling data with multiple time dimensions. Computational Statistics and Data Analysis, 51(9), 4761–4785. https://doi.org/10.1016/j.csda.2006.07.026
Breeden, J. L. (2020). A survey of machine learning in credit risk. Journal of Credit Risk, 16(1), 1–62.
Breiman, L. (1996a). Bagging predictors. Machine Learning, 24(2), 123–140. https://doi.org/10.1007/BF00058655
Breiman, L. (1996b). Bagging predictors. Machine Learning, 24(2), 123–140. https://doi.org/10.1007/BF00058655
Breiman, L. (1996c). Heuristics of instability and stabilization in model selection. The Annals of Statistics, 24(6), 2350–2383. https://doi.org/10.1214/aos/1032181158
Breiman, L. (1996d). Stacked regressions. Machine Learning, 24(1), 49–64. https://doi.org/10.1007/BF00117832
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. Wadsworth.
Breslow, N. E. (1974). Covariance analysis of censored survival data. Biometrics, 30(1), 89–99. https://doi.org/10.2307/2529620
Brevoort, K. P., Grimm, P., & Kambara, M. (2016). Credit invisibles and the unscored. Cityscape, 18(2), 9–34.
Bricken, T., Templeton, A., Batson, J., Chen, B., Jermyn, A., Conerly, T., Turner, N., Anil, C., Denison, C., Askell, A., et al. (2023). Towards monosemanticity: Decomposing language models with dictionary learning. Transformer Circuits Thread. https://transformer-circuits.pub/2023/monosemantic-features/index.html
Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1), 1–3.
Broder, A. Z. (1997). On the resemblance and containment of documents. 21–29. https://doi.org/10.1109/SEQUEN.1997.666900
Brodersen, K. H., Gallusser, F., Koehler, J., Remy, N., & Scott, S. L. (2015). Inferring causal impact using Bayesian structural time-series models. Annals of Applied Statistics, 9(1), 247–274. https://doi.org/10.1214/14-AOAS788
Bronstein, M. M., Bruna, J., LeCun, Y., Szlam, A., & Vandergheynst, P. (2017). Geometric deep learning: Going beyond Euclidean data. IEEE Signal Processing Magazine, 34(4), 18–42. https://doi.org/10.1109/MSP.2017.2693418
Brown, I., & Mues, C. (2012). An experimental comparison of classification algorithms for imbalanced credit scoring data sets. Expert Systems with Applications, 39(3), 3446–3453. https://doi.org/10.1016/j.eswa.2011.09.033
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A., Ziegler, D. M., Wu, J., Winter, C., … Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems 33 (NeurIPS), 1877–1901.
Buchak, G., Matvos, G., Piskorski, T., & Seru, A. (2018). Fintech, regulatory arbitrage, and the rise of shadow banks. Journal of Financial Economics, 130(3), 453–483. https://doi.org/10.1016/j.jfineco.2018.03.011
Bücker, M., Kampen, M. van, & Krämer, W. (2013). Reject inference in consumer credit scoring with nonignorable missing data. Journal of Banking & Finance, 37(3), 1040–1045. https://doi.org/10.1016/j.jbankfin.2012.11.002
Bühlmann, P., & Hothorn, T. (2007). Boosting algorithms: Regularization, prediction and model fitting. Statistical Science, 22(4), 477–505. https://doi.org/10.1214/07-STS242
Buja, A., & Stuetzle, W. (2006). Observations on bagging. Statistica Sinica, 16(2), 323–351.
Bumacov, V., Ashta, A., & Singh, P. (2014). The use of credit scoring in microfinance institutions and their outreach. Strategic Change, 23(7-8), 401–413. https://doi.org/10.1002/jsc.1985
Bursztyn, L., Fiorin, S., Gottlieb, D., & Kanz, M. (2019). Moral incentives in credit card debt repayment: Evidence from a field experiment. Journal of Political Economy, 127(4), 1641–1683. https://doi.org/10.1086/701605
Bussmann, N., Giudici, P., Marinelli, D., & Papenbrock, J. (2021). Explainable AI in fintech risk management. Frontiers in Artificial Intelligence, 3, 26. https://doi.org/10.3389/frai.2020.00026
Butaru, F., Chen, Q., Clark, B., Das, S., Lo, A. W., & Siddique, A. (2016). Risk and risk management in the credit card industry. Journal of Banking and Finance, 72, 218–239. https://doi.org/10.1016/j.jbankfin.2016.07.015
Buuren, S. van, & Groothuis-Oudshoorn, K. (2011). mice: Multivariate imputation by chained equations in R. Journal of Statistical Software, 45(3), 1–67. https://doi.org/10.18637/jss.v045.i03
Cadena, X., & Schoar, A. (2011). Remembering to pay? Reminders vs. Financial incentives for loan payments (NBER Working Paper 17020). National Bureau of Economic Research. https://doi.org/10.3386/w17020
Calabrese, R. (2014). Downturn loss given default: Mixture distribution estimation. European Journal of Operational Research, 237(1), 271–277. https://doi.org/10.1016/j.ejor.2014.01.043
Calabrese, R., Osmetti, S. A., & Zanin, L. (2024). Sample selection bias in non-traditional lending: A copula-based approach for imbalanced data. Socio-Economic Planning Sciences, 95, 102045. https://doi.org/10.1016/j.seps.2024.102045
Calabrese, R., & Zenga, M. (2010). Bank loan recovery rates: Measuring and nonparametric density estimation. Journal of Banking & Finance, 34(5), 903–911. https://doi.org/10.1016/j.jbankfin.2009.10.001
Callaway, B., & Sant’Anna, P. H. C. (2021). Difference-in-differences with multiple time periods. Journal of Econometrics, 225(2), 200–230. https://doi.org/10.1016/j.jeconom.2020.12.001
Calonico, S., Cattaneo, M. D., & Titiunik, R. (2014). Robust nonparametric confidence intervals for regression-discontinuity designs. Econometrica, 82(6), 2295–2326. https://doi.org/10.3982/ECTA11757
Calzolari, G., & Nardotto, M. (2017). Effective reminders. Management Science, 63(9), 2915–2932. https://doi.org/10.1287/mnsc.2016.2499
Cameron, A. C., Gelbach, J. B., & Miller, D. L. (2008). Bootstrap-based improvements for inference with clustered errors. Review of Economics and Statistics, 90(3), 414–427. https://doi.org/10.1162/rest.90.3.414
Cameron, A. C., & Miller, D. L. (2015). A practitioner’s guide to cluster-robust inference. Journal of Human Resources, 50(2), 317–372. https://doi.org/10.3368/jhr.50.2.317
Campbell, J. L., Chen, H., Dhaliwal, D. S., Lu, H., & Steele, L. B. (2014). The information content of mandatory risk factor disclosures in corporate filings. Review of Accounting Studies, 19(1), 396–455. https://doi.org/10.1007/s11142-013-9258-3
Campbell, J. Y., & Cocco, J. F. (2015). A model of mortgage default. The Journal of Finance, 70(4), 1495–1554. https://doi.org/10.1111/jofi.12252
Campbell, J. Y., Hilscher, J., & Szilagyi, J. (2008). In search of distress risk. The Journal of Finance, 63(6), 2899–2939. https://doi.org/10.1111/j.1540-6261.2008.01416.x
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., & Tzoumas, K. (2015). Apache Flink: Stream and batch processing in a single engine. IEEE Data Engineering Bulletin, 38(4), 28–38.
Card, D., & Krueger, A. B. (1994). Minimum wages and employment: A case study of the fast-food industry in new jersey and pennsylvania. American Economic Review, 84(4), 772–793.
Carlehed, M., & Petrov, A. (2012). A methodology for point-in-time-through-the-cycle probability of default decomposition in risk classification systems. Journal of Risk Model Validation, 6(3), 3–25. https://doi.org/10.21314/JRMV.2012.091
Caruana, R., Lou, Y., Gehrke, J., Koch, P., Sturm, M., & Elhadad, N. (2015). Intelligible models for healthcare: Predicting pneumonia risk and hospital 30-day readmission. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1721–1730. https://doi.org/10.1145/2783258.2788613
Carvalho, V. M., Nirei, M., Saito, Y. U., & Tahbaz-Salehi, A. (2021). Supply chain disruptions: Evidence from the Great East Japan earthquake. Quarterly Journal of Economics, 136(2), 1255–1321. https://doi.org/10.1093/qje/qjaa044
Casella, G., & Berger, R. L. (2002). Statistical inference (2nd ed.). Duxbury.
Castrén, O., Dées, S., & Zaher, F. (2010). Stress-testing euro area corporate default probabilities using a global macroeconomic model. Journal of Financial Stability, 6(2), 64–78. https://doi.org/10.1016/j.jfs.2009.10.002
Cattaneo, M. D., Jansson, M., & Ma, X. (2020). Simple local polynomial density estimators. Journal of the American Statistical Association, 115(531), 1449–1455. https://doi.org/10.1080/01621459.2019.1635480
Cellini, S. R., Ferreira, F., & Rothstein, J. (2010). The value of school facility investments: Evidence from a dynamic regression discontinuity design. Quarterly Journal of Economics, 125(1), 215–261. https://doi.org/10.1162/qjec.2010.125.1.215
Central Bank of Kenya. (2013). Prudential guidelines for institutions licensed under the banking act (CBK/PG/04 risk management). Central Bank of Kenya. https://www.centralbank.go.ke/
Central Bank of Kenya. (2020). Banking (credit reference bureau) regulations. Legal Notice, as amended 2020. https://www.centralbank.go.ke/credit-reference-bureaus/
Central Bank of Kenya. (2022). Digital credit providers regulations, 2022. Central Bank of Kenya. https://www.centralbank.go.ke/digital-credit-providers/
Cerezo, M., Arrasmith, A., Babbush, R., Benjamin, S. C., Endo, S., Fujii, K., McClean, J. R., Mitarai, K., Yuan, X., Cincio, L., & Coles, P. J. (2021). Variational quantum algorithms. Nature Reviews Physics, 3(9), 625–644. https://doi.org/10.1038/s42254-021-00348-9
Cessie, S. le, & Houwelingen, J. C. van. (1992). Ridge estimators in logistic regression. Journal of the Royal Statistical Society. Series C (Applied Statistics), 41(1), 191–201. https://doi.org/10.2307/2347628
CGAP. (2019). Digital credit market monitoring in Tanzania. Consultative Group to Assist the Poor. https://www.cgap.org/research/publication/digital-credit-market-monitoring-tanzania
Chaisemartin, C. de, & D’Haultfœuille, X. (2020). Two-way fixed effects estimators with heterogeneous treatment effects. American Economic Review, 110(9), 2964–2996. https://doi.org/10.1257/aer.20181169
Challu, C., Olivares, K. G., Oreshkin, B. N., Garza Ramirez, F., Mergenthaler-Canseco, M., & Dubrawski, A. (2023). NHITS: Neural hierarchical interpolation for time series forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 6989–6997. https://doi.org/10.1609/aaai.v37i6.25854
Chan, K. C. G., & Yam, S. C. P. (2014). Oracle, multiple robust and multipurpose calibration in a missing response problem. Statistical Science, 29(3), 380–396. https://doi.org/10.1214/14-STS486
Chandrashekaran, M., & Sinha, R. K. (1995). Isolating the determinants of innovativeness: A split-population tobit (SPOT) duration model. Journal of Marketing Research, 32(4), 444–456. https://doi.org/10.1177/002224379503200407
Chang, C.-C., & Lin, C.-J. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2(3), 1–27. https://doi.org/10.1145/1961189.1961199
Chapelle, O., Schölkopf, B., & Zien, A. (2006). Semi-supervised learning. MIT Press.
Chava, S., & Jarrow, R. A. (2004). Bankruptcy prediction with industry effects. Review of Finance, 8(4), 537–569. https://doi.org/10.1093/rof/8.4.537
Chava, S., Paradkar, N., & Zhang, Y. (2021). Winners and losers of marketplace lending: Evidence from borrower credit dynamics. Journal of Financial Economics, 142(3), 1186–1208. https://doi.org/10.1016/j.jfineco.2021.05.027
Chava, S., Stefanescu, C., & Turnbull, S. (2011). Modeling the loss distribution. Management Science, 57(7), 1267–1287. https://doi.org/10.1287/mnsc.1110.1345
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. https://doi.org/10.1613/jair.953
Chefer, H., Gur, S., & Wolf, L. (2021). Transformer interpretability beyond attention visualization. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 782–791. https://doi.org/10.1109/CVPR46437.2021.00084
Chen, C., Li, O., Tao, D., Barnett, A., Rudin, C., & Su, J. K. (2019). This looks like that: Deep learning for interpretable image recognition. Advances in Neural Information Processing Systems 32 (NeurIPS 2019).
Chen, H. (2010). Macroeconomic conditions and the puzzles of credit spreads and capital structure. The Journal of Finance, 65(6), 2171–2212. https://doi.org/10.1111/j.1540-6261.2010.01613.x
Chen, H., Janizek, J. D., Lundberg, S., & Lee, S.-I. (2020). True to the model or true to the data? ICML Workshop on Human Interpretability in Machine Learning.
Chen, L., Jia, N., Jiao, Z., Zhao, H., Cui, R., & Wang, H. (2025). A semi-supervised reject inference framework with hierarchical heterogeneous networks for credit scoring. International Journal of Forecasting, 41(3), 920–939. https://doi.org/10.1016/j.ijforecast.2024.07.011
Chen, M. A., Wu, Q., & Yang, B. (2019). How valuable is FinTech innovation? The Review of Financial Studies, 32(5), 2062–2106. https://doi.org/10.1093/rfs/hhy130
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785
Cheng, D., Tu, Y., Ma, Z., Niu, Z., & Zhang, L. (2019). Risk assessment for networked-guarantee loans using high-order graph attention representation. Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI), 5822–5828. https://doi.org/10.24963/ijcai.2019/807
Cheng, K., Fan, T., Jin, Y., Liu, Y., Chen, T., Papadopoulos, D., & Yang, Q. (2021). SecureBoost: A lossless federated learning framework. IEEE Intelligent Systems, 36, 87–98. https://doi.org/10.1109/MIS.2021.3082561
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., & Robins, J. (2018). Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1), C1–C68. https://doi.org/10.1111/ectj.12097
Chernozhukov, V., Escanciano, J. C., Ichimura, H., Newey, W. K., & Robins, J. M. (2022). Locally robust semiparametric estimation. Econometrica, 90(4), 1501–1535. https://doi.org/10.3982/ECTA16294
Chernozhukov, V., Fernández-Val, I., & Galichon, A. (2010). Quantile and probability curves without crossing. Econometrica, 78(3), 1093–1125. https://doi.org/10.3982/ECTA7880
Chiang, W.-L., Liu, X., Si, S., Li, Y., Bengio, S., & Hsieh, C.-J. (2019). Cluster-GCN: An efficient algorithm for training deep and large graph convolutional networks. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 257–266. https://doi.org/10.1145/3292500.3330925
Chiburis, R. C., Das, J., & Lokshin, M. (2012). A practical comparison of the bivariate probit and linear IV estimators. Economics Letters, 117(3), 762–766. https://doi.org/10.1016/j.econlet.2012.08.037
Chouldechova, A. (2017). Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Big Data, 5(2), 153–163. https://doi.org/10.1089/big.2016.0047
Chow, G. C. (1960). Tests of equality between sets of coefficients in two linear regressions. Econometrica, 28(3), 591–605. https://doi.org/10.2307/1910133
Christen, P. (2012). A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering, 24(9), 1537–1555. https://doi.org/10.1109/TKDE.2011.127
Chung, F. R. K. (1997). Spectral graph theory. CBMS Regional Conference Series in Mathematics, 92.
Ciampi, F. (2015). Corporate governance characteristics and default prediction modeling for small enterprises: An empirical analysis of Italian firms. Journal of Business Research, 68(5), 1012–1025. https://doi.org/10.1016/j.jbusres.2014.10.003
Clark, K., Khandelwal, U., Levy, O., & Manning, C. D. (2019). What does BERT look at? An analysis of BERT’s attention. Proceedings of the 2019 ACL Workshop BlackboxNLP, 276–286. https://doi.org/10.18653/v1/W19-4828
Clayton, D., & Cuzick, J. (1985). Multivariate generalizations of the proportional hazards model. Journal of the Royal Statistical Society. Series A (General), 148(2), 82–117. https://doi.org/10.2307/2981943
Cohen, L., & Frazzini, A. (2008). Economic links and predictable returns. The Journal of Finance, 63(4), 1977–2011. https://doi.org/10.1111/j.1540-6261.2008.01379.x
Cohen, L., Malloy, C., & Nguyen, Q. (2020). Lazy prices. The Journal of Finance, 75(3), 1371–1415. https://doi.org/10.1111/jofi.12885
Collin-Dufresne, P., Goldstein, R. S., & Martin, J. S. (2001). The determinants of credit spread changes. The Journal of Finance, 56(6), 2177–2207. https://doi.org/10.1111/0022-1082.00402
Comisión Nacional Bancaria y de Valores. (2024). Disposiciones de carácter general aplicables a las instituciones de crédito (Circular Única de Bancos). As amended through 2024. https://www.cnbv.gob.mx/
Conley, T. G., Hansen, C. B., & Rossi, P. E. (2012). Plausibly exogenous. Review of Economics and Statistics, 94(1), 260–272. https://doi.org/10.1162/REST_a_00139
Conselho Monetário Nacional. (2017). Resolução no. 4.557: Integrated risk management and capital management structure. Conselho Monetário Nacional, Banco Central do Brasil. https://www.bcb.gov.br/
Consumer Financial Protection Bureau. (2011). Regulation b: Equal credit opportunity act. 12 C.F.R. Part 1002.
Consumer Financial Protection Bureau. (2013a). Equal credit opportunity act (ECOA) examination procedures. CFPB Supervision and Examination Manual. https://www.consumerfinance.gov/compliance/supervision-examinations/
Consumer Financial Protection Bureau. (2013b). Regulation B, 12 CFR § 1002.9: notifications. https://www.consumerfinance.gov/rules-policy/regulations/1002/9/
Consumer Financial Protection Bureau. (2013c). Regulation B: Equal credit opportunity (12 CFR part 1002). https://www.consumerfinance.gov/rules-policy/regulations/1002/
Consumer Financial Protection Bureau. (2014). Using publicly available information to proxy for unidentified race and ethnicity: A methodology and assessment. CFPB Research Report. https://www.consumerfinance.gov/data-research/research-reports/
Consumer Financial Protection Bureau. (2017). List of consumer reporting companies. CFPB. https://www.consumerfinance.gov/consumer-tools/credit-reports-and-scores/consumer-reporting-companies/
Consumer Financial Protection Bureau. (2022b). Circular 2022-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms. CFPB. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/
Consumer Financial Protection Bureau. (2022a). Circular 2022-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms. U.S. Consumer Financial Protection Bureau. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/
Consumer Financial Protection Bureau. (2022f). Consumer financial protection circular 2022-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/
Consumer Financial Protection Bureau. (2022d). Consumer financial protection circular 2022-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms [Circular]. CFPB.
Consumer Financial Protection Bureau. (2022e). Consumer financial protection circular 2022-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms. Consumer Financial Protection Bureau.
Consumer Financial Protection Bureau. (2022c). Consumer financial protection circular 2022-03: Adverse action notification requirements in connection with credit decisions based on complex algorithms. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03/
Consumer Financial Protection Bureau. (2023a). Chatbots in consumer finance. CFPB. https://www.consumerfinance.gov/data-research/research-reports/chatbots-in-consumer-finance/
Consumer Financial Protection Bureau. (2023b). Consumer financial protection circular 2023-03: Adverse action notification requirements and the proper use of the CFPB’s sample forms provided in regulation B. https://www.consumerfinance.gov/compliance/circulars/circular-2023-03-adverse-action-notification-requirements-and-the-proper-use-of-the-cfpbs-sample-forms-provided-in-regulation-b/
Consumer Financial Protection Bureau. (2024a). Home mortgage disclosure act (HMDA) public loan/application register. FFIEC and CFPB Public Data Platform.
Consumer Financial Protection Bureau. (2024b). Required rulemaking on personal financial data rights (section 1033) [Final Rule, 12 CFR Part 1033]. https://www.consumerfinance.gov/rules-policy/final-rules/personal-financial-data-rights/
Cont, R., Moussa, A., & Santos, E. B. (2013). Network structure and systemic risk in banking systems. Handbook on Systemic Risk, 327–368. https://doi.org/10.1017/CBO9781139151184.018
Copas, J. B., & Li, H. G. (1997). Inference for non-random samples. Journal of the Royal Statistical Society. Series B (Methodological), 59(1), 55–95. https://doi.org/10.1111/1467-9868.00055
Corbett-Davies, S., Gaebler, J. D., Nilforoshan, H., Shroff, R., & Goel, S. (2023). The measure and mismeasure of fairness. Journal of Machine Learning Research, 24(312), 1–117.
Corbett-Davies, S., Pierson, E., Feller, A., Goel, S., & Huq, A. (2017). Algorithmic decision making and the cost of fairness. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 797–806. https://doi.org/10.1145/3097983.3098095
Corcoran, A. W. (1978). The use of exponentially-smoothed transition matrices to improve forecasting of cash flows from accounts receivable. Management Science, 24(7), 732–739. https://doi.org/10.1287/mnsc.24.7.732
Cornaggia, J., & Cornaggia, K. J. (2013). Estimating the costs of issuer-paid credit ratings. Review of Financial Studies, 26(9), 2229–2269. https://doi.org/10.1093/rfs/hht041
Cornelli, G., Frost, J., Gambacorta, L., Rau, P. R., Wardrop, R., & Ziegler, T. (2023a). Fintech and big tech credit: Drivers of the growth of digital lending. Journal of Banking and Finance, 148, 106742. https://doi.org/10.1016/j.jbankfin.2022.106742
Cornelli, G., Frost, J., Gambacorta, L., Rau, P. R., Wardrop, R., & Ziegler, T. (2023b). Fintech and big tech credit: Drivers of the growth of digital lending (BIS Working Paper 1028). Bank for International Settlements. https://www.bis.org/publ/work1028.htm
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297. https://doi.org/10.1007/BF00994018
Costello, A. M., Down, A. K., & Mehta, M. N. (2020). Machine + man: A field experiment on the role of discretion in augmenting AI-based lending models. Journal of Accounting and Economics, 70(2–3), 101360. https://doi.org/10.1016/j.jacceco.2020.101360
Cover, T. M., & Thomas, J. A. (2006a). Elements of information theory.
Cover, T. M., & Thomas, J. A. (2006b). Elements of information theory. Wiley Series in Telecommunications and Signal Processing, 2nd Ed. https://doi.org/10.1002/047174882X
Covert, I., Lundberg, S. M., & Lee, S.-I. (2020). Understanding global feature contributions with additive importance measures. Advances in Neural Information Processing Systems 33 (NeurIPS 2020).
Covert, I., Lundberg, S. M., & Lee, S.-I. (2021). Explaining by removing: A unified framework for model explanation. Journal of Machine Learning Research, 22(209), 1–90.
Cox, D. R. (1958). The regression analysis of binary sequences. Journal of the Royal Statistical Society. Series B (Methodological), 20(2), 215–242.
Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society. Series B (Methodological), 34(2), 187–220.
Cox, D. R. (1975). Partial likelihood. Biometrika, 62(2), 269–276. https://doi.org/10.1093/biomet/62.2.269
Crankshaw, D., Wang, X., Zhou, G., Franklin, M. J., Gonzalez, J. E., & Stoica, I. (2017). Clipper: A low-latency online prediction serving system. USENIX Symposium on Networked Systems Design and Implementation (NSDI), 613–627.
Crawford, G. S., Pavanini, N., & Schivardi, F. (2018). Asymmetric information and imperfect competition in lending markets. American Economic Review, 108(7), 1659–1701. https://doi.org/10.1257/aer.20150487
Credit Fusion, & Will Cukierski. (2011). Give me some credit. Kaggle Competition.
Credit Information Center of Vietnam. (2023). Annual report on credit information activities. CIC, State Bank of Vietnam. https://cic.gov.vn/
Crook, J. N., & Banasik, J. (2004). Does reject inference really improve the performance of application scoring models? Journal of Banking & Finance, 28(4), 857–874. https://doi.org/10.1016/j.jbankfin.2003.10.010
Crook, J. N., & Bellotti, T. (2010). Time varying and dynamic models for default risk in consumer loans. Journal of the Royal Statistical Society: Series A, 173(2), 283–305. https://doi.org/10.1111/j.1467-985X.2009.00617.x
Crook, J. N., Edelman, D. B., & Thomas, L. C. (2007). Recent developments in consumer credit risk assessment. European Journal of Operational Research, 183(3), 1447–1465. https://doi.org/10.1016/j.ejor.2006.09.100
Crouhy, M., Galai, D., & Mark, R. (2001). Prototype risk rating system. Journal of Banking & Finance, 25(1), 47–95. https://doi.org/10.1016/S0378-4266(00)00117-5
Cybenko, G. (1989). Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2(4), 303–314. https://doi.org/10.1007/BF02551274
Cyert, R. M., Davidson, H. J., & Thompson, G. L. (1962). Estimation of the allowance for doubtful accounts by Markov chains. Management Science, 8(3), 287–303. https://doi.org/10.1287/mnsc.8.3.287
D’Haultfoeuille, X. (2010). A new instrumental method for dealing with endogenous selection. Journal of Econometrics, 154(1), 1–15. https://doi.org/10.1016/j.jeconom.2009.06.003
Dal Pozzolo, A., Caelen, O., Johnson, R. A., & Bontempi, G. (2015). Calibrating probability with undersampling for unbalanced classification. 159–166. https://doi.org/10.1109/SSCI.2015.33
Daniel, K., Titman, S., & Wei, K. J. (2001). Explaining the cross-section of stock returns in japan: Factors or characteristics? The Journal of Finance, 56(2), 743–766.
Daniels, M. J., & Hogan, J. W. (2008). Missing data in longitudinal studies: Strategies for bayesian modeling and sensitivity analysis. Chapman; Hall/CRC. https://doi.org/10.1201/9781420011180
Das, S. R., & Chen, M. Y. (2007). Yahoo! For Amazon: Sentiment extraction from small talk on the web. Management Science, 53(9), 1375–1388. https://doi.org/10.1287/mnsc.1070.0704
Das, S. R., Duffie, D., Kapadia, N., & Saita, L. (2007). Common failings: How corporate defaults are correlated. Journal of Finance, 62(1), 93–117. https://doi.org/10.1111/j.1540-6261.2007.01202.x
Dastile, X., Celik, T., & Potsane, M. (2020). Statistical and machine learning models in credit scoring: A systematic literature survey. Applied Soft Computing, 91, 106263. https://doi.org/10.1016/j.asoc.2020.106263
Davis, J., & Goadrich, M. (2006). The relationship between precision-recall and ROC curves. 233–240. https://doi.org/10.1145/1143844.1143874
Dawid, A. P. (1982). The well-calibrated bayesian. Journal of the American Statistical Association, 77(379), 605–610. https://doi.org/10.2307/2287720
Defferrard, M., Bresson, X., & Vandergheynst, P. (2016). Convolutional neural networks on graphs with fast localized spectral filtering. Advances in Neural Information Processing Systems 29 (NIPS 2016).
DeFusco, A. A., & Paciorek, A. (2017). The interest rate elasticity of mortgage demand: Evidence from bunching at the conforming loan limit. American Economic Journal: Economic Policy, 9(1), 210–240. https://doi.org/10.1257/pol.20140108
DeGroot, M. H., & Fienberg, S. E. (1983). The comparison and evaluation of forecasters. The Statistician, 32(1/2), 12–22. https://doi.org/10.2307/2987588
DellaVigna, S., & Linos, E. (2022). RCTs to scale: Comprehensive evidence from two nudge units. Econometrica, 90(1), 81–116. https://doi.org/10.3982/ECTA18709
DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988a). Comparing the areas under two or more correlated receiver operating characteristic curves. Biometrics, 44(3), 837–845. https://doi.org/10.2307/2531595
DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988b). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics, 44(3), 837–845. https://doi.org/10.2307/2531595
Demarta, S., & McNeil, A. J. (2005). The t copula and related copulas. International Statistical Review, 73(1), 111–129. https://doi.org/10.1111/j.1751-5823.2005.tb00254.x
Demirgüç-Kunt, A., Klapper, L., Singer, D., & Ansar, S. (2022). The global findex database 2021: Financial inclusion, digital payments, and resilience in the age of COVID-19. https://www.worldbank.org/en/publication/globalfindex
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1), 1–38.
Demšar, J. (2006). Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research, 7, 1–30.
Demyanyk, Y., & Van Hemert, O. (2011). Understanding the subprime mortgage crisis. The Review of Financial Studies, 24(6), 1848–1880. https://doi.org/10.1093/rfs/hhp033
Deng, Y., Quigley, J. M., & Van Order, R. (2000). Mortgage terminations, heterogeneity and the exercise of mortgage options. Econometrica, 68(2), 275–307. https://doi.org/10.1111/1468-0262.00110
Dettmers, T., Pagnoni, A., Holtzman, A., & Zettlemoyer, L. (2023). QLoRA: Efficient finetuning of quantized LLMs. Advances in Neural Information Processing Systems 36 (NeurIPS), 10088–10115.
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 4171–4186. https://doi.org/10.18653/v1/N19-1423
DeYoung, R., Glennon, D., & Nigro, P. (2008). Borrower-lender distance, credit scoring, and loan performance: Evidence from informational-opaque small business borrowers. Journal of Financial Intermediation, 17(1), 113–143. https://doi.org/10.1016/j.jfi.2007.07.002
Dhurandhar, A., Chen, P.-Y., Luss, R., Tu, C.-C., Ting, P., Shanmugam, K., & Das, P. (2018). Explanations based on the missing: Towards contrastive explanations with pertinent negatives. Advances in Neural Information Processing Systems 31 (NeurIPS 2018).
Diamond, D. W. (1984). Financial intermediation and delegated monitoring. The Review of Economic Studies, 51(3), 393–414. https://doi.org/10.2307/2297430
Diamond, D. W. (1991). Monitoring and reputation: The choice between bank loans and directly placed debt. Journal of Political Economy, 99(4), 689–721. https://doi.org/10.1086/261775
Dietterich, T. G. (1998). Approximate statistical tests for comparing supervised classification learning algorithms. Neural Computation, 10(7), 1895–1923. https://doi.org/10.1162/089976698300017197
Dirick, L., Claeskens, G., & Baesens, B. (2017). Time to default in credit scoring using survival analysis: A benchmark study. Journal of the Operational Research Society, 68(6), 652–665. https://doi.org/10.1057/s41274-016-0128-9
Djeundje, V. B., & Crook, J. (2018). Dynamic survival models with varying coefficients for credit risks. International Journal of Forecasting, 34(4), 636–649. https://doi.org/10.1016/j.ijforecast.2018.04.006
Dobbie, W., Liberman, A., Paravisini, D., & Pathania, V. (2021). Measuring bias in consumer lending. Review of Economic Studies, 88(6), 2799–2832. https://doi.org/10.1093/restud/rdaa078
Dobbie, W., & Song, J. (2015). Debt relief and debtor outcomes: Measuring the effects of consumer bankruptcy protection. American Economic Review, 105(3), 1272–1311. https://doi.org/10.1257/aer.20130612
Doerr, S., Frost, J., Gambacorta, L., & Qiu, H. (2022). Fintech and the digital transformation of financial services. BIS Working Papers, (1008). https://www.bis.org/publ/work1008.htm
Dorfleitner, G., Priberny, C., Schuster, S., Stoiber, J., Weber, M., Castro, I. de, & Kammler, J. (2016). Description-text related soft information in peer-to-peer lending: Evidence from two leading European platforms. Journal of Banking and Finance, 64, 169–187. https://doi.org/10.1016/j.jbankfin.2015.11.009
Doshi-Velez, F., & Kim, B. (2017). Towards a rigorous science of interpretable machine learning. arXiv Preprint arXiv:1702.08608.
Drineas, P., & Mahoney, M. W. (2005). On the Nyström method for approximating a Gram matrix for improved kernel-based learning. Journal of Machine Learning Research, 6, 2153–2175.
Drummond, C., & Holte, R. C. (2003). C4.5, class imbalance, and cost sensitivity: Why under-sampling beats over-sampling.
Drummond, C., & Holte, R. C. (2006). Cost curves: An improved method for visualizing classifier performance. Machine Learning, 65(1), 95–130. https://doi.org/10.1007/s10994-006-8199-5
Druz, M., Petzev, I., Wagner, A. F., & Zeckhauser, R. J. (2020). When managers change their tone, analysts and investors change their tune. Financial Analysts Journal, 76(2), 47–69. https://doi.org/10.1080/0015198X.2019.1707592
Duan, J.-C. (1994). Maximum likelihood estimation using price data of the derivative contract. Mathematical Finance, 4(2), 155–167. https://doi.org/10.1111/j.1467-9965.1994.tb00055.x
Duan, J.-C., Gauthier, G., & Simonato, J.-G. (2004). On the equivalence of the KMV and maximum likelihood methods for structural credit risk models. Finance Research Letters, 1(3), 167–181. https://doi.org/10.1016/j.frl.2004.04.003
Duan, J.-C., Sun, J., & Wang, T. (2012). Multiperiod corporate default prediction: A forward intensity approach. Journal of Econometrics, 170(1), 191–209. https://doi.org/10.1016/j.jeconom.2012.05.002
Duarte, J., Siegel, S., & Young, L. (2012). Trust and credit: The role of appearance in peer-to-peer lending. The Review of Financial Studies, 25(8), 2455–2484. https://doi.org/10.1093/rfs/hhs071
Duffie, D., Eckner, A., Horel, G., & Saita, L. (2009b). Frailty correlated default. The Journal of Finance, 64(5), 2089–2123. https://doi.org/10.1111/j.1540-6261.2009.01495.x
Duffie, D., Eckner, A., Horel, G., & Saita, L. (2009a). Frailty correlated default. The Journal of Finance, 64(5), 2089–2123. https://doi.org/10.1111/j.1540-6261.2009.01495.x
Duffie, D., & Lando, D. (2001). Term structures of credit spreads with incomplete accounting information. Econometrica, 69(3), 633–664. https://doi.org/10.1111/1468-0262.00208
Duffie, D., Saita, L., & Wang, K. (2007). Multi-period corporate default prediction with stochastic covariates. Journal of Financial Economics, 83(3), 635–665. https://doi.org/10.1016/j.jfineco.2005.10.011
Duffie, D., & Singleton, K. J. (1999a). Modeling term structures of defaultable bonds. The Review of Financial Studies, 12(4), 687–720. https://doi.org/10.1093/rfs/12.4.687
Duffie, D., & Singleton, K. J. (1999b). Modeling term structures of defaultable bonds. The Review of Financial Studies, 12(4), 687–720. https://doi.org/10.1093/rfs/12.4.687
Dumitrescu, E., Hué, S., Hurlin, C., & Tokpavi, S. (2022). Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. European Journal of Operational Research, 297(3), 1178–1192. https://doi.org/10.1016/j.ejor.2021.06.053
Durand, D. (1941). Risk elements in consumer instalment financing [NBER Studies in Consumer Instalment Financing]. (8). https://www.nber.org/books-and-chapters/risk-elements-consumer-instalment-financing
Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012). Fairness through awareness. Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, 214–226. https://doi.org/10.1145/2090236.2090255
Dwork, C., McSherry, F., Nissim, K., & Smith, A. (2006). Calibrating noise to sensitivity in private data analysis. Proceedings of the Third Conference on Theory of Cryptography (TCC), 265–284. https://doi.org/10.1007/11681878_14
Dwork, C., & Roth, A. (2014). The algorithmic foundations of differential privacy. Foundations and Trends in Theoretical Computer Science, 9(3-4), 211–407. https://doi.org/10.1561/0400000042
Dyer, T., Lang, M., & Stice-Lawrence, L. (2017). The evolution of 10-K textual disclosure: Evidence from Latent Dirichlet Allocation. Journal of Accounting and Economics, 64(2–3), 221–245. https://doi.org/10.1016/j.jacceco.2017.07.002
Eagle, N., Macy, M., & Claxton, R. (2010). Network diversity and economic development. Science, 328(5981), 1029–1031. https://doi.org/10.1126/science.1186605
Edelberg, W. (2006). Risk-based pricing of interest rates for consumer loans. Journal of Monetary Economics, 53(8), 2283–2298. https://doi.org/10.1016/j.jmoneco.2005.10.018
Efron, B. (1975). The efficiency of logistic regression compared to normal discriminant analysis. Journal of the American Statistical Association, 70(352), 892–898. https://doi.org/10.2307/2285453
Efron, B. (1977). The efficiency of cox’s likelihood function for censored data. Journal of the American Statistical Association, 72(359), 557–565. https://doi.org/10.2307/2286217
Efron, B. (1979). Bootstrap methods: Another look at the jackknife. The Annals of Statistics, 7(1), 1–26. https://doi.org/10.1214/aos/1176344552
Efron, B. (1987). Better bootstrap confidence intervals. Journal of the American Statistical Association, 82(397), 171–185. https://doi.org/10.2307/2289144
Efron, B., & Petrosian, V. (1999). Nonparametric methods for doubly truncated data. Journal of the American Statistical Association, 94(447), 824–834. https://doi.org/10.1080/01621459.1999.10474187
Efron, B., & Tibshirani, R. J. (1993). An introduction to the bootstrap. Chapman; Hall/CRC. https://doi.org/10.1201/9780429246593
Efron, B., & Tibshirani, R. J. (1994). An introduction to the bootstrap. Chapman; Hall/CRC. https://doi.org/10.1201/9780429246593
Egger, D. J., Gambella, C., Marecek, J., McFaddin, S., Mevissen, M., Raymond, R., Simonetto, A., Woerner, S., & Yndurain, E. (2020). Quantum computing for finance: State-of-the-art and future prospects. IEEE Transactions on Quantum Engineering, 1, 1–24. https://doi.org/10.1109/TQE.2020.3030314
Einav, L., Jenkins, M., & Levin, J. (2012). Contract pricing in consumer credit markets. Econometrica, 80(4), 1387–1432. https://doi.org/10.3982/ECTA7677
Einav, L., Jenkins, M., & Levin, J. (2013). The impact of credit scoring on consumer lending. The RAND Journal of Economics, 44(2), 249–274. https://doi.org/10.1111/1756-2171.12019
Eisenberg, L., & Noe, T. H. (2001). Systemic risk in financial systems. Management Science, 47(2), 236–249. https://doi.org/10.1287/mnsc.47.2.236.9835
Elhage, N., Nanda, N., Olsson, C., Henighan, T., Joseph, N., Mann, B., Askell, A., Bai, Y., Chen, A., Conerly, T., DasSarma, N., Drain, D., Ganguli, D., Hatfield-Dodds, Z., Hernandez, D., Jones, A., Kernion, J., Lovitt, L., Ndousse, K., … Olah, C. (2021). A mathematical framework for transformer circuits. Transformer Circuits Thread. https://transformer-circuits.pub/2021/framework/index.html
Elkan, C. (2001). The foundations of cost-sensitive learning. 973–978.
Elkan, C. (2008). The foundations of cost-sensitive learning: An overview. Invited Survey, UCSD Technical Report.
Elliott, M. N., Morrison, P. A., Fremont, A., McCaffrey, D. F., Pantoja, P., & Lurie, N. (2009). Using the census bureau’s surname list to improve estimates of race/ethnicity and associated disparities. Health Services and Outcomes Research Methodology, 9(2), 69–83.
Elliott, M., Golub, B., & Jackson, M. O. (2014). Financial networks and contagion. American Economic Review, 104(10), 3115–3153. https://doi.org/10.1257/aer.104.10.3115
Embrechts, P., McNeil, A. J., & Straumann, D. (2002). Correlation and dependence in risk management: Properties and pitfalls. 176–223. https://doi.org/10.1017/CBO9780511615337.008
Eom, Y. H., Helwege, J., & Huang, J.-Z. (2004). Structural models of corporate bond pricing: An empirical analysis. The Review of Financial Studies, 17(2), 499–544. https://doi.org/10.1093/rfs/hhg053
Equal Employment Opportunity Commission and others. (1978). Uniform guidelines on employee selection procedures. 29 C.F.R. Part 1607.
European Banking Authority. (2017a). Guidelines on credit institutions’ credit risk management practices and accounting for expected credit losses (EBA/GL/2017/06). European Banking Authority. https://www.eba.europa.eu/regulation-and-policy/accounting-and-auditing/guidelines-on-credit-institutions-credit-risk-management-practices-and-accounting-for-expected-credit-losses
European Banking Authority. (2017b). Guidelines on PD estimation, LGD estimation and the treatment of defaulted exposures (EBA/GL/2017/16). European Banking Authority. https://www.eba.europa.eu/sites/default/files/documents/10180/2033363/6b062012-45d6-4655-af04-801d26493ed0/Guidelines\%20on\%20PD\%20and\%20LGD\%20estimation\%20\%28EBA-GL-2017-16\%29.pdf
European Banking Authority. (2017d). Guidelines on PD estimation, LGD estimation and the treatment of defaulted exposures (EBA/GL/2017/16).
European Banking Authority. (2017c). Guidelines on PD estimation, LGD estimation and the treatment of defaulted exposures (EBA/GL/2017/16). European Banking Authority. https://www.eba.europa.eu/regulation-and-policy/credit-risk/guidelines-on-pd-estimation-lgd-estimation-and-treatment-of-defaulted-assets
European Banking Authority. (2019). Guidelines for the estimation of LGD appropriate for an economic downturn (EBA/GL/2019/03). European Banking Authority. https://www.eba.europa.eu/sites/default/files/documents/10180/2551996/Final\%20Report\%20on\%20Guidelines\%20on\%20the\%20estimation\%20of\%20LGD\%20appropriate\%20for\%20an\%20economic\%20downturn.pdf
European Banking Authority. (2021). Report on machine learning for IRB models. European Banking Authority. https://www.eba.europa.eu/sites/default/files/document_library/Publications/Discussions/2021/Discussion\%20on\%20machine\%20learning\%20for\%20IRB\%20models/1023883/Discussion\%20paper\%20on\%20machine\%20learning\%20for\%20IRB\%20models.pdf
European Banking Authority. (2022). Report on the 2022 review of the IRB approach (regulatory products). European Banking Authority.
European Banking Authority. (2023a). 2023 EU-wide stress test results. European Banking Authority. https://www.eba.europa.eu/risk-and-data-analysis/risk-analysis/eu-wide-stress-testing
European Banking Authority. (2023b). Follow-up report on the use of machine learning for IRB models. EBA.
European Central Bank. (2019a). ECB guide to internal models (TRIM). European Central Bank. https://www.bankingsupervision.europa.eu/ecb/pub/pdf/ssm.guidetointernalmodels_consolidated_201910.en.pdf
European Central Bank. (2019b). Guide to internal models: Credit risk. European Central Bank. https://www.bankingsupervision.europa.eu/ecb/pub/pdf/ssm.guidetointernalmodels_consolidated_201910.en.pdf
European Central Bank. (2024). Supervisory expectations on the use of artificial intelligence and machine learning in internal models. European Central Bank.
European Data Protection Board. (2022). Guidelines 04/2022 on the calculation of administrative fines under the GDPR. https://edpb.europa.eu/
European Parliament and Council. (2016a). Regulation (EU) 2016/679 (GDPR). https://eur-lex.europa.eu/eli/reg/2016/679/oj
European Parliament and Council. (2016b). Regulation (EU) 2016/679 (general data protection regulation). Official Journal of the European Union L 119/1.
European Parliament and Council. (2024a). Regulation (EU) 2024/1689 laying down harmonised rules on artificial intelligence (artificial intelligence act). Official Journal of the European Union. https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council. (2024b). Regulation (EU) 2024/1689 laying down harmonised rules on artificial intelligence (EU AI act).
European Parliament and Council. (2024c). Regulation (EU) 2024/1689 laying down harmonised rules on artificial intelligence (EU AI Act). Official Journal of the European Union. https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council. (2024d). Regulation (EU) 2024/1689 of 13 June 2024 laying down harmonised rules on artificial intelligence (artificial intelligence act). https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council. (2024e). Regulation (EU) 2024/1689 on artificial intelligence (EU AI act). https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council of the European Union. (2015). Directive (EU) 2015/2366 on payment services in the internal market (PSD2). Official Journal of the European Union. https://eur-lex.europa.eu/eli/dir/2015/2366/oj
Fader, P. S., & Hardie, B. G. S. (2007). How to project customer retention. Journal of Interactive Marketing, 21(1), 76–90. https://doi.org/10.1002/dir.20074
Fader, P. S., & Hardie, B. G. S. (2010). Customer-base valuation in a contractual setting: The perils of ignoring heterogeneity. Marketing Science, 29(1), 85–93. https://doi.org/10.1287/mksc.1090.0507
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., & Lin, C.-J. (2008). LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 9, 1871–1874.
Farewell, V. T. (1982). The use of mixture models for the analysis of survival data with long-term survivors. Biometrics, 38(4), 1041–1046. https://doi.org/10.2307/2529885
Fayyad, U. M., & Irani, K. B. (1993). Multi-interval discretization of continuous-valued attributes for classification learning. 1022–1027.
Fedaseyeu, V. (2020). Debt collection agencies and the supply of consumer credit. Journal of Financial Economics, 138(1), 193–221. https://doi.org/10.1016/j.jfineco.2020.04.009
Federal Home Loan Mortgage Corporation. (2024). Single-family loan-level dataset. Freddie Mac Public Data Release.
Federal Housing Finance Agency. (2023). Fannie Mae and Freddie Mac public single-family loan-level datasets. Federal Housing Finance Agency. https://www.fhfa.gov/DataTools/Downloads
Federal National Mortgage Association. (2024). Single-family loan performance data. Fannie Mae Data Dynamics.
Federal Republic of Brazil. (2011). Lei no. 12.414: Cadastro positivo. Federal Law, as amended by Complementary Law 166/2019. https://www.planalto.gov.br/ccivil_03/_ato2011-2014/2011/lei/l12414.htm
Federal Republic of Brazil. (2018). Lei geral de protecao de dados pessoais (LGPD), federal law no. 13,709. Presidency of the Republic. https://www.gov.br/cidadania/pt-br/acesso-a-informacao/lgpd
Federal Trade Commission. (2024). Operation AI comply: Actions against deceptive AI claims. FTC.
Feelders, A., & Pardoel, M. (2003). Pruning for monotone classification trees. Lecture Notes in Computer Science, 2810, 1–12. https://doi.org/10.1007/978-3-540-45231-7_1
Feldman, M., Friedler, S. A., Moeller, J., Scheidegger, C., & Venkatasubramanian, S. (2015). Certifying and removing disparate impact. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 259–268. https://doi.org/10.1145/2783258.2783311
Fellegi, I. P., & Sunter, A. B. (1969). A theory for record linkage. Journal of the American Statistical Association, 64(328), 1183–1210. https://doi.org/10.1080/01621459.1969.10501049
Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems? Journal of Machine Learning Research, 15, 3133–3181.
Figlewski, S., Frydman, H., & Liang, W. (2012). Modeling the effect of macroeconomic factors on corporate default and credit rating transitions. International Review of Economics and Finance, 21(1), 87–105. https://doi.org/10.1016/j.iref.2011.05.004
Financial Accounting Standards Board. (2016). Financial instruments - credit losses (topic 326). FASB.
Financial Conduct Authority. (2023). Recommendations for the next phase of open banking in the UK. Financial Conduct Authority. https://www.fca.org.uk/publications/corporate-documents/recommendations-next-phase-open-banking-uk
Fine, J. P., & Gray, R. J. (1999). A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association, 94(446), 496–509. https://doi.org/10.1080/01621459.1999.10474144
Finlay, S. (2011). Multiple classifier architectures and their application to credit risk assessment. European Journal of Operational Research, 210(2), 368–378. https://doi.org/10.1016/j.ejor.2010.09.029
Firth, D. (1993). Bias reduction of maximum likelihood estimates. Biometrika, 80(1), 27–38. https://doi.org/10.1093/biomet/80.1.27
Fisher, A., Rudin, C., & Dominici, F. (2019). All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. Journal of Machine Learning Research, 20(177), 1–81.
Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2), 179–188. https://doi.org/10.1111/j.1469-1809.1936.tb02137.x
Flesch, R. (1948). A new readability yardstick. Journal of Applied Psychology, 32(3), 221–233. https://doi.org/10.1037/h0057532
Fok, D., Paap, R., & Franses, P. H. (2012). Modeling dynamic effects of promotion on interpurchase times. Computational Statistics and Data Analysis, 56(11), 3055–3069. https://doi.org/10.1016/j.csda.2011.02.004
Foote, C. L., Gerardi, K., & Willen, P. S. (2008). Negative equity and foreclosure: Theory and evidence. Journal of Urban Economics, 64(2), 234–245. https://doi.org/10.1016/j.jue.2008.07.006
Fortin, N., Lemieux, T., & Firpo, S. (2011). Decomposition methods in economics. Handbook of Labor Economics, 4A, 1–102. https://doi.org/10.1016/S0169-7218(11)00407-2
Frame, W. S., Srinivasan, A., & Woosley, L. (2001). The effect of credit scoring on small-business lending. Journal of Money, Credit and Banking, 33(3), 813–825. https://doi.org/10.2307/2673896
Franks, J., Serrano-Velarde, N., & Sussman, O. (2021). Marketplace lending, information aggregation, and liquidity. The Review of Financial Studies, 34(5), 2318–2361. https://doi.org/10.1093/rfs/hhaa101
Fredrikson, M., Jha, S., & Ristenpart, T. (2015). Model inversion attacks that exploit confidence information and basic countermeasures. Proceedings of the 22nd ACM SIGSAC Conference on Computer and Communications Security (CCS), 1322–1333. https://doi.org/10.1145/2810103.2813677
Freedman, S., & Jin, G. Z. (2017). The information value of online social networks: Lessons from peer-to-peer lending. International Journal of Industrial Organization, 51, 185–222. https://doi.org/10.1016/j.ijindorg.2016.09.002
Freeman, L. C. (1977). A set of measures of centrality based on betweenness. Sociometry, 40(1), 35–41. https://doi.org/10.2307/3033543
Freixas, X., Parigi, B. M., & Rochet, J.-C. (2000). Systemic risk, interbank relations, and liquidity provision by the central bank. Journal of Money, Credit and Banking, 32(3), 611–638. https://doi.org/10.2307/2601198
Freund, Y., & Schapire, R. E. (1997a). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139. https://doi.org/10.1006/jcss.1997.1504
Freund, Y., & Schapire, R. E. (1997b). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139. https://doi.org/10.1006/jcss.1997.1504
Friedman, J. H. (1989). Regularized discriminant analysis. Journal of the American Statistical Association, 84(405), 165–175. https://doi.org/10.2307/2289860
Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. https://doi.org/10.1214/aos/1013203451
Friedman, J. H. (2002). Stochastic gradient boosting. Computational Statistics and Data Analysis, 38(4), 367–378. https://doi.org/10.1016/S0167-9473(01)00065-2
Friedman, J. H., Hastie, T., & Tibshirani, R. (2000). Additive logistic regression: A statistical view of boosting. The Annals of Statistics, 28(2), 337–407. https://doi.org/10.1214/aos/1016218223
Friedman, J. H., & Popescu, B. E. (2008). Predictive learning via rule ensembles. The Annals of Applied Statistics, 2(3), 916–954. https://doi.org/10.1214/07-AOAS148
Friedman, J., Hastie, T., & Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33(1), 1–22. https://doi.org/10.18637/jss.v033.i01
Friedman, M. (1937). The use of ranks to avoid the assumption of normality implicit in the analysis of variance. Journal of the American Statistical Association, 32(200), 675–701. https://doi.org/10.2307/2279372
Frost, J. (2020). The economic forces driving FinTech adoption across countries (BIS Working Paper 838). Bank for International Settlements. https://www.bis.org/publ/work838.htm
Frost, J., Gambacorta, L., Huang, Y., Shin, H. S., & Zbinden, P. (2019). BigTech and the changing structure of financial intermediation. Economic Policy, 34(100), 761–799. https://doi.org/10.1093/epolic/eiaa003
Frye, C., Rowat, C., & Feige, I. (2020). Asymmetric Shapley values: Incorporating causal knowledge into model-agnostic explainability.
Frye, J. (2000). Depressing recoveries. Risk Magazine, 13(11), 108–111.
Fuster, A., Goldsmith-Pinkham, P., Ramadorai, T., & Walther, A. (2022b). Predictably unequal? The effects of machine learning on credit markets. Journal of Finance, 77(1), 5–47. https://doi.org/10.1111/jofi.13090
Fuster, A., Goldsmith-Pinkham, P., Ramadorai, T., & Walther, A. (2022a). Predictably unequal? The effects of machine learning on credit markets. Journal of Finance, 77(1), 5–47. https://doi.org/10.1111/jofi.13090
Fuster, A., Hizmo, A., Lambie-Hanson, L., Vickery, J., & Willen, P. S. (2021). How resilient is mortgage credit supply? Evidence from the COVID-19 pandemic. Journal of Financial Economics, 143(2), 735–757. https://doi.org/10.1016/j.jfineco.2021.09.004
Fuster, A., Hizmo, A., Lambie-Hanson, L., Vickery, J., & Willen, P. S. (2024). How resilient is mortgage credit supply? Evidence from the COVID-19 pandemic. Journal of Finance. https://doi.org/10.3386/w28843
Fuster, A., Plosser, M., Schnabl, P., & Vickery, J. (2019a). The role of technology in mortgage lending. The Review of Financial Studies, 32(5), 1854–1899. https://doi.org/10.1093/rfs/hhz018
Fuster, A., Plosser, M., Schnabl, P., & Vickery, J. (2019b). The role of technology in mortgage lending. Review of Financial Studies, 32(5), 1854–1899. https://doi.org/10.1093/rfs/hhz018
Fuster, A., & Willen, P. S. (2017). Payment size, negative equity, and mortgage default. American Economic Journal: Economic Policy, 9(4), 167–191. https://doi.org/10.1257/pol.20150007
Gagliardini, P., & Gourieroux, C. (2013). Granularity adjustment for risk measures: Systematic vs unsystematic risks. International Journal of Approximate Reasoning, 54(6), 717–747. https://doi.org/10.1016/j.ijar.2013.02.001
Gai, P., & Kapadia, S. (2010). Contagion in financial networks. Proceedings of the Royal Society A, 466(2120), 2401–2423. https://doi.org/10.1098/rspa.2009.0410
Gal, Y., & Ghahramani, Z. (2016). Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. Proceedings of the 33rd International Conference on Machine Learning (ICML), 1050–1059.
Gama, J., Medas, P., Castillo, G., & Rodrigues, P. (2004). Learning with drift detection. Advances in Artificial Intelligence (SBIA 2004), Lecture Notes in Computer Science, 3171, 286–295. https://doi.org/10.1007/978-3-540-28645-5_29
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014a). A survey on concept drift adaptation. ACM Computing Surveys, 46(4), 44. https://doi.org/10.1145/2523813
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia, A. (2014b). A survey on concept drift adaptation. ACM Computing Surveys, 46(4), 44:1–44:37. https://doi.org/10.1145/2523813
Gambacorta, L., Huang, Y., Li, Z., Qiu, H., & Chen, S. (2020). Data vs collateral (BIS Working Paper 881). Bank for International Settlements. https://www.bis.org/publ/work881.htm
Gambacorta, L., Huang, Y., Qiu, H., & Wang, J. (2024). How do machine learning and non-traditional data affect credit scoring? New evidence from a chinese fintech firm. Journal of Financial Stability, 73, 101284. https://doi.org/10.1016/j.jfs.2024.101284
Ganin, Y., & Lempitsky, V. (2015). Unsupervised domain adaptation by backpropagation. Proceedings of the 32nd International Conference on Machine Learning (ICML), 1180–1189.
Ganong, P., & Noel, P. (2019). Consumer spending during unemployment: Positive and normative implications. American Economic Review, 109(7), 2383–2424. https://doi.org/10.1257/aer.20170537
Ganong, P., & Noel, P. (2020). Liquidity versus wealth in household debt obligations: Evidence from housing policy in the Great Recession. American Economic Review, 110(10), 3100–3138. https://doi.org/10.1257/aer.20181243
Gao, L., Madaan, A., Zhou, S., Alon, U., Liu, P., Yang, Y., Callan, J., & Neubig, G. (2023). PAL: Program-aided language models. Proceedings of the 40th International Conference on Machine Learning (ICML), 10764–10799.
Gao, Q., Lin, M., & Sias, R. (2023). Words matter: The role of texts in online credit markets. Journal of Financial and Quantitative Analysis, 58(1), 1–28. https://doi.org/10.1017/S0022109021000697
Garcia, D. (2013). Sentiment during recessions. The Journal of Finance, 68(3), 1267–1300. https://doi.org/10.1111/jofi.12027
Garcı́a, S., & Herrera, F. (2008). An extension on “statistical comparisons of classifiers over multiple data sets” for all pairwise comparisons. Journal of Machine Learning Research, 9, 2677–2694.
Garza, A., Challu, C., & Mergenthaler-Canseco, M. (2024). TimeGPT-1. arXiv:2310.03589. https://arxiv.org/abs/2310.03589
Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H., Daumé III, H., & Crawford, K. (2021). Datasheets for datasets. Communications of the ACM, 64, 86–92. https://doi.org/10.1145/3458723
Gelman, A., Jakulin, A., Pittau, M. G., & Su, Y.-S. (2008). A weakly informative default prior distribution for logistic and other regression models. The Annals of Applied Statistics, 2(4), 1360–1383. https://doi.org/10.1214/08-AOAS191
Genest, C., & Favre, A.-C. (2007). Everything you always wanted to know about copula modeling but were afraid to ask. Journal of Hydrologic Engineering, 12(4), 347–368. https://doi.org/10.1061/(ASCE)1084-0699(2007)12:4(347)
Gentzkow, M., Kelly, B., & Taddy, M. (2019). Text as data. Journal of Economic Literature, 57(3), 535–574. https://doi.org/10.1257/jel.20181020
Gerardi, K., Herkenhoff, K. F., Ohanian, L. E., & Willen, P. S. (2018). Can’t pay or won’t pay? Unemployment, negative equity, and strategic default. The Review of Financial Studies, 31(3), 1098–1131. https://doi.org/10.1093/rfs/hhx115
Gerds, T. A., & Schumacher, M. (2006). Consistent estimation of the expected Brier score in general survival models with right-censored event times. Biometrical Journal, 48(6), 1029–1040. https://doi.org/10.1002/bimj.200610301
Geske, R. (1977). The valuation of corporate liabilities as compound options. Journal of Financial and Quantitative Analysis, 12(4), 541–552. https://doi.org/10.2307/2330330
Geskus, R. B. (2011). Cause-specific cumulative incidence estimation and the fine and gray model under both left truncation and right censoring. Biometrics, 67(1), 39–49. https://doi.org/10.1111/j.1541-0420.2010.01420.x
Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized trees. Machine Learning, 63(1), 3–42. https://doi.org/10.1007/s10994-006-6226-1
Ghent, A. C., & Kudlyak, M. (2011). Recourse and residential mortgage default: Evidence from US states. The Review of Financial Studies, 24(9), 3139–3186. https://doi.org/10.1093/rfs/hhr055
Ghorbani, A., Abid, A., & Zou, J. (2019). Interpretation of neural networks is fragile. Proceedings of the AAAI Conference on Artificial Intelligence, 33, 3681–3688. https://doi.org/10.1609/aaai.v33i01.33013681
Gibbs, I., & Candès, E. J. (2021). Adaptive conformal inference under distribution shift. Advances in Neural Information Processing Systems 34 (NeurIPS 2021).
Gillis, T. B. (2022). The input fallacy. Minnesota Law Review, 106, 1175–1263.
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O., & Dahl, G. E. (2017). Neural message passing for quantum chemistry. Proceedings of the 34th International Conference on Machine Learning (ICML), 1263–1272.
Glasserman, P., & Young, H. P. (2016). Contagion in financial networks. Journal of Economic Literature, 54(3), 779–831. https://doi.org/10.1257/jel.20151228
Gneiting, T., & Raftery, A. E. (2007). Strictly proper scoring rules, prediction, and estimation. Journal of the American Statistical Association, 102(477), 359–378. https://doi.org/10.1198/016214506000001437
Goldfarb, A., & Tucker, C. (2011). Privacy regulation and online advertising. Management Science, 57(1), 57–71. https://doi.org/10.1287/mnsc.1100.1246
Goldfarb, A., & Tucker, C. (2019). Digital economics. Journal of Economic Literature, 57(1), 3–43. https://doi.org/10.1257/jel.20171452
Goldstein, A., Kapelner, A., Bleich, J., & Pitkin, E. (2015). Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation. Journal of Computational and Graphical Statistics, 24(1), 44–65. https://doi.org/10.1080/10618600.2014.907095
Goldstein, I., Jiang, W., & Karolyi, G. A. (2019). To FinTech and beyond. Review of Financial Studies, 32(5), 1647–1661. https://doi.org/10.1093/rfs/hhz025
Golub, G. H., & Van Loan, C. F. (2013). Matrix computations (4th ed.). Johns Hopkins University Press.
Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2020). Generative adversarial nets. Communications of the ACM, 63(11), 139–144. https://doi.org/10.1145/3422622
Goodfellow, I. J., Shlens, J., & Szegedy, C. (2015). Explaining and harnessing adversarial examples. International Conference on Learning Representations (ICLR).
Goodman, B., & Flaxman, S. (2017). European Union regulations on algorithmic decision-making and a “right to explanation.” AI Magazine, 38(3), 50–57. https://doi.org/10.1609/aimag.v38i3.2741
Goodman-Bacon, A. (2021). Difference-in-differences with variation in treatment timing. Journal of Econometrics, 225(2), 254–277. https://doi.org/10.1016/j.jeconom.2021.03.014
Gordy, M. B. (2003). A risk-factor model foundation for ratings-based bank capital rules. Journal of Financial Intermediation, 12(3), 199–232. https://doi.org/10.1016/S1042-9573(03)00040-8
Gordy, M. B., & Lütkebohmert, E. (2013). Granularity adjustment for regulatory capital assessment. International Journal of Central Banking, 9(3), 38–77.
Gorishniy, Y., Rubachev, I., Khrulkov, V., & Babenko, A. (2021). Revisiting deep learning models for tabular data. Advances in Neural Information Processing Systems 34 (NeurIPS 2021).
Gourieroux, C., Monfort, A., Renault, E., & Trognon, A. (1987). Generalised residuals. Journal of Econometrics, 34(1–2), 5–32. https://doi.org/10.1016/0304-4076(87)90065-0
Government of India. (2005). Credit information companies (regulation) act, 2005 and CIC regulations, 2006. Act No. 30 of 2005. https://www.rbi.org.in/
Government of India. (2023). Digital personal data protection act, 2023. Act No. 22 of 2023. https://www.meity.gov.in/content/digital-personal-data-protection-act-2023
Government of Vietnam. (2021a). Decision no. 942/QD-TTg on e-government development and the research and pilot of virtual currency based on blockchain technology. Prime Minister of Vietnam. https://vanban.chinhphu.vn/
Government of Vietnam. (2021b). Decree 80/2021/ND-CP detailing and guiding implementation of several articles of the law on support for small and medium-sized enterprises. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2022). Decree 53/2022/ND-CP detailing the law on cybersecurity. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2023a). Decree 13/2023/ND-CP on personal data protection. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2023b). Decree no. 13/2023/ND-CP on personal data protection. Government of the Socialist Republic of Vietnam. https://vanbanphapluat.co/decree-13-2023-nd-cp-personal-data-protection
Government of Vietnam. (2023c). Resolution 33/NQ-CP on solutions to remove difficulties for the real estate market and promote its safe, healthy, and sustainable development. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2025a). Decree 94/2025/ND-CP on the controlled testing mechanism for fintech activities in the banking sector. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2025b). Decree no. 94/2025/ND-CP on the controlled testing mechanism (Regulatory Sandbox) in the banking sector. Official Gazette of the Socialist Republic of Vietnam. https://vanban.chinhphu.vn/
Graf, E., Schmoor, C., Sauerbrei, W., & Schumacher, M. (1999). Assessment and comparison of prognostic classification schemes for survival data. Statistics in Medicine, 18(17-18), 2529–2545. https://doi.org/10.1002/(SICI)1097-0258(19990915/30)18:17/18<2529::AID-SIM274>3.0.CO;2-5
Grambsch, P. M., & Therneau, T. M. (1994). Proportional hazards tests and diagnostics based on weighted residuals. Biometrika, 81(3), 515–526.
Gray, R. J. (1988). A class of K-sample tests for comparing the cumulative incidence of a competing risk. Annals of Statistics, 16(3), 1141–1154. https://doi.org/10.1214/aos/1176350951
Green, P. J. (1984). Iteratively reweighted least squares for maximum likelihood estimation, and some robust and resistant alternatives. Journal of the Royal Statistical Society. Series B (Methodological), 46(2), 149–192.
Greene, W. H. (2003). Econometric analysis (5th ed.). Prentice Hall.
Greenwood, R., Hanson, S. G., Shleifer, A., & Sørensen, J. A. (2022). Predictable financial crises. Journal of Finance, 77(2), 863–921. https://doi.org/10.1111/jofi.13105
Greer, C. C. (1967). The optimal credit acceptance policy. Journal of Financial and Quantitative Analysis, 2(4), 399–415. https://doi.org/10.2307/2329825
Grembi, V., Nannicini, T., & Troiano, U. (2016). Do fiscal rules matter? American Economic Journal: Applied Economics, 8(3), 1–30. https://doi.org/10.1257/app.20150076
Griffin, J. M., & Tang, D. Y. (2012). Did subjectivity play a role in CDO credit ratings? Journal of Finance, 67(4), 1293–1328. https://doi.org/10.1111/j.1540-6261.2012.01748.x
Grinsztajn, L., Oyallon, E., & Varoquaux, G. (2022b). Why do tree-based models still outperform deep learning on typical tabular data?
Grinsztajn, L., Oyallon, E., & Varoquaux, G. (2022a). Why do tree-based models still outperform deep learning on typical tabular data? Advances in Neural Information Processing Systems 35 (NeurIPS), 507–520.
Gross, D. B., & Souleles, N. S. (2002). Do liquidity constraints and interest rates matter for consumer behavior? Evidence from credit card data. Quarterly Journal of Economics, 117(1), 149–185. https://doi.org/10.1162/003355302753399472
Grover, A., & Leskovec, J. (2016). node2vec: Scalable feature learning for networks. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 855–864. https://doi.org/10.1145/2939672.2939754
GSMA. (2023). The state of the industry report on mobile money 2023. GSM Association. https://www.gsma.com/sotir/
Gunnarsson, B. R., Broucke, S. vanden, Baesens, B., Óskarsdóttir, M., & Lemahieu, W. (2021). Deep learning for credit scoring: Do or don’t? European Journal of Operational Research, 295(1), 292–305. https://doi.org/10.1016/j.ejor.2021.03.006
Gunning, R. (1952). The technique of clear writing. McGraw-Hill.
Guo, C., Pleiss, G., Sun, Y., & Weinberger, K. Q. (2017). On calibration of modern neural networks. 1321–1330.
Gupton, G. M., Finger, C. C., & Bhatia, M. (1997). CreditMetrics technical document. J.P. Morgan & Co. https://www.msci.com/documents/1296102/1636401/CreditMetricsTechnicalDoc.pdf
Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3, 1157–1182.
Hahn, J., Todd, P., & Klaauw, W. van der. (2001). Identification and estimation of treatment effects with a regression-discontinuity design. Econometrica, 69(1), 201–209. https://doi.org/10.1111/1468-0262.00183
Haldane, A. G., & May, R. M. (2011). Systemic risk in banking ecosystems. Nature, 469(7330), 351–355. https://doi.org/10.1038/nature09659
Hale, G., Kapan, T., & Minoiu, C. (2020). Shock transmission through cross-border bank lending: Credit and real effects. The Review of Financial Studies, 33(10), 4839–4882. https://doi.org/10.1093/rfs/hhz147
Hall, P. (1988). On symmetric bootstrap confidence intervals. Journal of the Royal Statistical Society: Series B (Methodological), 50(1), 35–45.
Hamilton, W. L., Ying, R., & Leskovec, J. (2017). Inductive representation learning on large graphs. Advances in Neural Information Processing Systems 30 (NIPS 2017).
Han, H., Wang, W.-Y., & Mao, B.-H. (2005). Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning. Advances in Intelligent Computing (ICIC 2005), Lecture Notes in Computer Science, 3644, 878–887. https://doi.org/10.1007/11538059_91
Han, P. (2014). Multiply robust estimation in regression analysis with missing data. Journal of the American Statistical Association, 109(507), 1159–1173. https://doi.org/10.1080/01621459.2014.880058
Han, P., & Wang, L. (2013). Estimation with missing data: Beyond double robustness. Biometrika, 100(2), 417–430. https://doi.org/10.1093/biomet/ass087
Hand, D. J. (2006). Classifier technology and the illusion of progress. Statistical Science, 21(1), 1–14. https://doi.org/10.1214/088342306000000060
Hand, D. J. (2009). Measuring classifier performance: A coherent alternative to the area under the ROC curve. Machine Learning, 77(1), 103–123. https://doi.org/10.1007/s10994-009-5119-5
Hand, D. J., & Adams, N. M. (2000). Defining attributes for scorecard construction in credit scoring. Journal of Applied Statistics, 27(5), 527–540. https://doi.org/10.1080/02664760050076371
Hand, D. J., & Anagnostopoulos, C. (2013). When is the area under the receiver operating characteristic curve an appropriate measure of classifier performance? Pattern Recognition Letters, 34(5), 492–495. https://doi.org/10.1016/j.patrec.2012.12.004
Hand, D. J., & Henley, W. E. (1997a). Statistical classification methods in consumer credit scoring: A review. Journal of the Royal Statistical Society. Series A (Statistics in Society), 160(3), 523–541. https://doi.org/10.1111/j.1467-985X.1997.00078.x
Hand, D. J., & Henley, W. E. (1997b). Statistical classification methods in consumer credit scoring: A review. Journal of the Royal Statistical Society: Series A, 160(3), 523–541. https://doi.org/10.1111/j.1467-985X.1997.00078.x
Hand, D. J., & Till, R. J. (2001). A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learning, 45(2), 171–186. https://doi.org/10.1023/A:1010920819831
Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29–36. https://doi.org/10.1148/radiology.143.1.7063747
Hansen, S., McMahon, M., & Prat, A. (2018). Transparency and deliberation within the FOMC: A computational linguistics approach. The Quarterly Journal of Economics, 133(2), 801–870. https://doi.org/10.1093/qje/qjx045
Hardt, M., Price, E., & Srebro, N. (2016). Equality of opportunity in supervised learning. Advances in Neural Information Processing Systems 29 (NIPS 2016).
Hardy, S., Henecka, W., Ivey-Law, H., Nock, R., Patrini, G., Smith, G., & Thorne, B. (2017). Private federated learning on vertically partitioned data via entity resolution and additively homomorphic encryption. NeurIPS Workshop on Privacy-Preserving Machine Learning.
Harrell, F. E., Califf, R. M., Pryor, D. B., Lee, K. L., & Rosati, R. A. (1982). Evaluating the yield of medical tests. Journal of the American Medical Association, 247(18), 2543–2546. https://doi.org/10.1001/jama.1982.03320430047030
Harrell, F. E., Lee, K. L., & Mark, D. B. (1996). Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Statistics in Medicine, 15(4), 361–387. https://doi.org/10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4
Harris, T. (2013). Quantitative credit risk assessment using support vector machines: Broad versus narrow default definitions. Expert Systems with Applications, 40(11), 4404–4413. https://doi.org/10.1016/j.eswa.2013.01.044
Harrison, J. M., & Kreps, D. M. (1979). Martingales and arbitrage in multiperiod securities markets. Journal of Economic Theory, 20(3), 381–408. https://doi.org/10.1016/0022-0531(79)90043-7
Harrison, J. M., & Pliska, S. R. (1981). Martingales and stochastic integrals in the theory of continuous trading. Stochastic Processes and Their Applications, 11(3), 215–260. https://doi.org/10.1016/0304-4149(81)90026-0
Hart, P. E. (1968). The condensed nearest neighbor rule. IEEE Transactions on Information Theory, 14(3), 515–516. https://doi.org/10.1109/TIT.1968.1054155
Hashimoto, T. B., Srivastava, M., Namkoong, H., & Liang, P. (2018). Fairness without demographics in repeated loss minimization. Proceedings of the 35th International Conference on Machine Learning (ICML).
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning. https://doi.org/10.1007/978-0-387-84858-7
Hau, H., Huang, Y., Lin, C., Shan, H., Sheng, Z., & Wei, L. (2024). FinTech credit and entrepreneurial growth. Journal of Finance, 79(5), 3309–3359. https://doi.org/10.1111/jofi.13384
Hau, H., Huang, Y., Shan, H., & Sheng, Z. (2019). How FinTech enters China’s credit market. AEA Papers and Proceedings, 109, 60–64. https://doi.org/10.1257/pandp.20191012
Hausman, C., & Rapson, D. S. (2018). Regression discontinuity in time: Considerations for empirical applications. Annual Review of Resource Economics, 10, 533–552. https://doi.org/10.1146/annurev-resource-121517-033306
Hauswald, R., & Marquez, R. (2006). Competition and strategic information acquisition in credit markets. The Review of Financial Studies, 19(3), 967–1000. https://doi.org/10.1093/rfs/hhj021
Havlı́ček, V., Córcoles, A. D., Temme, K., Harrow, A. W., Kandala, A., Chow, J. M., & Gambetta, J. M. (2019). Supervised learning with quantum-enhanced feature spaces. Nature, 567(7747), 209–212. https://doi.org/10.1038/s41586-019-0980-2
Havrylchyk, O., Mariotto, C., Rahim, T., & Verdier, M. (2020). The expansion of peer-to-peer lending. The Review of Network Economics, 19(3), 145–187. https://doi.org/10.1515/rne-2020-0033
He, H., Bai, Y., Garcia, E. A., & Li, S. (2008). ADASYN: Adaptive synthetic sampling approach for imbalanced learning. Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN), 1322–1328. https://doi.org/10.1109/IJCNN.2008.4633969
He, H., & Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263–1284. https://doi.org/10.1109/TKDE.2008.239
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778. https://doi.org/10.1109/CVPR.2016.90
He, Z., Huang, J., & Zhou, J. (2023). Open banking: Credit market competition when borrowers own the data. Journal of Financial Economics, 147(2), 449–474. https://doi.org/10.1016/j.jfineco.2022.12.003
Heckman, J. J. (1974). Shadow prices, market wages, and labor supply. Econometrica, 42(4), 679–694. https://doi.org/10.2307/1913937
Heckman, J. J. (1976). The common structure of statistical models of truncation, sample selection and limited dependent variables and a simple estimator for such models. Annals of Economic and Social Measurement, 5(4), 475–492.
Heckman, J. J. (1979). Sample selection bias as a specification error. Econometrica, 47(1), 153–161. https://doi.org/10.2307/1912352
Helsen, K., & Schmittlein, D. C. (1993). Analyzing duration times in marketing: Evidence for the effectiveness of hazard rate models. Marketing Science, 12(4), 395–414. https://doi.org/10.1287/mksc.12.4.395
Hi! PARIS Center. (2024). XPER: eXplainable PERformance (Python package). https://github.com/hi-paris/XPER
Hillegeist, S. A., Keating, E. K., Cram, D. P., & Lundstedt, K. G. (2004). Assessing the probability of bankruptcy. Review of Accounting Studies, 9(1), 5–34. https://doi.org/10.1023/B:RAST.0000013627.90884.b7
Hinkley, D. V. (1971). Inference about the change-point from cumulative sum tests. Biometrika, 58(3), 509–523. https://doi.org/10.2307/2334386
Hirano, K., Imbens, G. W., & Ridder, G. (2003). Efficient estimation of average treatment effects using the estimated propensity score. Econometrica, 71(4), 1161–1189. https://doi.org/10.1111/1468-0262.00442
Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in Neural Information Processing Systems 33 (NeurIPS).
Hoberg, G., & Phillips, G. (2016). Text-based network industries and endogenous product differentiation. Journal of Political Economy, 124(5), 1423–1465. https://doi.org/10.1086/688176
Hobson, J. L., Mayew, W. J., & Venkatachalam, M. (2012). Analyzing speech to detect financial misreporting. Journal of Accounting Research, 50(2), 349–392. https://doi.org/10.1111/j.1475-679X.2011.00433.x
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Hodges, J. L., & Lehmann, E. L. (1962). Rank methods for combination of independent experiments in analysis of variance. The Annals of Mathematical Statistics, 33(2), 482–497. https://doi.org/10.1214/aoms/1177704575
Hoeting, J. A., Madigan, D., Raftery, A. E., & Volinsky, C. T. (1999). Bayesian model averaging: A tutorial. Statistical Science, 14(4), 382–417. https://doi.org/10.1214/ss/1009212519
Hofert, M., Kojadinovic, I., Mächler, M., & Yan, J. (2018). Elements of copula modeling with R. Use R! https://doi.org/10.1007/978-3-319-89635-9
Hofmann, H. (1994). Statlog (german credit data). UCI Machine Learning Repository. https://doi.org/10.24432/C5NC77
Holford, T. R. (1983). The estimation of age, period and cohort effects for vital rates. Biometrics, 39(2), 311–324. https://doi.org/10.2307/2531004
Holland, P. W. (1986). Statistics and causal inference. Journal of the American Statistical Association, 81(396), 945–960. https://doi.org/10.2307/2289064
Holm, S. (1979). A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics, 6(2), 65–70.
Holmstrom, B. (1979). Moral hazard and observability. The Bell Journal of Economics, 10(1), 74–91. https://doi.org/10.2307/3003320
Home Credit Group. (2018). Home credit default risk. Kaggle Competition.
Home Credit Vietnam Finance Company Limited. (2023). Annual report 2023. Ho Chi Minh City. https://www.homecredit.vn/
Hooker, S., Erhan, D., Kindermans, P.-J., & Kim, B. (2019). A benchmark for interpretability methods in deep neural networks. Advances in Neural Information Processing Systems 32 (NeurIPS 2019).
Horn, R. A., & Johnson, C. R. (2012). Matrix analysis (2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9781139020411
Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2(5), 359–366. https://doi.org/10.1016/0893-6080(89)90020-8
Horvitz, D. G., & Thompson, D. J. (1952). A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47(260), 663–685. https://doi.org/10.1080/01621459.1952.10483446
Hosmer, D. W., & Lemesbow, S. (1980). Goodness of fit tests for the multiple logistic regression model. Communications in Statistics-Theory and Methods, 9(10), 1043–1069.
Hosmer, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression.
Hothorn, T., Hornik, K., & Zeileis, A. (2006). Unbiased recursive partitioning: A conditional inference framework. Journal of Computational and Graphical Statistics, 15(3), 651–674. https://doi.org/10.1198/106186006X133933
Houlsby, N., Giurgiu, A., Jastrzebski, S., Morrone, B., De Laroussilhe, Q., Gesmundo, A., Attariyan, M., & Gelly, S. (2019). Parameter-efficient transfer learning for NLP. Proceedings of the 36th International Conference on Machine Learning (ICML), 2790–2799.
Howell, S. T., Kuchler, T., Snitkof, D., Stroebel, J., & Wong, J. (2024). Lender automation and racial disparities in credit access. The Journal of Finance, 79(2), 1457–1512. https://doi.org/10.1111/jofi.13303
Hsia, D. C. (1978). Credit scoring and the equal credit opportunity act. Hastings Law Journal, 30(2), 371–448.
Hsieh, C.-J., Chang, K.-W., Lin, C.-J., Keerthi, S. S., & Sundararajan, S. (2008). A dual coordinate descent method for large-scale linear SVM. 408–415. https://doi.org/10.1145/1390156.1390208
Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., & Chen, W. (2022). LoRA: Low-rank adaptation of large language models. International Conference on Learning Representations (ICLR).
Hu, X., Rudin, C., & Seltzer, M. (2019). Optimal sparse decision trees.
Huang, A. H., Wang, H., & Yang, Y. (2023). FinBERT: A large language model for extracting information from financial text. Contemporary Accounting Research, 40(2), 806–841. https://doi.org/10.1111/1911-3846.12832
Huang, C.-L., Chen, M.-C., & Wang, C.-J. (2007). Credit scoring with a data mining approach based on support vector machines. Expert Systems with Applications, 33(4), 847–856. https://doi.org/10.1016/j.eswa.2006.07.007
Huang, H.-Y., Broughton, M., Cotler, J., Chen, S., Li, J., Mohseni, M., Neven, H., Babbush, R., Kueng, R., Preskill, J., & McClean, J. R. (2022). Quantum advantage in learning from experiments. Science, 376(6598), 1182–1186. https://doi.org/10.1126/science.abn7293
Huang, J., Smola, A. J., Gretton, A., Borgwardt, K. M., & Schölkopf, B. (2007). Correcting sample selection bias by unlabeled data. Advances in Neural Information Processing Systems (NeurIPS), 19.
Huang, J.-Z., & Huang, M. (2012). How much of the corporate-treasury yield spread is due to credit risk? The Review of Asset Pricing Studies, 2(2), 153–202. https://doi.org/10.1093/rapstu/ras011
Huang, X., Khetan, A., Cvitkovic, M., & Karnin, Z. (2020). TabTransformer: Tabular data modeling using contextual embeddings. arXiv Preprint arXiv:2012.06678.
Huang, Y., Zhang, L., Li, Z., Qiu, H., Sun, T., & Wang, X. (2020). Fintech credit risk assessment for SMEs: Evidence from China. IMF Working Paper, (20/193). https://www.imf.org/en/Publications/WP/Issues/2020/09/25/Fintech-Credit-Risk-Assessment-for-SMEs-Evidence-from-China-49742
Hué, S., Hurlin, C., Pérignon, C., & Saurin, S. (2022). Measuring the driving forces of predictive performance: Application to credit scoring. arXiv Preprint arXiv:2212.05866.
Hull, J. C., & White, A. (2013). LIBOR vs. OIS: The derivatives discounting dilemma. Journal of Investment Management, 11(3), 14–27.
Hurley, M., & Adebayo, J. (2016). Credit scoring in the era of big data. Yale Journal of Law and Technology, 18, 148–216.
Hurlin, C., Pérignon, C., & Saurin, S. (2026). The fairness of credit scoring models. Management Science, 72(1), 406–425.
IEEE Computational Intelligence Society, & Corporation, V. (2019). IEEE-CIS fraud detection. Kaggle Competition.
Iman, R. L., & Davenport, J. M. (1980). Approximations of the critical region of the Friedman statistic. Communications in Statistics - Theory and Methods, 9(6), 571–595. https://doi.org/10.1080/03610928008827904
Imbens, G. W., & Angrist, J. D. (1994). Identification and estimation of local average treatment effects. Econometrica, 62(2), 467–475. https://doi.org/10.2307/2951620
Imbens, G. W., & Lemieux, T. (2008). Regression discontinuity designs: A guide to practice. Journal of Econometrics, 142(2), 615–635. https://doi.org/10.1016/j.jeconom.2007.05.001
Imbens, G. W., & Rubin, D. B. (2015). Causal inference for statistics, social, and biomedical sciences: An introduction. Cambridge University Press. https://doi.org/10.1017/CBO9781139025751
Indarte, S. (2023). Moral hazard versus liquidity in household bankruptcy. Journal of Finance, 78(5), 2421–2464. https://doi.org/10.1111/jofi.13263
International Accounting Standards Board. (2014). IFRS 9: Financial instruments. IFRS Foundation.
International Finance Corporation. (2019). MSME finance gap: Viet nam country profile. International Finance Corporation. https://www.ifc.org/en/what-we-do/sector-expertise/financial-institutions/msme-finance
International Monetary Fund. (2019). Vietnam: Financial sector assessment program, technical note on systemic risk analysis and stress testing (IMF Country Report 19/373). International Monetary Fund. https://www.imf.org/en/Publications/CR/Issues/2019/12/13/Vietnam-Financial-Sector-Assessment-Program-48885
International Monetary Fund. (2023a). Fintech and financial inclusion in low-income countries (IMF Departmental Paper DP/2023/004). International Monetary Fund. https://www.imf.org/en/Publications/Departmental-Papers-Policy-Papers/Issues/2023/06/23/Fintech-and-Financial-Inclusion-in-Low-Income-Countries-534832
International Monetary Fund. (2023b). Vietnam: 2023 article IV consultation, IMF country report no. 23/352. International Monetary Fund. https://www.imf.org/en/Publications/CR/Issues/2023/10/10/Vietnam-2023-Article-IV-Consultation
International Monetary Fund. (2024). Vietnam: 2024 article IV consultation – press release; staff report; and statement by the executive director for vietnam, IMF country report no. 24/306. International Monetary Fund. https://www.imf.org/en/publications/cr/issues/2024/09/27/vietnam-2024-article-iv-consultation-press-release-staff-report-and-statement-by-the-555679
Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning (ICML), 448–456.
Ishwaran, H., Kogalur, U. B., Blackwell, E. H., & Lauer, M. S. (2008). Random survival forests. The Annals of Applied Statistics, 2(3), 841–860. https://doi.org/10.1214/08-AOAS169
Israel, R. B., Rosenthal, J. S., & Wei, J. Z. (2001). Finding generators for Markov chains via empirical transition matrices, with applications to credit ratings. Mathematical Finance, 11(2), 245–265. https://doi.org/10.1111/1467-9965.00114
Ivanov, I. T., Kruttli, M. S., & Watugala, S. W. (2024). Banking on carbon: Corporate lending and cap-and-trade policy. Review of Financial Studies, 37(5), 1640–1684. https://doi.org/10.1093/rfs/hhad080
Iyer, R., Khwaja, A. I., Luttmer, E. F. P., & Shue, K. (2016). Screening peers softly: Inferring the quality of small borrowers. Management Science, 62(6), 1554–1577. https://doi.org/10.1287/mnsc.2015.2181
Iyer, R., & Peydro, J.-L. (2011). Interbank contagion at work: Evidence from a natural experiment. The Review of Financial Studies, 24(4), 1337–1377. https://doi.org/10.1093/rfs/hhp105
Jack, W., & Suri, T. (2014). Risk sharing and transactions costs: Evidence from Kenya’s mobile money revolution. American Economic Review, 104(1), 183–223. https://doi.org/10.1257/aer.104.1.183
Jaffee, D. M., & Russell, T. (1976). Imperfect information, uncertainty, and credit rationing. The Quarterly Journal of Economics, 90(4), 651–666. https://doi.org/10.2307/1885327
Jäger, S., Allhorn, A., & Bießmann, F. (2021). A benchmark for data imputation methods. Frontiers in Big Data, 4, 693674. https://doi.org/10.3389/fdata.2021.693674
Jagtiani, J., & Lemieux, C. (2019). The roles of alternative data and machine learning in fintech lending: Evidence from the LendingClub consumer platform. Financial Management, 48(4), 1009–1029. https://doi.org/10.1111/fima.12295
Jain, D. C., & Vilcassim, N. J. (1991). Investigating household purchase timing decisions: A conditional hazard function approach. Marketing Science, 10(1), 1–23. https://doi.org/10.1287/mksc.10.1.1
Jain, S., & Wallace, B. C. (2019). Attention is not explanation. Proceedings of NAACL-HLT, 3543–3556. https://doi.org/10.18653/v1/N19-1357
Janakiraman, R., Lim, J. H., & Rishika, R. (2018). The effect of a data breach announcement on customer behavior: Evidence from a multichannel retailer. Journal of Marketing, 82(2), 85–105. https://doi.org/10.1509/jm.16.0124
Janzing, D., Minorics, L., & Blöbaum, P. (2020). Feature relevance quantification in explainable AI: A causal problem. Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), 2907–2916.
Jarrett, D., Cebere, B. C., Liu, T., Curth, A., & Schaar, M. van der. (2022). HyperImpute: Generalized iterative imputation with automatic model selection. Proceedings of the 39th International Conference on Machine Learning (ICML).
Jarrow, R. A., Lando, D., & Turnbull, S. M. (1997). A Markov model for the term structure of credit risk spreads. The Review of Financial Studies, 10(2), 481–523. https://doi.org/10.1093/rfs/10.2.481
Jarrow, R. A., & Turnbull, S. M. (1995). Pricing derivatives on financial securities subject to credit risk. The Journal of Finance, 50(1), 53–85. https://doi.org/10.1111/j.1540-6261.1995.tb05167.x
Jegadeesh, N., & Wu, D. (2013). Word power: A new approach for content analysis. Journal of Financial Economics, 110(3), 712–729. https://doi.org/10.1016/j.jfineco.2013.08.018
Jethani, N., Sudarshan, M., Covert, I., Lee, S.-I., & Ranganath, R. (2022). FastSHAP: Real-time Shapley value estimation. International Conference on Learning Representations (ICLR).
Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., Ishii, E., Bang, Y., Madotto, A., & Fung, P. (2023). Survey of hallucination in natural language generation. ACM Computing Surveys, 55, 1–38. https://doi.org/10.1145/3571730
Joe, H. (2014). Dependence modeling with copulas. Chapman; Hall/CRC. https://doi.org/10.1201/b17116
Johnson, G. A., Shriver, S. K., & Goldberg, S. G. (2023). Privacy and market concentration: Intended and unintended consequences of the GDPR. Management Science, 69(10), 5695–5721. https://doi.org/10.1287/mnsc.2023.4709
Jones, C. I., & Tonetti, C. (2020). Nonrivalry and the economics of data. American Economic Review, 110(9), 2819–2858. https://doi.org/10.1257/aer.20191330
Jones, E. P., Mason, S. P., & Rosenfeld, E. (1984). Contingent claims analysis of corporate capital structures: An empirical investigation. The Journal of Finance, 39(3), 611–625. https://doi.org/10.2307/2327919
Jordon, J., Szpruch, L., Houssiau, F., Bottarelli, M., Cherubin, G., Maple, C., Cohen, S. N., & Weller, A. (2022). Synthetic data - what, why and how? The Royal Society Report (Commissioned by The Alan Turing Institute).
Jordon, J., Yoon, J., & Schaar, M. van der. (2019). PATE-GAN: Generating synthetic data with differential privacy guarantees. International Conference on Learning Representations (ICLR).
Kairouz, P., McMahan, H. B., Avent, B., Bellet, A., Bennis, M., Bhagoji, A. N., Bonawitz, K., Charles, Z., Cormode, G., Cummings, R., et al. (2021). Advances and open problems in federated learning. Foundations and Trends in Machine Learning, 14(1-2), 1–210. https://doi.org/10.1561/2200000083
Kalemli-Özcan, Ş., Di Giovanni, J., Silva, Á., & Yildirim, M. A. (2022). Global supply chain pressures, international trade, and inflation. NBER Working Paper, (30240). https://www.nber.org/papers/w30240
Kamiran, F., & Calders, T. (2012). Data preprocessing techniques for classification without discrimination. Knowledge and Information Systems, 33, 1–33. https://doi.org/10.1007/s10115-011-0463-8
Kang, J. D. Y., & Schafer, J. L. (2007). Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statistical Science, 22(4), 523–539. https://doi.org/10.1214/07-STS227
Kantorovich, L. V. (1960). Mathematical methods of organizing and planning production. Management Science, 6(4), 366–422. https://doi.org/10.1287/mnsc.6.4.366
Kaplan, E. L., & Meier, P. (1958). Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53(282), 457–481. https://doi.org/10.2307/2281868
Karakoulas, G. (2004). Empirical validation of retail credit-scoring models. RMA Journal, 87(1), 56–60.
Karimi, A.-H., Barthe, G., Balle, B., & Valera, I. (2020). Model-agnostic counterfactual explanations for consequential decisions. Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS), 895–905.
Karimi, A.-H., Barthe, G., Schölkopf, B., & Valera, I. (2022). A survey of algorithmic recourse: Contrastive explanations and consequential recommendations. ACM Computing Surveys, 55(5), 1–29. https://doi.org/10.1145/3527848
Karlan, D., McConnell, M., Mullainathan, S., & Zinman, J. (2016). Getting to the top of mind: How reminders increase saving. Management Science, 62(12), 3393–3411. https://doi.org/10.1287/mnsc.2015.2296
Karlan, D., Mobius, M., Rosenblat, T., & Szeidl, A. (2009). Trust and social collateral. The Quarterly Journal of Economics, 124(3), 1307–1361. https://doi.org/10.1162/qjec.2009.124.3.1307
Karlan, D., & Zinman, J. (2009). Observing unobservables: Identifying information asymmetries with a consumer credit field experiment. Econometrica, 77(6), 1993–2008. https://doi.org/10.3982/ECTA5781
Karlan, D., & Zinman, J. (2010). Expanding credit access: Using randomized supply decisions to estimate the impacts. Review of Financial Studies, 23(1), 433–464. https://doi.org/10.1093/rfs/hhp092
Katz, L. (1953). A new status index derived from sociometric analysis. Psychometrika, 18(1), 39–43. https://doi.org/10.1007/BF02289026
Katzman, J. L., Shaham, U., Cloninger, A., Bates, J., Jiang, T., & Kluger, Y. (2018). DeepSurv: Personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Medical Research Methodology, 18(1), 24. https://doi.org/10.1186/s12874-018-0482-1
Kau, J. B., Keenan, D. C., Muller, W. J., & Epperson, J. F. (1992). A generalized valuation model for fixed-rate residential mortgages. Journal of Financial and Quantitative Analysis, 27(3), 279–299. https://doi.org/10.2307/2331201
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T.-Y. (2017). LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems 30 (NIPS 2017).
Keane, M. P., & Neal, T. (2024). A practical guide to weak instruments. Annual Review of Economics, 16, 185–212. https://doi.org/10.1146/annurev-economics-092123-111021
Kearns, M., & Valiant, L. (1994). Cryptographic limitations on learning Boolean formulae and finite automata. Journal of the ACM, 41(1), 67–95. https://doi.org/10.1145/174644.174647
Keerthi, S. S., & Lin, C.-J. (2003). Asymptotic behaviors of support vector machines with Gaussian kernel. Neural Computation, 15(7), 1667–1689. https://doi.org/10.1162/089976603321891855
Kennedy, E. H. (2024). Semiparametric doubly robust targeted double machine learning: A review. https://arxiv.org/abs/2203.06469
Keys, B. J., Mukherjee, T., Seru, A., & Vig, V. (2010). Did securitization lead to lax screening? Evidence from subprime loans. The Quarterly Journal of Economics, 125(1), 307–362. https://doi.org/10.1162/qjec.2010.125.1.307
Khandani, A. E., Kim, A. J., & Lo, A. W. (2010). Consumer credit-risk models via machine-learning algorithms. Journal of Banking and Finance, 34(11), 2767–2787. https://doi.org/10.1016/j.jbankfin.2010.06.001
Khattab, O., & Zaharia, M. (2020). ColBERT: Efficient and effective passage search via contextualized late interaction over BERT. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 39–48. https://doi.org/10.1145/3397271.3401075
Khieu, H. D., Mullineaux, D. J., & Yi, H.-C. (2012). The determinants of bank loan recovery rates. Journal of Banking and Finance, 36(4), 923–933. https://doi.org/10.1016/j.jbankfin.2011.10.005
Kilbertus, N., Rojas-Carulla, M., Parascandolo, G., Hardt, M., Janzing, D., & Schölkopf, B. (2017). Avoiding discrimination through causal reasoning. Advances in Neural Information Processing Systems 30 (NIPS 2017).
Kim, B., Khanna, R., & Koyejo, O. O. (2016). Examples are not enough, learn to criticize! Criticism for interpretability. Advances in Neural Information Processing Systems 29 (NeurIPS 2016).
Kim, B., Wattenberg, M., Gilmer, J., Cai, C., Wexler, J., Viegas, F., & Sayres, R. (2018). Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (TCAV). Proceedings of the 35th International Conference on Machine Learning (ICML), 2668–2677.
Kimeldorf, G., & Wahba, G. (1971). Some results on Tchebycheffian spline functions. Journal of Mathematical Analysis and Applications, 33(1), 82–95. https://doi.org/10.1016/0022-247X(71)90184-3
King, G., & Zeng, L. (2001). Logistic regression in rare events data. Political Analysis, 9(2), 137–163. https://doi.org/10.1093/oxfordjournals.pan.a004868
Kingma, D. P., & Ba, J. (2015). Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR).
Kingma, D. P., & Welling, M. (2014). Auto-encoding variational Bayes. International Conference on Learning Representations (ICLR).
Kipf, T. N., & Welling, M. (2017). Semi-supervised classification with graph convolutional networks. International Conference on Learning Representations (ICLR).
Kiryo, R., Niu, G., Plessis, M. C. du, & Sugiyama, M. (2017). Positive-unlabeled learning with non-negative risk estimator. Advances in Neural Information Processing Systems (NeurIPS), 30.
Kisgen, D. J. (2006). Credit ratings and capital structure. Journal of Finance, 61(3), 1035–1072. https://doi.org/10.1111/j.1540-6261.2006.00866.x
Klaise, J., Van Looveren, A., Cox, C., Vacanti, G., & Coca, A. (2020). Monitoring and explainability of models in production. USENIX Conference on Operational Machine Learning (OpML).
Klein, J. P., & Moeschberger, M. L. (2003). Survival analysis: Techniques for censored and truncated data (2nd ed.). Springer. https://doi.org/10.1007/b97377
Kleinberg, J., Ludwig, J., Mullainathan, S., & Rambachan, A. (2018). Algorithmic fairness. AEA Papers and Proceedings, 108, 22–27. https://doi.org/10.1257/pandp.20181018
Kleinberg, J., Mullainathan, S., & Raghavan, M. (2017). Inherent trade-offs in the fair determination of risk scores. 8th Innovations in Theoretical Computer Science Conference (ITCS 2017), 43:1–43:23. https://doi.org/10.4230/LIPIcs.ITCS.2017.43
Klinger, B., Khwaja, A. I., & Carpio, C. del. (2013). Enterprising psychometrics and poverty reduction. SpringerBriefs in Psychology. https://doi.org/10.1007/978-1-4614-7227-8
Koenker, R., & Bassett, G. (1978). Regression quantiles. Econometrica, 46(1), 33–50. https://doi.org/10.2307/1913643
Koh, K., Kim, S.-J., & Boyd, S. (2007). An interior-point method for large-scale L1-regularized logistic regression. Journal of Machine Learning Research, 8, 1519–1555.
Koh, P. W., Sagawa, S., Marklund, H., Xie, S. M., Zhang, M., Balsubramani, A., Hu, W., Yasunaga, M., Phillips, R. L., Beery, S., et al. (2021). WILDS: A benchmark of in-the-wild distribution shifts. Proceedings of the 38th International Conference on Machine Learning (ICML).
Kojima, T., Gu, S. S., Reid, M., Matsuo, Y., & Iwasawa, Y. (2022). Large language models are zero-shot reasoners. Advances in Neural Information Processing Systems 35 (NeurIPS), 22199–22213.
Kokhlikyan, N., Miglani, V., Martin, M., Wang, E., Alsallakh, B., Reynolds, J., Melnikov, A., Kliushkina, N., Araya, C., Yan, S., & Reblitz-Richardson, O. (2020). Captum: A unified and generic model interpretability library for PyTorch. arXiv Preprint arXiv:2009.07896.
Kolen, M. J., & Brennan, R. L. (2014). Test equating, scaling, and linking: Methods and practices (3rd ed.). Springer. https://doi.org/10.1007/978-1-4939-0317-7
Kolmogorov, A. (1933). Sulla determinazione empirica di una legge di distribuzione. Giornale Dell’Istituto Italiano Degli Attuari, 4, 83–91.
Koopman, S. J., Lucas, A., & Monteiro, A. (2008). The multi-state latent factor intensity model for credit rating transitions. Journal of Econometrics, 142(1), 399–424. https://doi.org/10.1016/j.jeconom.2007.07.001
Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits and attributes are predictable from digital records of human behavior. Proceedings of the National Academy of Sciences, 110(15), 5802–5805. https://doi.org/10.1073/pnas.1218772110
Kotelnikov, A., Baranchuk, D., Rubachev, I., & Babenko, A. (2023). TabDDPM: Modelling tabular data with diffusion models. Proceedings of the 40th International Conference on Machine Learning (ICML), 17564–17579.
Kou, G., Xu, Y., Peng, Y., Shen, F., Chen, Y., Chang, K., & Kou, S. (2021). Bankruptcy prediction for SMEs using transactional data and two-stage multiobjective feature selection. Decision Support Systems, 140, 113429. https://doi.org/10.1016/j.dss.2020.113429
Kozodoi, N., Lessmann, S., Alamgir, M., Moreira-Matias, L., & Papakonstantinou, K. (2025). Fighting sampling bias: A framework for training and evaluating credit scoring models. European Journal of Operational Research, 324(2), 616–628.
Kraus, S., & Feuerriegel, S. (2017). Decision support from financial disclosures with deep neural networks and transfer learning. Decision Support Systems, 104, 38–48. https://doi.org/10.1016/j.dss.2017.10.001
Krawczyk, B. (2016). Learning from imbalanced data: Open challenges and future directions. Progress in Artificial Intelligence, 5(4), 221–232. https://doi.org/10.1007/s13748-016-0094-0
Kreps, J., Narkhede, N., & Rao, J. (2011). Kafka: A distributed messaging system for log processing. Proceedings of the 6th International Workshop on Networking Meets Databases (NetDB).
Krishna, S., Han, T., Gu, A., Pombra, J., Jabbari, S., Wu, S., & Lakkaraju, H. (2024). The disagreement problem in explainable machine learning: A practitioner’s perspective. Transactions on Machine Learning Research.
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2017). ImageNet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84–90. https://doi.org/10.1145/3065386
Kuk, A. Y. C., & Chen, C.-H. (1992). A mixture model combining logistic regression with proportional hazards regression. Biometrika, 79(3), 531–541. https://doi.org/10.1093/biomet/79.3.531
Kull, M., Silva Filho, T. M., & Flach, P. (2017). Beyond sigmoids: How to obtain well-calibrated probabilities from binary classifiers with beta calibration. Electronic Journal of Statistics, 11(2), 5052–5080. https://doi.org/10.1214/17-EJS1338SI
Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. The Annals of Mathematical Statistics, 22(1), 79–86. https://doi.org/10.1214/aoms/1177729694
Kumar, A., Liang, P. S., & Ma, T. (2019). Verified uncertainty calibration. Advances in Neural Information Processing Systems, 32.
Kumar, I. E., Venkatasubramanian, S., Scheidegger, C., & Friedler, S. (2020). Problems with Shapley-value-based explanations as feature importance measures. Proceedings of the 37th International Conference on Machine Learning, 5491–5500.
Künzel, S. R., Sekhon, J. S., Bickel, P. J., & Yu, B. (2019a). Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences, 116(10), 4156–4165. https://doi.org/10.1073/pnas.1804597116
Künzel, S. R., Sekhon, J. S., Bickel, P. J., & Yu, B. (2019b). Metalearners for estimating heterogeneous treatment effects using machine learning. Proceedings of the National Academy of Sciences, 116(10), 4156–4165. https://doi.org/10.1073/pnas.1804597116
Kupiec, P. H. (2018). On the accuracy of alternative approaches for calibrating bank stress test models. Journal of Financial Stability, 38, 132–146. https://doi.org/10.1016/j.jfs.2018.04.002
Kursa, M. B., & Rudnicki, W. R. (2010). Feature selection with the Boruta package. Journal of Statistical Software, 36(11), 1–13. https://doi.org/10.18637/jss.v036.i11
Kusner, M. J., Loftus, J. R., Russell, C., & Silva, R. (2017). Counterfactual fairness. Advances in Neural Information Processing Systems 30 (NIPS 2017).
Kvamme, H., Sellereite, N., Aas, K., & Sjursen, S. (2018). Predicting mortgage default using convolutional neural networks. Expert Systems with Applications, 102, 207–217. https://doi.org/10.1016/j.eswa.2018.02.029
Lagakos, S. W., Barraj, L. M., & De Gruttola, V. (1988). Nonparametric analysis of truncated survival data, with application to AIDS. Biometrika, 75(3), 515–523. https://doi.org/10.1093/biomet/75.3.515
Lando, D. (1998). On Cox processes and credit risky securities. Review of Derivatives Research, 2(2-3), 99–120. https://doi.org/10.1007/BF01531332
Lando, D., & Nielsen, M. S. (2010). Correlation in corporate defaults: Contagion or conditional independence? Journal of Financial Intermediation, 19(3), 355–372. https://doi.org/10.1016/j.jfi.2010.03.002
Lando, D., & Skødeberg, T. M. (2002). Analyzing rating transitions and rating drift with continuous observations. Journal of Banking and Finance, 26(2-3), 423–444. https://doi.org/10.1016/S0378-4266(01)00228-X
Larcker, D. F., & Zakolyukina, A. A. (2012). Detecting deceptive discussions in conference calls. Journal of Accounting Research, 50(2), 495–540. https://doi.org/10.1111/j.1475-679X.2012.00450.x
Lauer, J. (2017). Creditworthy: A history of consumer surveillance and financial identity in america.
Laugel, T., Lesot, M.-J., Marsala, C., Renard, X., & Detyniecki, M. (2018). Comparison-based inverse classification for interpretability in machine learning. Communications in Computer and Information Science, 853, 100–111. https://doi.org/10.1007/978-3-319-91473-2\_9
Le Morvan, M., Josse, J., Scornet, E., & Varoquaux, G. (2021). What’s a good imputation to predict with missing values? Advances in Neural Information Processing Systems (NeurIPS), 34.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444. https://doi.org/10.1038/nature14539
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278–2324. https://doi.org/10.1109/5.726791
Lee, D. S., & Lemieux, T. (2010). Regression discontinuity designs in economics. Journal of Economic Literature, 48(2), 281–355. https://doi.org/10.1257/jel.48.2.281
Lee, D. S., McCrary, J., Moreira, M. J., & Porter, J. (2022). Valid t-ratio inference for IV. American Economic Review, 112(10), 3260–3290. https://doi.org/10.1257/aer.20211063
Lee, D.-H. (2013). Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks. ICML Workshop on Challenges in Representation Learning.
Lee, L.-F. (1983). Generalized econometric models with selectivity. Econometrica, 51(2), 507–512. https://doi.org/10.2307/1912003
Lehmann, E. L., & Casella, G. (1998). Theory of point estimation (2nd ed.). Springer. https://doi.org/10.1007/b98854
Lei, J., G’Sell, M., Rinaldo, A., Tibshirani, R. J., & Wasserman, L. (2018). Distribution-free predictive inference for regression. Journal of the American Statistical Association, 113(523), 1094–1111. https://doi.org/10.1080/01621459.2017.1307116
Leland, H. E. (1994). Corporate debt value, bond covenants, and optimal capital structure. The Journal of Finance, 49(4), 1213–1252. https://doi.org/10.2307/2329184
Leland, H. E., & Toft, K. B. (1996). Optimal capital structure, endogenous bankruptcy, and the term structure of credit spreads. The Journal of Finance, 51(3), 987–1019. https://doi.org/10.2307/2329229
Lemaître, G., Nogueira, F., & Aridas, C. K. (2017). Imbalanced-learn: A Python toolbox to tackle the curse of imbalanced datasets in machine learning. Journal of Machine Learning Research, 18(17), 1–5.
Lemmens, A., & Gupta, S. (2020). Managing churn to maximize profits. Marketing Science, 39(5), 956–973. https://doi.org/10.1287/mksc.2020.1229
Lending Club. (2019). Lending club loan data (2007–2018). Kaggle Dataset Mirror.
Leow, M., & Crook, J. (2014). Intensity models and transition probabilities for credit card loan delinquencies. European Journal of Operational Research, 236(2), 685–694. https://doi.org/10.1016/j.ejor.2013.12.026
Lessmann, S., Baesens, B., Seow, H.-V., & Thomas, L. C. (2015b). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 247(1), 124–136. https://doi.org/10.1016/j.ejor.2015.05.030
Lessmann, S., Baesens, B., Seow, H.-V., & Thomas, L. C. (2015a). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 247(1), 124–136. https://doi.org/10.1016/j.ejor.2015.05.030
Letham, B., Rudin, C., McCormick, T. H., & Madigan, D. (2015). Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model. The Annals of Applied Statistics, 9(3), 1350–1371. https://doi.org/10.1214/15-AOAS848
Letizia, E., & Lillo, F. (2019). Corporate payments networks and credit risk rating. EPJ Data Science, 8(1), 21. https://doi.org/10.1140/epjds/s13688-019-0197-5
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems 33 (NeurIPS), 9459–9474.
Leyshon, A., & Thrift, N. (1999). Lists come alive: Electronic systems of knowledge and the rise of credit-scoring in retail banking. Economy and Society, 28(3), 434–466. https://doi.org/10.1080/03085149900000013
Li, C., Wang, H., Jiang, S., & Gu, B. (2024). The effect of AI-enabled credit scoring on financial inclusion: Evidence from an underserved population of over one million. MIS Quarterly, 48(4), 1803–1834. https://doi.org/10.25300/MISQ/2024/18340
Li, D. X. (2000). On default correlation: A copula function approach. Journal of Fixed Income, 9(4), 43–54. https://doi.org/10.3905/jfi.2000.319253
Li, F. (2008). Annual report readability, current earnings, and earnings persistence. Journal of Accounting and Economics, 45(2–3), 221–247. https://doi.org/10.1016/j.jacceco.2008.02.003
Li, F. (2010). The information content of forward-looking statements in corporate filings: A naïve Bayesian machine learning approach. Journal of Accounting Research, 48(5), 1049–1102. https://doi.org/10.1111/j.1475-679X.2010.00382.x
Li, O., Liu, H., Chen, C., & Rudin, C. (2018). Deep learning for case-based reasoning through prototypes: A neural network that explains its predictions. Proceedings of the 32nd AAAI Conference on Artificial Intelligence, 3530–3537.
Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2020). Federated learning: Challenges, methods, and future directions. IEEE Signal Processing Magazine, 37(3), 50–60. https://doi.org/10.1109/MSP.2020.2975749
Liang, D., Lu, C.-C., Tsai, C.-F., & Shih, G.-A. (2016). Financial ratios and corporate governance indicators in bankruptcy prediction: A comprehensive study. European Journal of Operational Research, 252(2), 561–572.
Liberti, J. M., & Petersen, M. A. (2019). Information: Hard and soft. Review of Corporate Finance Studies, 8(1), 1–41. https://doi.org/10.1093/rcfs/cfy009
Lim, B., Alaa, A. M., & Schaar, M. van der. (2018). Forecasting treatment responses over time using recurrent marginal structural networks. Advances in Neural Information Processing Systems (NeurIPS), 31.
Lim, B., Arık, S. Ö., Loeff, N., & Pfister, T. (2021). Temporal fusion transformers for interpretable multi-horizon time series forecasting. International Journal of Forecasting, 37(4), 1748–1764. https://doi.org/10.1016/j.ijforecast.2021.03.012
Lin, H.-T., Lin, C.-J., & Weng, R. C. (2007). A note on Platt’s probabilistic outputs for support vector machines. Machine Learning, 68(3), 267–276. https://doi.org/10.1007/s10994-007-5018-6
Lin, M., Prabhala, N. R., & Viswanathan, S. (2013). Judging borrowers by the company they keep: Friendship networks and information asymmetry in online peer-to-peer lending. Management Science, 59(1), 17–35. https://doi.org/10.1287/mnsc.1120.1560
Lipton, Z. C. (2018). The mythos of model interpretability. Communications of the ACM, 61(10), 36–43. https://doi.org/10.1145/3233231
Lipton, Z. C., Wang, Y.-X., & Smola, A. (2018). Detecting and correcting for label shift with black box predictors. International Conference on Machine Learning (ICML), 3122–3130.
Little, R. J. A. (1993). Pattern-mixture models for multivariate incomplete data. Journal of the American Statistical Association, 88(421), 125–134. https://doi.org/10.1080/01621459.1993.10594302
Little, R. J. A., & Rubin, D. B. (2019). Statistical analysis with missing data.
Liu, Y., Hu, T., Zhang, H., Wu, H., Wang, S., Ma, L., & Long, M. (2024). iTransformer: Inverted transformers are effective for time series forecasting. Proceedings of the International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=JePfAI8fah
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O., Lewis, M., Zettlemoyer, L., & Stoyanov, V. (2020). RoBERTa: A robustly optimized BERT pretraining approach. Proceedings of ICLR (Workshop Track).
Löffler, G. (2004). An anatomy of rating through the cycle. Journal of Banking & Finance, 28(3), 695–720. https://doi.org/10.1016/S0378-4266(03)00041-4
Löffler, G. (2013). Can rating agencies look through the cycle? Review of Quantitative Finance and Accounting, 40(4), 623–646. https://doi.org/10.1007/s11156-012-0289-9
Löffler, G., & Posch, P. N. (2011). Credit risk modeling using Excel and VBA (2nd ed.). Wiley Finance.
Loh, W.-Y. (2014). Fifty years of classification and regression trees. International Statistical Review, 82(3), 329–348. https://doi.org/10.1111/insr.12016
Longstaff, F. A., Pan, J., Pedersen, L. H., & Singleton, K. J. (2011). How sovereign is sovereign credit risk? American Economic Journal: Macroeconomics, 3(2), 75–103. https://doi.org/10.1257/mac.3.2.75
Longstaff, F. A., & Schwartz, E. S. (1995). A simple approach to valuing risky fixed and floating rate debt. The Journal of Finance, 50(3), 789–819. https://doi.org/10.2307/2329288
López de Prado, M. (2018). Advances in financial machine learning.
Loshchilov, I., & Hutter, F. (2019). Decoupled weight decay regularization. International Conference on Learning Representations (ICLR).
Lou, Y., Caruana, R., Gehrke, J., & Hooker, G. (2013). Accurate intelligible models with pairwise interactions. Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 623–631. https://doi.org/10.1145/2487575.2487579
Loughran, T., & McDonald, B. (2011). When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. The Journal of Finance, 66(1), 35–65. https://doi.org/10.1111/j.1540-6261.2010.01625.x
Loughran, T., & McDonald, B. (2016). Textual analysis in accounting and finance: A survey. Journal of Accounting Research, 54(4), 1187–1230. https://doi.org/10.1111/1475-679X.12123
Loukas, L., Stogiannidis, I., Diamantopoulos, O., Malakasiotis, P., & Vassos, S. (2023). Making LLMs worth every penny: Resource-limited text classification in banking. Proceedings of the Fourth ACM International Conference on AI in Finance (ICAIF), 392–400. https://doi.org/10.1145/3604237.3626891
Loutskina, E. (2011). The role of securitization in bank liquidity and funding management. Journal of Financial Economics, 100(3), 663–684. https://doi.org/10.1016/j.jfineco.2011.02.005
Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., & Zhang, G. (2019). Learning under concept drift: A review. IEEE Transactions on Knowledge and Data Engineering, 31(12), 2346–2363. https://doi.org/10.1109/TKDE.2018.2876857
Lu, T., Zhang, Y., & Li, B. (2023). Profit vs. Equality? The case of financial risk assessment and a new perspective on alternative data. MIS Quarterly, 47(4), 1517–1556. https://doi.org/10.25300/MISQ/2023/17330
Lu, Y., Bartolo, M., Moore, A., Riedel, S., & Stenetorp, P. (2022). Fantastically ordered prompts and where to find them: Overcoming few-shot prompt order sensitivity. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL), 8086–8098. https://doi.org/10.18653/v1/2022.acl-long.556
Lundberg, S. M., Erion, G. G., & Lee, S.-I. (2018). Consistent individualized feature attribution for tree ensembles. ICML Workshop on Human Interpretability in Machine Learning.
Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., & Lee, S.-I. (2020). From local explanations to global understanding with explainable AI for trees. Nature Machine Intelligence, 2(1), 56–67. https://doi.org/10.1038/s42256-019-0138-9
Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 30.
Luo, D., Cheng, W., Xu, D., Yu, W., Zong, B., Chen, H., & Zhang, X. (2020). Parameterized explainer for graph neural network. Advances in Neural Information Processing Systems 33 (NeurIPS 2020).
MacKay, D. J. C. (1992). A practical Bayesian framework for backpropagation networks. Neural Computation, 4(3), 448–472. https://doi.org/10.1162/neco.1992.4.3.448
MacKinlay, A. C. (1997). Event studies in economics and finance. Journal of Economic Literature, 35(1), 13–39.
Madras, D., Creager, E., Pitassi, T., & Zemel, R. (2018). Learning adversarially fair and transferable representations. Proceedings of the 35th International Conference on Machine Learning (ICML), 3384–3393.
Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A. (2018). Towards deep learning models resistant to adversarial attacks. International Conference on Learning Representations (ICLR).
Mahalanobis, P. C. (1936). On the generalised distance in statistics. Proceedings of the National Institute of Sciences of India, 2(1), 49–55.
Mahoney, N. (2015). Bankruptcy as implicit health insurance. American Economic Review, 105(2), 710–746. https://doi.org/10.1257/aer.20131408
Malesky, E., & Taussig, M. (2009). Out of the gray: The impact of provincial institutions on business formalization in vietnam. Journal of East Asian Studies, 9(2), 249–290.
Malgieri, G., & Comandé, G. (2017). Why a right to legibility of automated decision-making exists in the general data protection regulation. International Data Privacy Law, 7(4), 243–265. https://doi.org/10.1093/idpl/ipx019
Malik, M., & Thomas, L. C. (2010). Modelling credit risk of portfolio of consumer loans. Journal of the Operational Research Society, 61(3), 411–420. https://doi.org/10.1057/jors.2009.123
Mancisidor, R. A., Kampffmeyer, M., Aas, K., & Jenssen, R. (2020). Deep generative models for reject inference in credit scoring. Knowledge-Based Systems, 196, 105758. https://doi.org/10.1016/j.knosys.2020.105758
Manela, A., & Moreira, A. (2017). News implied volatility and disaster concerns. Journal of Financial Economics, 123(1), 137–162. https://doi.org/10.1016/j.jfineco.2016.01.032
Mani, I., & Zhang, I. (2003). kNN approach to unbalanced data distributions: A case study involving information extraction. Proceedings of the ICML Workshop on Learning from Imbalanced Datasets.
Mann, H. B., & Whitney, D. R. (1947). On a test of whether one of two random variables is stochastically larger than the other. The Annals of Mathematical Statistics, 18(1), 50–60. https://doi.org/10.1214/aoms/1177730491
Manski, C. F. (1989). Anatomy of the selection problem. Journal of Human Resources, 24(3), 343–360. https://doi.org/10.2307/145818
Manski, C. F. (1990). Nonparametric bounds on treatment effects. American Economic Review, 80(2), 319–323.
Manski, C. F. (1993). Identification of endogenous social effects: The reflection problem. The Review of Economic Studies, 60(3), 531–542. https://doi.org/10.2307/2298123
Marchenko, Y. V., & Genton, M. G. (2012). A heckman selection-t model. Journal of the American Statistical Association, 107(497), 304–317. https://doi.org/10.1080/01621459.2012.656011
Marqués, A. I., García, V., & Sánchez, J. S. (2013). On the suitability of resampling techniques for the class imbalance problem in credit scoring. Journal of the Operational Research Society, 64(7), 1060–1070. https://doi.org/10.1057/jors.2012.120
Marra, G., & Radice, R. (2013). A penalized likelihood estimation approach to semiparametric sample selection binary response modeling. Electronic Journal of Statistics, 7, 1432–1455. https://doi.org/10.1214/13-EJS814
Marra, G., & Radice, R. (2017). Bivariate copula additive models for location, scale and shape. Computational Statistics and Data Analysis, 112, 99–113. https://doi.org/10.1016/j.csda.2017.03.004
Martin, K. D., Borah, A., & Palmatier, R. W. (2017). Data privacy: Effects on customer and firm performance. Journal of Marketing, 81(1), 36–58. https://doi.org/10.1509/jm.15.0497
Martins, A., & Astudillo, R. (2016). From softmax to sparsemax: A sparse model of attention and multi-label classification. International Conference on Machine Learning, 1614–1623.
Mason, K. O., Mason, W. M., Winsborough, H. H., & Poole, W. K. (1973). Some methodological issues in cohort analysis of archival data. American Sociological Review, 38(2), 242–258. https://doi.org/10.2307/2094398
Mason, L., Baxter, J., Bartlett, P., & Frean, M. (1999). Boosting algorithms as gradient descent.
Mattei, P.-A., & Frellsen, J. (2019). MIWAE: Deep generative modelling and imputation of incomplete data sets. Proceedings of the 36th International Conference on Machine Learning (ICML).
Matz, S. C., Kosinski, M., Nave, G., & Stillwell, D. J. (2017). Psychological targeting as an effective approach to digital mass persuasion. Proceedings of the National Academy of Sciences, 114(48), 12714–12719. https://doi.org/10.1073/pnas.1710966114
Mayew, W. J., & Venkatachalam, M. (2012). The power of voice: Managerial affective states and future firm performance. The Journal of Finance, 67(1), 1–43. https://doi.org/10.1111/j.1540-6261.2011.01705.x
Mazumder, R., Hastie, T., & Tibshirani, R. (2010). Spectral regularization algorithms for learning large incomplete matrices. Journal of Machine Learning Research, 11, 2287–2322.
Mbiti, I., & Weil, D. N. (2011). Mobile banking: The impact of m-pesa in kenya. NBER Working Paper, (17129). https://doi.org/10.3386/w17129
McClish, D. K. (1989). Analyzing a portion of the ROC curve. Medical Decision Making, 9(3), 190–195. https://doi.org/10.1177/0272989X8900900307
McCrary, J. (2008). Manipulation of the running variable in the regression discontinuity design: A density test. Journal of Econometrics, 142(2), 698–714. https://doi.org/10.1016/j.jeconom.2007.05.005
McCullagh, P., & Nelder, J. A. (1989a). Generalized linear models.
McCullagh, P., & Nelder, J. A. (1989b). Generalized linear models (2nd ed.). Chapman; Hall/CRC. https://doi.org/10.1201/9780203753736
McFadden, D. (1974). Conditional logit analysis of qualitative choice behavior. 105–142.
McKenzie, D., & Paffhausen, A. L. (2019). Small firm death in developing countries. Review of Economics and Statistics, 101(4), 645–657. https://doi.org/10.1162/rest_a_00798
McMahan, B., Moore, E., Ramage, D., Hampson, S., & Agüera y Arcas, B. (2017). Communication-efficient learning of deep networks from decentralized data. Proceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), 1273–1282.
McNeil, A. J., Frey, R., & Embrechts, P. (2015). Quantitative risk management: Concepts, techniques and tools.
Mease, D., & Wyner, A. (2008). Evidence contrary to the statistical view of boosting. Journal of Machine Learning Research, 9, 131–156.
Medina, P. C. (2021). Side effects of nudging: Evidence from a randomized intervention in the credit card market. Review of Financial Studies, 34(5), 2580–2607. https://doi.org/10.1093/rfs/hhaa108
Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on bias and fairness in machine learning. ACM Computing Surveys, 54(6), 1–35. https://doi.org/10.1145/3457607
Meinshausen, N., & Bühlmann, P. (2010). Stability selection. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 72(4), 417–473. https://doi.org/10.1111/j.1467-9868.2010.00740.x
Melnychuk, V., Frauen, D., & Feuerriegel, S. (2022). Causal transformer for estimating counterfactual outcomes. International Conference on Machine Learning (ICML).
Mercer, J. (1909). Functions of positive and negative type, and their connection with the theory of integral equations. Philosophical Transactions of the Royal Society of London. Series A, 209, 415–446. https://doi.org/10.1098/rsta.1909.0016
Merrick, L., & Taly, A. (2020). The explanation game: Explaining machine learning models using Shapley values. 17–38. https://doi.org/10.1007/978-3-030-57321-8\_2
Merton, R. C. (1974). On the pricing of corporate debt: The risk structure of interest rates. The Journal of Finance, 29(2), 449–470. https://doi.org/10.2307/2978814
Mester, L. J. (1997). What’s the point of credit scoring? Federal Reserve Bank of Philadelphia Business Review, 3–16.
Mian, A., & Sufi, A. (2009). The consequences of mortgage credit expansion: Evidence from the U.S. Mortgage default crisis. The Quarterly Journal of Economics, 124(4), 1449–1496. https://doi.org/10.1162/qjec.2009.124.4.1449
Mian, A., Sufi, A., & Verner, E. (2017). Household debt and business cycles worldwide. Quarterly Journal of Economics, 132(4), 1755–1817. https://doi.org/10.1093/qje/qjx017
Miao, W., Liu, L., Tchetgen Tchetgen, E. J., & Geng, Z. (2024). Identification, doubly robust estimation, and semiparametric efficiency theory of nonignorable missing data with a shadow variable. Annals of Statistics, 52(4), 1448–1473. https://doi.org/10.1214/24-AOS2391
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. Proceedings of the International Conference on Learning Representations (ICLR).
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems (NeurIPS).
Miller, A. R., & Tucker, C. E. (2018). Privacy protection, personalized medicine, and genetic testing. Management Science, 64(10), 4648–4668. https://doi.org/10.1287/mnsc.2017.2858
Miller, T. (2019). Explanation in artificial intelligence: Insights from the social sciences. Artificial Intelligence, 267, 1–38. https://doi.org/10.1016/j.artint.2018.07.007
Ministry of Finance of Vietnam. (2014). Vietnamese accounting standards framework and circular 200/2014/TT-BTC on the corporate accounting regime. Hanoi. https://mof.gov.vn/
Ministry of Finance of Vietnam. (2020). Decision no. 345/QD-BTC approving the scheme on application of financial reporting standards in Vietnam. Ministry of Finance of Vietnam. https://mof.gov.vn/
Mironov, I. (2017). Rényi differential privacy. Proceedings of the IEEE 30th Computer Security Foundations Symposium (CSF), 263–275. https://doi.org/10.1109/CSF.2017.11
Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L., Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019). Model cards for model reporting. Proceedings of the Conference on Fairness, Accountability, and Transparency, 220–229. https://doi.org/10.1145/3287560.3287596
Miu, P., & Ozdemir, B. (2006). Basel requirements of downturn loss given default: Modeling and estimating probability of default and loss given default correlations. Journal of Credit Risk, 2(2), 43–68. https://doi.org/10.21314/JCR.2006.038
Molnar, C. (2022). Interpretable machine learning.
Montiel Olea, J. L., & Pflueger, C. (2013). A robust test for weak instruments. Journal of Business and Economic Statistics, 31(3), 358–369. https://doi.org/10.1080/00401706.2013.806694
Moreno-Torres, J. G., Raeder, T., Alaiz-Rodrı́guez, R., Chawla, N. V., & Herrera, F. (2012). A unifying view on dataset shift in classification. Pattern Recognition, 45(1), 521–530.
Moritz, P., Nishihara, R., Wang, S., Tumanov, A., Liaw, R., Liang, E., Elibol, M., Yang, Z., Paul, W., Jordan, M. I., & Stoica, I. (2018). Ray: A distributed framework for emerging AI applications. USENIX Symposium on Operating Systems Design and Implementation (OSDI), 561–577.
Morse, A. (2015). Peer-to-peer crowdfunding: Information and the potential for disruption in consumer lending. Annual Review of Financial Economics, 7, 463–482. https://doi.org/10.1146/annurev-financial-111914-041939
Moscatelli, M., Parlapiano, F., Narizzano, S., & Viggiano, G. (2020). Corporate default forecasting with machine learning. Expert Systems with Applications, 161, 113567. https://doi.org/10.1016/j.eswa.2020.113567
Moscato, V., Picariello, A., & Sperlì, G. (2021). A benchmark of machine learning approaches for credit score prediction. Expert Systems with Applications, 165, 113986. https://doi.org/10.1016/j.eswa.2020.113986
Mothilal, R. K., Sharma, A., & Tan, C. (2020). Explaining machine learning classifiers through diverse counterfactual explanations. Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, 607–617. https://doi.org/10.1145/3351095.3372850
M_Service Joint Stock Company. (2022). MoMo alternative credit scoring pilot with TPBank and consumer finance partners. Company press release, Ho Chi Minh City. https://momo.vn/
Munnell, A. H., Tootell, G. M. B., Browne, L. E., & McEneaney, J. (1996). Mortgage lending in Boston: Interpreting HMDA data. American Economic Review, 86(1), 25–53.
Murfin, J., & Spiegel, M. (2020). Is the risk of sea level rise capitalized in residential real estate? Review of Financial Studies, 33(3), 1217–1255. https://doi.org/10.1093/rfs/hhz134
Murphy, A. H. (1973). A new vector partition of the probability score. Journal of Applied Meteorology, 12(4), 595–600. https://doi.org/10.1175/1520-0450(1973)012<0595:ANVPOT>2.0.CO;2
Murphy, K. M., & Topel, R. H. (1985). Estimation and inference in two-step econometric models. Journal of Business and Economic Statistics, 3(4), 370–379. https://doi.org/10.1080/07350015.1985.10509471
Myers, J. H., & Forgy, E. W. (1963). The development of numerical credit evaluation systems. Journal of the American Statistical Association, 58(303), 799–806. https://doi.org/10.2307/2282727
Nakkiran, P., Kaplun, G., Bansal, Y., Yang, T., Barak, B., & Sutskever, I. (2020). Deep double descent: Where bigger models and more data hurt. International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=B1g5sA4twr
Nakkiran, P., Venkat, P., Kakade, S., & Ma, T. (2021). Optimal regularization can mitigate double descent. International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=7R7fAoUygoa
Narain, B. (1992). Survival analysis and the credit granting decision. Credit Scoring and Credit Control, Oxford University Press, 109–121.
National Assembly of Vietnam. (2006). Law on gender equality, no. 73/2006/QH11. Hanoi. https://vanbanphapluat.co/
National Assembly of Vietnam. (2010). Law on persons with disabilities, no. 51/2010/QH12. Hanoi. https://vanbanphapluat.co/
National Assembly of Vietnam. (2018). Law on cybersecurity, no. 24/2018/QH14. Hanoi. https://vanbanphapluat.co/
National Credit Information Centre of Vietnam. (2023). Annual report of the Credit Information Centre (CIC). State Bank of Vietnam. https://cic.gov.vn/
National Institute of Standards and Technology. (2023). Artificial intelligence risk management framework (AI RMF 1.0) [NIST AI 100-1]. U.S. Department of Commerce. https://doi.org/10.6028/NIST.AI.100-1
National Payment Corporation of Vietnam. (2023). NAPAS annual report on interbank electronic payment switching. Hanoi. https://napas.com.vn/
Navas-Palencia, G. (2020). Optimal binning: Mathematical programming formulation. arXiv Preprint arXiv:2001.08025.
Neal, R. M. (1996). Bayesian learning for neural networks. Lecture Notes in Statistics, 118.
Nelder, J. A., & Wedderburn, R. W. M. (1972). Generalized linear models. Journal of the Royal Statistical Society. Series A (General), 135(3), 370–384. https://doi.org/10.2307/2344614
Nelsen, R. B. (2006). An introduction to copulas (2nd ed.). Springer. https://doi.org/10.1007/0-387-28678-0
Nelson, S. (2024). Private information and price regulation in the US credit card market. Working Paper, Chicago Booth.
Nemenyi, P. B. (1963). Distribution-free multiple comparisons. Princeton University Press.
Network for Greening the Financial System. (2022). NGFS climate scenarios for central banks and supervisors. NGFS.
Netzer, O., Lemaire, A., & Herzenstein, M. (2019). When words sweat: Identifying signals for loan default in the text of loan applications. Journal of Marketing Research, 56(6), 960–980. https://doi.org/10.1177/0022243719852959
Newman, M. E. J. (2003). The structure and function of complex networks. SIAM Review, 45(2), 167–256. https://doi.org/10.1137/S003614450342480
Newman, M. E. J. (2005). A measure of betweenness centrality based on random walks. Social Networks, 27(1), 39–54. https://doi.org/10.1016/j.socnet.2004.11.009
Neyman, J. (1959). Optimal asymptotic tests of composite statistical hypotheses. Probability and Statistics, 213–234.
Neyman, J., & Scott, E. L. (1948). Consistent estimates based on partially consistent observations. Econometrica, 16(1), 1–32. https://doi.org/10.2307/1914288
Ngai, E. W. T., Hu, Y., Wong, Y. H., Chen, Y., & Sun, X. (2011). The application of data mining techniques in financial fraud detection: A classification framework and an academic review of literature. Decision Support Systems, 50(3), 559–569. https://doi.org/10.1016/j.dss.2010.08.006
Nguyen, D. Q., & Nguyen, A.-T. (2020). PhoBERT: Pre-trained language models for vietnamese. Findings of the Association for Computational Linguistics: EMNLP 2020, 1037–1042.
Nickell, P., Perraudin, W., & Varotto, S. (2000). Stability of rating transitions. Journal of Banking and Finance, 24(1-2), 203–227. https://doi.org/10.1016/S0378-4266(99)00057-6
Niculescu-Mizil, A., & Caruana, R. (2005). Predicting good probabilities with supervised learning. Proceedings of the 22nd International Conference on Machine Learning (ICML), 625–632. https://doi.org/10.1145/1102351.1102430
Nie, Y., Nguyen, N. H., Sinthong, P., & Kalagnanam, J. (2023). A time series is worth 64 words: Long-term forecasting with transformers. Proceedings of the International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=Jbdc0vTOcol
Nocedal, J., & Wright, S. J. (2006). Numerical optimization (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-40065-5
Nogueira, R., & Cho, K. (2019). Passage re-ranking with BERT. arXiv:1901.04085.
Nori, H., Jenkins, S., Koch, P., & Caruana, R. (2019). InterpretML: A unified framework for machine learning interpretability. https://arxiv.org/abs/1909.09223
Oaxaca, R. (1973). Male-female wage differentials in urban labor markets. International Economic Review, 14(3), 693–709. https://doi.org/10.2307/2525981
Office of the Comptroller of the Currency. (2011a). Supervisory guidance on model risk management (OCC bulletin 2011-12). https://www.occ.treas.gov/news-issuances/bulletins/2011/bulletin-2011-12.html
Office of the Comptroller of the Currency. (2011b). Supervisory guidance on model risk management (OCC bulletin 2011-12). U.S. Department of the Treasury. https://www.occ.treas.gov/news-issuances/bulletins/2011/bulletin-2011-12.html
Office of the Comptroller of the Currency. (2013). OCC bulletin 2013-29: Third-party relationships. OCC Risk Management Guidance. https://www.occ.gov/news-issuances/bulletins/2013/bulletin-2013-29.html
Office of the Comptroller of the Currency. (2015). Comptroller’s handbook: Credit card lending. Office of the Comptroller of the Currency. https://www.occ.gov/publications-and-resources/publications/comptrollers-handbook/files/credit-card-lending/index-credit-card-lending.html
Office of the Comptroller of the Currency. (2021). Model risk management: Comptroller’s handbook. OCC. https://www.occ.gov/publications-and-resources/publications/comptrollers-handbook/files/model-risk-management/index-model-risk-management.html
Ohlson, J. A. (1980). Financial ratios and the probabilistic prediction of bankruptcy. Journal of Accounting Research, 18(1), 109–131. https://doi.org/10.2307/2490395
Ólafsson, A., & Pagel, M. (2018). The liquid hand-to-mouth: Evidence from personal finance management software. The Review of Financial Studies, 31(11), 4398–4446. https://doi.org/10.1093/rfs/hhy055
Olegario, R. (2006). A culture of credit: Embedding trust and transparency in american business.
Olson, D. L., Delen, D., & Meng, Y. (2012). Comparative analysis of data mining methods for bankruptcy prediction. Decision Support Systems, 52(2), 464–473. https://doi.org/10.1016/j.dss.2011.10.007
Onnela, J.-P., Saramaki, J., Hyvonen, J., Szabo, G., Lazer, D., Kaski, K., Kertesz, J., & Barabasi, A.-L. (2007). Structure and tie strengths in mobile communication networks. Proceedings of the National Academy of Sciences, 104(18), 7332–7336. https://doi.org/10.1073/pnas.0610245104
Oreshkin, B. N., Carpov, D., Chapados, N., & Bengio, Y. (2020). N-BEATS: Neural basis expansion analysis for interpretable time series forecasting. Proceedings of the International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=r1ecqn4YwB
Orgler, Y. E. (1970). A credit scoring model for commercial loans. Journal of Money, Credit and Banking, 2(4), 435–445. https://doi.org/10.2307/1991095
Orús, R., Mugel, S., & Lizaso, E. (2019). Quantum computing for finance: Overview and prospects. Reviews in Physics, 4, 100028. https://doi.org/10.1016/j.revip.2019.100028
Otoritas Jasa Keuangan. (2016). POJK 11/POJK.03/2016 on minimum capital adequacy requirement for commercial banks (KPMM). Indonesian Financial Services Authority. https://www.ojk.go.id/
Otoritas Jasa Keuangan. (2022). Regulation number 10/POJK.05/2022 on information technology-based lending services. Indonesian Financial Services Authority. https://www.ojk.go.id/
Otoritas Jasa Keuangan. (2023). POJK 22/2023 on consumer and community protection in the financial services sector. Indonesian Financial Services Authority. https://www.ojk.go.id/
Owen, A. B. (2014). Sobol’ indices and Shapley value. SIAM/ASA Journal on Uncertainty Quantification, 2(1), 245–251. https://doi.org/10.1137/130936233
Pagan, A., & Vella, F. (1989). Diagnostic tests for models based on individual data: A survey. Journal of Applied Econometrics, 4(S1), S29–S59. https://doi.org/10.1002/jae.3950040504
Page, E. S. (1954). Continuous inspection schemes. Biometrika, 41(1/2), 100–115. https://doi.org/10.2307/2333009
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999a). The PageRank citation ranking: Bringing order to the web. Stanford InfoLab Technical Report.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999b). The PageRank citation ranking: Bringing order to the Web. Stanford InfoLab Technical Report.
Paleyes, A., Urma, R.-G., & Lawrence, N. D. (2022). Challenges in deploying machine learning: A survey of case studies. ACM Computing Surveys, 55(6), 114:1–114:29. https://doi.org/10.1145/3533378
Papadopoulos, H., Proedrou, K., Vovk, V., & Gammerman, A. (2002). Inductive confidence machines for regression. European Conference on Machine Learning (ECML), 345–356. https://doi.org/10.1007/3-540-36755-1\_29
Paravisini, D., & Schoar, A. (2015). The incentive effect of scores: Randomized evidence from credit committees (NBER Working Paper 19303). National Bureau of Economic Research. https://doi.org/10.3386/w19303
Parikh, N., & Boyd, S. (2014). Proximal algorithms. Foundations and Trends in Optimization, 1(3), 127–239. https://doi.org/10.1561/2400000003
Park, M. Y., & Hastie, T. (2007). L1-regularization path algorithm for generalized linear models. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 69(4), 659–677. https://doi.org/10.1111/j.1467-9868.2007.00607.x
Parlour, C. A., Rajan, U., & Walden, J. (2022). Payment system externalities. The Journal of Finance, 77(2), 1019–1053. https://doi.org/10.1111/jofi.13110
Parlour, C. A., Rajan, U., & Zhu, H. (2022). When FinTech competes for payment flows. Review of Financial Studies, 35(11), 4985–5024. https://doi.org/10.1093/rfs/hhac022
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., et al. (2019). PyTorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems, 32.
Pattabhiramaiah, A., Sriram, S., & Sridhar, S. (2018). Rising prices under declining preferences: The case of the U.S. Print newspaper industry. Marketing Science, 37(1), 97–122. https://doi.org/10.1287/mksc.2017.1051
Pearl, J. (1995). Causal diagrams for empirical research. Biometrika, 82(4), 669–688. https://doi.org/10.1093/biomet/82.4.669
Pearl, J. (2009). Causality: Models, reasoning, and inference (2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511803161
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, 2825–2830.
Pennington, J., Socher, R., & Manning, C. D. (2014). GloVe: Global vectors for word representation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). https://doi.org/10.3115/v1/D14-1162
Perdomo, J. C., Zrnic, T., Mendler-Dünner, C., & Hardt, M. (2020). Performative prediction. Proceedings of the 37th International Conference on Machine Learning (ICML).
Perozzi, B., Al-Rfou, R., & Skiena, S. (2014). DeepWalk: Online learning of social representations. Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 701–710. https://doi.org/10.1145/2623330.2623732
Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K., & Zettlemoyer, L. (2018). Deep contextualized word representations. Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). https://doi.org/10.18653/v1/N18-1202
Petersen, M. A., & Rajan, R. G. (1994). The benefits of lending relationships: Evidence from small business data. Journal of Finance, 49(1), 3–37. https://doi.org/10.1111/j.1540-6261.1994.tb04418.x
Petersen, M. A., & Rajan, R. G. (2002). Does distance still matter? The information revolution in small business lending. Journal of Finance, 57(6), 2533–2570. https://doi.org/10.1111/1540-6261.00505
Petsiuk, V., Das, A., & Saenko, K. (2018). RISE: Randomized input sampling for explanation of black-box models. British Machine Vision Conference (BMVC).
Phan, L., Tran, H., Nguyen, H., & Trinh, T. H. (2022). ViT5: Pretrained text-to-text transformer for vietnamese language generation. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 136–142.
Philippon, T. (2016). The FinTech opportunity (NBER Working Paper 22476). National Bureau of Economic Research. https://doi.org/10.3386/w22476
Philippon, T. (2020). On fintech and financial inclusion. NBER Working Paper, (26330).
Pineau, J., Vincent-Lamarre, P., Sinha, K., Larivière, V., Beygelzimer, A., d’Alché-Buc, F., Fox, E., & Larochelle, H. (2021). Improving reproducibility in machine learning research: A report from the NeurIPS 2019 reproducibility program. Journal of Machine Learning Research, 22, 1–20. https://jmlr.org/papers/v22/20-303.html
Piskorski, T., Seru, A., & Witkin, J. (2015). Asset quality misrepresentation by financial intermediaries: Evidence from the RMBS market. The Journal of Finance, 70(6), 2635–2678. https://doi.org/10.1111/jofi.12271
Platt, J. C. (1998). Sequential minimal optimization: A fast algorithm for training support vector machines (Technical Report MSR-TR-98-14). Microsoft Research.
Platt, J. C. (1999). Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. 61–74.
Pleiss, G., Raghavan, M., Wu, F., Kleinberg, J., & Weinberger, K. Q. (2017). On fairness and calibration. Advances in Neural Information Processing Systems 30 (NIPS 2017).
Plosser, M. C., & Santos, J. A. C. (2018). Banks’ incentives and inconsistent risk models. The Review of Financial Studies, 31(6), 2080–2112. https://doi.org/10.1093/rfs/hhy028
Pluto, K., & Tasche, D. (2005a). Thinking positively. Risk, 18(8), 72–78.
Pluto, K., & Tasche, D. (2005b). Thinking positively. Risk Magazine.
Polyzotis, N., Roy, S., Whang, S. E., & Zinkevich, M. (2018). Data lifecycle challenges in production machine learning: A survey. ACM SIGMOD Record, 47, 17–28. https://doi.org/10.1145/3299887.3299891
Pope, D. G., & Sydnor, J. R. (2011). What’s in a picture? Evidence of discrimination from Prosper.com. Journal of Human Resources, 46(1), 53–92. https://doi.org/10.3368/jhr.46.1.53
Popov, S., Morozov, S., & Babenko, A. (2020). Neural oblivious decision ensembles for deep learning on tabular data. International Conference on Learning Representations (ICLR).
Potharst, R., & Feelders, A. J. (2002). Classification trees for problems with monotonicity constraints. ACM SIGKDD Explorations Newsletter, 4(1), 1–10. https://doi.org/10.1145/568574.568577
Poyiadzi, R., Sokol, K., Santos-Rodriguez, R., De Bie, T., & Flach, P. (2020). FACE: Feasible and actionable counterfactual explanations. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 344–350. https://doi.org/10.1145/3375627.3375850
Prechelt, L. (1998). Early stopping—but when? Neural Networks: Tricks of the Trade, Lecture Notes in Computer Science, 1524, 55–69. https://doi.org/10.1007/3-540-49430-8_3
Pregibon, D. (1980). Goodness of link tests for generalized linear models. Journal of the Royal Statistical Society Series C: Applied Statistics, 29(1), 15–24.
Prentice, R. L., & Gloeckler, L. A. (1978). Regression analysis of grouped survival data with application to breast cancer data. Biometrics, 34(1), 57–67. https://doi.org/10.2307/2529588
Prentice, R. L., Kalbfleisch, J. D., Peterson, A. V., Flournoy, N., Farewell, V. T., & Breslow, N. E. (1978). The analysis of failure times in the presence of competing risks. Biometrics, 34(4), 541–554. https://doi.org/10.2307/2530374
Preskill, J. (2018). Quantum computing in the NISQ era and beyond. Quantum, 2, 79. https://doi.org/10.22331/q-2018-08-06-79
Press, S. J., & Wilson, S. (1978). Choosing between logistic regression and discriminant analysis. Journal of the American Statistical Association, 73(364), 699–705. https://doi.org/10.2307/2286261
Prieger, J. E. (2003). A flexible parametric selection model for non-normal data with application to health care usage. Journal of Applied Econometrics, 18(3), 367–392. https://doi.org/10.1002/jae.696
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018). CatBoost: Unbiased boosting with categorical features. Advances in Neural Information Processing Systems 31 (NeurIPS 2018).
Provost, F., & Fawcett, T. (2001). Robust classification for imprecise environments. Machine Learning, 42(3), 203–231. https://doi.org/10.1023/A:1007601015854
Prudential Regulation Authority. (2018). Model risk management principles for stress testing (SS3/18). Bank of England. https://www.bankofengland.co.uk/prudential-regulation/publication/2018/model-risk-management-principles-for-stress-testing
Prudential Regulation Authority. (2023). Supervisory statement SS1/23: Model risk management principles for banks. Bank of England. https://www.bankofengland.co.uk/prudential-regulation/publication/2023/may/model-risk-management-principles-for-banks-ss
Puhani, P. A. (2000). The heckman correction for sample selection and its critique. Journal of Economic Surveys, 14(1), 53–68. https://doi.org/10.1111/1467-6419.00104
Purda, L., & Skillicorn, D. (2015). Accounting variables, deception, and a bag of words: Assessing the tools of fraud detection. Contemporary Accounting Research, 32(3), 1193–1223. https://doi.org/10.1111/1911-3846.12089
Qi, M., & Zhao, X. (2011). Comparison of modeling methods for loss given default. Journal of Banking and Finance, 35(11), 2842–2855. https://doi.org/10.1016/j.jbankfin.2011.03.011
Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1(1), 81–106. https://doi.org/10.1007/BF00116251
Quinlan, J. R. (1993). C4.5: Programs for machine learning. Morgan Kaufmann.
Quiñonero-Candela, J., Sugiyama, M., Schwaighofer, A., & Lawrence, N. D. (2009). Dataset shift in machine learning. MIT Press.
Rabanser, S., Günnemann, S., & Lipton, Z. C. (2019). Failing loudly: An empirical study of methods for detecting dataset shift. Advances in Neural Information Processing Systems (NeurIPS), 32, 1394–1406.
Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), 257–286. https://doi.org/10.1109/5.18626
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., & Sutskever, I. (2021). Learning transferable visual models from natural language supervision. Proceedings of the 38th International Conference on Machine Learning (ICML).
Rafieian, O., & Yoganarasimhan, H. (2021). Targeting and privacy in mobile advertising. Marketing Science, 40(2), 193–218. https://doi.org/10.1287/mksc.2020.1235
Raftery, A. E. (1995). Bayesian model selection in social research. Sociological Methodology, 25, 111–163. https://doi.org/10.2307/271063
Rahimi, A., & Recht, B. (2007). Random features for large-scale kernel machines. Advances in Neural Information Processing Systems 20 (NIPS 2007).
Rajan, R. G. (1992). Insiders and outsiders: The choice between informed and arm’s-length debt. Journal of Finance, 47(4), 1367–1400. https://doi.org/10.1111/j.1540-6261.1992.tb04662.x
Rajan, U., Seru, A., & Vig, V. (2010). Statistical default models and incentives. American Economic Review Papers and Proceedings, 100(2), 506–510. https://doi.org/10.1257/aer.100.2.506
Rajan, U., Seru, A., & Vig, V. (2015). The failure of models that predict failure: Distance, incentives, and defaults. Journal of Financial Economics, 115(2), 237–260. https://doi.org/10.1016/j.jfineco.2014.09.012
Rambachan, A., Kleinberg, J., Ludwig, J., & Mullainathan, S. (2020). An economic perspective on algorithmic fairness. AEA Papers and Proceedings, 110, 91–95. https://doi.org/10.1257/pandp.20201036
Rambachan, A., & Roth, J. (2023). A more credible approach to parallel trends. Review of Economic Studies, 90(5), 2555–2591. https://doi.org/10.1093/restud/rdad018
Rao, C. R. (1948). The utilization of multiple measurements in problems of biological classification. Journal of the Royal Statistical Society. Series B (Methodological), 10(2), 159–203. https://doi.org/10.1111/j.2517-6161.1948.tb00008.x
Rasul, K., Ashok, A., Williams, A. R., Ghonia, H., Bhagwatkar, R., Khorasani, A., Bayazi, M. J. D., Adamopoulos, G., Riachi, R., Hassen, N., Biloš, M., Garg, S., Schneider, A., Chapados, N., Drouin, A., Zantedeschi, V., Nevmyvaka, Y., & Rish, I. (2024). Lag-Llama: Towards foundation models for probabilistic time series forecasting. arXiv:2310.08278. https://arxiv.org/abs/2310.08278
Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence embeddings using siamese BERT-networks. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3982–3992. https://doi.org/10.18653/v1/D19-1410
Republic of Indonesia. (2022). Law no. 27/2022 on personal data protection (UU PDP). State Gazette of the Republic of Indonesia. https://www.bphn.go.id/
Republic of Kenya. (2019). Data protection act, 2019. Kenya Gazette Supplement No. 181, Act No. 24 of 2019. https://www.odpc.go.ke/
Republic of South Africa. (2013). Protection of personal information act (POPIA). Government Gazette. https://popia.co.za/
Reserve Bank of India. (2016). Master direction – non-banking financial company – account aggregator (Reserve Bank) directions. Reserve Bank of India. https://www.rbi.org.in/
Reserve Bank of India. (2022). Guidelines on digital lending. Reserve Bank of India. https://www.rbi.org.in/
Reserve Bank of India. (2023a). Guidelines on default loss guarantee (FLDG) in digital lending. Reserve Bank of India. https://www.rbi.org.in/
Reserve Bank of India. (2023b). Master circular: Basel III capital regulations. Reserve Bank of India. https://www.rbi.org.in/Scripts/BS_ViewMasCirculardetails.aspx
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why Should I Trust You?”: Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135–1144. https://doi.org/10.1145/2939672.2939778
Ribeiro, M. T., Singh, S., & Guestrin, C. (2018). Anchors: High-precision model-agnostic explanations. Proceedings of the AAAI Conference on Artificial Intelligence, 32. https://doi.org/10.1609/aaai.v32i1.11491
Robbins, H., & Monro, S. (1951). A stochastic approximation method. The Annals of Mathematical Statistics, 22(3), 400–407. https://doi.org/10.1214/aoms/1177729586
Robertson, S., & Zaragoza, H. (2009). The probabilistic relevance framework: BM25 and beyond. Foundations and Trends in Information Retrieval, 3(4), 333–389. https://doi.org/10.1561/1500000019
Robins, J. M., & Rotnitzky, A. (1992). Recovery of information and adjustment for dependent censoring using surrogate markers. In N. P. Jewell, K. Dietz, & V. T. Farewell (Eds.), AIDS epidemiology: Methodological issues (pp. 297–331). Birkhäuser. https://doi.org/10.1007/978-1-4757-1229-2_14
Robins, J. M., Rotnitzky, A., & Scharfstein, D. O. (2000). Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In Statistical models in epidemiology, the environment, and clinical trials (Vol. 116, pp. 1–94). Springer. https://doi.org/10.1007/978-1-4612-1284-3_1
Robins, J. M., Rotnitzky, A., & Zhao, L. P. (1994). Estimation of regression coefficients when some regressors are not always observed. Journal of the American Statistical Association, 89(427), 846–866. https://doi.org/10.1080/01621459.1994.10476818
Robinson, P. M. (1988). Root-N-consistent semiparametric regression. Econometrica, 56(4), 931–954. https://doi.org/10.2307/1912705
Rogers, A., Kovaleva, O., & Rumshisky, A. (2020). A primer in BERTology: What we know about how BERT works. Transactions of the Association for Computational Linguistics, 8, 842–866. https://doi.org/10.1162/tacl_a_00349
Romano, Y., Patterson, E., & Candès, E. J. (2019). Conformalized quantile regression. Advances in Neural Information Processing Systems 32 (NeurIPS 2019).
Romano, Y., Sesia, M., & Candès, E. J. (2020). Classification with valid and adaptive coverage. Advances in Neural Information Processing Systems, 33.
Rona-Tas, A. (2020). Predicting the future: Art and algorithms. Socio-Economic Review, 18(3), 893–911. https://doi.org/10.1093/ser/mwaa040
Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1), 41–55. https://doi.org/10.1093/biomet/70.1.41
Ross, S. L., Turner, M. A., Godfrey, E., & Smith, R. R. (2008). Mortgage lending in chicago and los angeles: A paired testing study of the pre-application process. Journal of Urban Economics, 63(3), 902–919. https://doi.org/10.1016/j.jue.2007.07.008
Roth, J., Sant’Anna, P. H. C., Bilinski, A., & Poe, J. (2023). What’s trending in difference-in-differences? A synthesis of the recent econometrics literature. Journal of Econometrics, 235(2), 2218–2244. https://doi.org/10.1016/j.jeconom.2023.03.008
Rothschild, M., & Stiglitz, J. E. (1976). Equilibrium in competitive insurance markets: An essay on the economics of imperfect information. The Quarterly Journal of Economics, 90(4), 629–649. https://doi.org/10.2307/1885326
Roure, C. de, Pelizzon, L., & Thakor, A. V. (2022). P2P lenders versus banks: Cream skimming or bottom fishing? The Review of Corporate Finance Studies, 11(2), 213–262. https://doi.org/10.1093/rcfs/cfab026
Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5), 688–701. https://doi.org/10.1037/h0037350
Rubin, D. B. (1976). Inference and missing data. Biometrika, 63(3), 581–592. https://doi.org/10.1093/biomet/63.3.581
Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5), 206–215. https://doi.org/10.1038/s42256-019-0048-x
Rudin, C., Chen, C., Chen, Z., Huang, H., Semenova, L., & Zhong, C. (2022). Interpretable machine learning: Fundamental principles and 10 grand challenges. Statistics Surveys, 16, 1–85. https://doi.org/10.1214/21-SS133
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning representations by back-propagating errors. Nature, 323(6088), 533–536. https://doi.org/10.1038/323533a0
Sadhwani, A., Giesecke, K., & Sirignano, J. (2021). Deep learning for mortgage risk. Journal of Financial Econometrics, 19(2), 313–368. https://doi.org/10.1093/jjfinec/nbaa025
Saito, T., & Rehmsmeier, M. (2015). The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLOS ONE, 10, e0118432. https://doi.org/10.1371/journal.pone.0118432
Sakurada, M., & Yairi, T. (2014). Anomaly detection using autoencoders with nonlinear dimensionality reduction. Proceedings of the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data Analysis, 4–11. https://doi.org/10.1145/2689746.2689747
Salinas, D., Flunkert, V., Gasthaus, J., & Januschowski, T. (2020). DeepAR: Probabilistic forecasting with autoregressive recurrent networks. International Journal of Forecasting, 36(3), 1181–1191. https://doi.org/10.1016/j.ijforecast.2019.07.001
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). DistilBERT, a distilled version of BERT: Smaller, faster, cheaper and lighter. NeurIPS EMC2 Workshop.
Santurkar, S., Tsipras, D., Ilyas, A., & Madry, A. (2018). How does batch normalization help optimization? Advances in Neural Information Processing Systems (NeurIPS).
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M., & Monfardini, G. (2009). The graph neural network model. IEEE Transactions on Neural Networks, 20(1), 61–80. https://doi.org/10.1109/TNN.2008.2005605
Schapire, R. E. (1990). The strength of weak learnability. Machine Learning, 5(2), 197–227. https://doi.org/10.1007/BF00116037
Scharfstein, D. O., Rotnitzky, A., & Robins, J. M. (1999). Adjusting for nonignorable drop-out using semiparametric nonresponse models. Journal of the American Statistical Association, 94(448), 1096–1120. https://doi.org/10.1080/01621459.1999.10473862
Schelter, S., Biessmann, F., Januschowski, T., Salinas, D., Seufert, S., & Szarvas, G. (2018). On challenges in machine learning model management. IEEE Data Engineering Bulletin, 41(4), 5–15.
Schmittlein, D. C., Morrison, D. G., & Colombo, R. (1987). Counting your customers: Who are they and what will they do next? Management Science, 33(1), 1–24. https://doi.org/10.1287/mnsc.33.1.1
Schölkopf, B., Platt, J. C., Shawe-Taylor, J., Smola, A. J., & Williamson, R. C. (2001). Estimating the support of a high-dimensional distribution. Neural Computation, 13(7), 1443–1471. https://doi.org/10.1162/089976601750264965
Schölkopf, B., Smola, A., & Müller, K.-R. (1998). Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation, 10(5), 1299–1319. https://doi.org/10.1162/089976698300017467
Schuermann, T., & Jafry, Y. (2004). Measurement, estimation, and comparison of credit migration matrices. Journal of Banking & Finance, 28(11), 2603–2639. https://doi.org/10.1016/j.jbankfin.2004.06.004
Schularick, M., & Taylor, A. M. (2012). Credit booms gone bust: Monetary policy, leverage cycles, and financial crises, 1870–2008. American Economic Review, 102(2), 1029–1061. https://doi.org/10.1257/aer.102.2.1029
Schwartz, E. S., & Torous, W. N. (1989). Prepayment and the valuation of mortgage-backed securities. The Journal of Finance, 44(2), 375–392. https://doi.org/10.1111/j.1540-6261.1989.tb05062.x
Schweidel, D. A., Fader, P. S., & Bradlow, E. T. (2008). Understanding service retention within and across cohorts using limited information. Journal of Marketing, 72(1), 82–94. https://doi.org/10.1509/jmkg.72.1.082
Scornet, E., Biau, G., & Vert, J.-P. (2015). Consistency of random forests. The Annals of Statistics, 43(4), 1716–1741. https://doi.org/10.1214/15-AOS1321
Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., Chaudhary, V., Young, M., Crespo, J.-F., & Dennison, D. (2015). Hidden technical debt in machine learning systems. Advances in Neural Information Processing Systems (NeurIPS), 28, 2503–2511.
Seetharaman, P. B. (2004). Modeling multiple sources of state dependence in random utility models: A distributed lag approach. Marketing Science, 23(2), 263–271. https://doi.org/10.1287/mksc.1030.0024
Seetharaman, P. B., & Chintagunta, P. K. (2003). The proportional hazard model for purchase timing: A comparison of alternative specifications. Journal of Business and Economic Statistics, 21(3), 368–382. https://doi.org/10.1198/073500103288619025
Seiffert, C., Khoshgoftaar, T. M., Van Hulse, J., & Napolitano, A. (2010). RUSBoost: A hybrid approach to alleviating class imbalance. IEEE Transactions on Systems, Man, and Cybernetics, Part A, 40(1), 185–197. https://doi.org/10.1109/TSMCA.2009.2029559
Selbst, A. D., & Powles, J. (2017). Meaningful information and the right to explanation. International Data Privacy Law, 7(4), 233–242. https://doi.org/10.1093/idpl/ipx022
Self, S. G., & Liang, K.-Y. (1987). Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions. Journal of the American Statistical Association, 82(398), 605–610. https://doi.org/10.1080/01621459.1987.10478472
Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). Grad-CAM: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 618–626. https://doi.org/10.1109/ICCV.2017.74
Sezer, O. B., Gudelek, M. U., & Ozbayoglu, A. M. (2020). Financial time series forecasting with deep learning: A systematic literature review: 2005–2019. Applied Soft Computing, 90, 106181. https://doi.org/10.1016/j.asoc.2020.106181
Shafer, G., & Vovk, V. (2008). A tutorial on conformal prediction. Journal of Machine Learning Research, 9, 371–421.
Shannon, C. E. (1948). A mathematical theory of communication. The Bell System Technical Journal, 27(3), 379–423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
Shapley, L. S. (1953). A value for n-person games. Contributions to the Theory of Games, 2(28), 307–317.
Shi, F., Chen, X., Misra, K., Scales, N., Dohan, D., Chi, E. H., Schärli, N., & Zhou, D. (2023). Large language models can be easily distracted by irrelevant context. Proceedings of the 40th International Conference on Machine Learning (ICML), 31210–31227.
Shokri, R., Stronati, M., Song, C., & Shmatikov, V. (2017). Membership inference attacks against machine learning models. Proceedings of the 2017 IEEE Symposium on Security and Privacy (SP), 3–18. https://doi.org/10.1109/SP.2017.41
Shrikumar, A., Greenside, P., & Kundaje, A. (2017). Learning important features through propagating activation differences. Proceedings of the 34th International Conference on Machine Learning, 3145–3153.
Shumway, T. (2001). Forecasting bankruptcy more accurately: A simple hazard model. The Journal of Business, 74(1), 101–124. https://doi.org/10.1086/209665
Shwartz-Ziv, R., & Armon, A. (2022a). Tabular data: Deep learning is not all you need. Information Fusion, 81, 84–90. https://doi.org/10.1016/j.inffus.2021.11.011
Shwartz-Ziv, R., & Armon, A. (2022b). Tabular data: Deep learning is not all you need. Information Fusion, 81, 84–90. https://doi.org/10.1016/j.inffus.2021.11.011
Siddiqi, N. (2017a). Intelligent credit scoring: Building and implementing better credit risk scorecards.
Siddiqi, N. (2017b). Intelligent credit scoring: Building and implementing better credit risk scorecards. John Wiley and Sons, 2nd Edition.
Sill, J. (1998). Monotonic networks. Advances in Neural Information Processing Systems (NeurIPS), 10.
Simester, D., Timoshenko, A., & Zoumpoulis, S. I. (2020). Targeting prospective customers: Robustness of machine-learning methods to typical data challenges. Management Science, 66(6), 2495–2522. https://doi.org/10.1287/mnsc.2019.3308
Sinha, R. K., & Chandrashekaran, M. (1992). A split hazard model for analyzing the diffusion of innovations. Journal of Marketing Research, 29(1), 116–127. https://doi.org/10.1177/002224379202900110
Skiba, P. M., & Tobacman, J. (2019). Do payday loans cause bankruptcy? Journal of Law and Economics, 62(3), 485–519. https://doi.org/10.1086/706201
Sklar, A. (1959). Fonctions de répartition à n dimensions et leurs marges. Publications de l’Institut de Statistique de l’Université de Paris, 8, 229–231.
Skoglund, J., & Chen, W. (2015). Financial risk management: Applications in market, credit, asset and liability management, and firmwide risk. Wiley.
Slack, D., Hilgard, S., Jia, E., Singh, S., & Lakkaraju, H. (2020). Fooling LIME and SHAP: Adversarial attacks on post hoc explanation methods. Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 180–186. https://doi.org/10.1145/3375627.3375830
Smilkov, D., Thorat, N., Kim, B., Viegas, F., & Wattenberg, M. (2017). SmoothGrad: Removing noise by adding noise. arXiv Preprint arXiv:1706.03825.
Smirnov, N. (1948). Table for estimating the goodness of fit of empirical distributions. The Annals of Mathematical Statistics, 19(2), 279–281. https://doi.org/10.1214/aoms/1177730256
Smith, M. D. (2003). Modelling sample selection using Archimedean copulas. Econometrics Journal, 6(1), 99–123. https://doi.org/10.1111/1368-423X.00101
Smith, R. J. (1989). On the use of distributional mis-specification checks in limited dependent variable models. The Economic Journal, 99(395), 178–192. https://doi.org/10.2307/2234212
Smola, A. J., & Schölkopf, B. (2004). A tutorial on support vector regression. Statistics and Computing, 14(3), 199–222. https://doi.org/10.1023/B:STCO.0000035301.49549.88
So, M. C., & Thomas, L. C. (2011). Modelling the profitability of credit cards by Markov decision processes. European Journal of Operational Research, 212(1), 123–130. https://doi.org/10.1016/j.ejor.2011.01.023
Sonnenburg, S., Braun, M. L., Ong, C. S., Bengio, S., Bottou, L., Holmes, G., LeCun, Y., Müller, K.-R., Pereira, F., Rasmussen, C. E., Rätsch, G., Schölkopf, B., Smola, A., Vincent, P., Weston, J., & Williamson, R. (2007). The need for open source software in machine learning. Journal of Machine Learning Research, 8, 2443–2466. https://www.jmlr.org/papers/v8/sonnenburg07a.html
Spärck Jones, K. (1972). A statistical interpretation of term specificity and its application in retrieval. Journal of Documentation, 28(1), 11–21. https://doi.org/10.1108/eb026526
Spence, M. (1973). Job market signaling. The Quarterly Journal of Economics, 87(3), 355–374. https://doi.org/10.2307/1882010
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1), 1929–1958.
Stadler, T., Oprisanu, B., & Troncoso, C. (2022). Synthetic data - anonymisation groundhog day. Proceedings of the 31st USENIX Security Symposium (USENIX Security), 1451–1468.
Staiger, D., & Stock, J. H. (1997). Instrumental variables regression with weak instruments. Econometrica, 65(3), 557–586. https://doi.org/10.2307/2171753
Stango, V., & Zinman, J. (2014). Limited and varying consumer attention: Evidence from shocks to the salience of bank overdraft fees. Review of Financial Studies, 27(4), 990–1030. https://doi.org/10.1093/rfs/hhu008
Stanton, R. (1995). Rational prepayment and the valuation of mortgage-backed securities. The Review of Financial Studies, 8(3), 677–708. https://doi.org/10.1093/rfs/8.3.677
State Bank of Vietnam. (2016a). Circular 39/2016/TT-NHNN on lending activities of credit institutions and foreign bank branches to customers. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2016b). Circular 41/2016/TT-NHNN on capital adequacy ratios for banks and foreign bank branches. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2018). Circular no. 13/2018/TT-NHNN on the system of internal control of commercial banks and foreign bank branches. State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2020a). Circular 16/2020/TT-NHNN on electronic know-your-customer for payment account opening. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2020b). Circular no. 16/2020/TT-NHNN amending circular 23/2014 on opening and use of payment accounts, including electronic know-your-customer (eKYC). State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2021a). Circular no. 11/2021/TT-NHNN on classification of assets, levels and method of setting up of risk provisions, and use of provisions against credit risks. State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2021b). Circular no. 11/2021/TT-NHNN on loan classification and provisioning for credit institutions. State Bank of Vietnam. https://english.luatvietnam.vn/circular-no-11-2021-tt-nhnn-dated-july-30-2021-of-the-state-bank-of-vietnam-providing-the-classification-of-assets-risk-provisioning-levels-and-met-206806-doc1.html
State Bank of Vietnam. (2021c). Decision no. 810/QD-NHNN approving the plan for digital transformation of the banking sector to 2025, orientation to 2030. State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2022). Annual report 2022. State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2023a). Circular 22/2023/TT-NHNN amending circular 41/2016/TT-NHNN on capital adequacy ratios for banks and foreign bank branches. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2023b). Decision 2345/QD-NHNN on solutions for safety and security in online payments and bank card transactions. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2024). Regulatory sandbox for fintech activities in the banking sector: Decree 94/2025/ND-CP. State Bank of Vietnam. https://www.sbv.gov.vn/
State of California. (2018). California consumer privacy act of 2018. Cal. Civ. Code §§1798.100–1798.199.
Stein, J. C. (2002). Information production and capital allocation: Decentralized versus hierarchical firms. Journal of Finance, 57(5), 1891–1921. https://doi.org/10.1111/0022-1082.00483
Steinwart, I., & Christmann, A. (2008). Support vector machines. Information Science and Statistics. https://doi.org/10.1007/978-0-387-77242-4
Stekhoven, D. J., & Bühlmann, P. (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118. https://doi.org/10.1093/bioinformatics/btr597
Stepanova, M., & Thomas, L. C. (2001). PHAB scores: Proportional hazards analysis behavioural scores. Journal of the Operational Research Society, 52(9), 1007–1016. https://doi.org/10.1057/palgrave.jors.2601189
Stepanova, M., & Thomas, L. C. (2002). Survival analysis methods for personal loan data. Operations Research, 50(2), 277–289. https://doi.org/10.1287/opre.50.2.277.426
Stevenson, M., Mues, C., & Bravo, C. (2021). The value of text for small business default prediction: A deep learning approach. European Journal of Operational Research, 295(2), 758–771. https://doi.org/10.1016/j.ejor.2021.03.008
Stiglitz, J. E., & Weiss, A. (1981). Credit rationing in markets with imperfect information. The American Economic Review, 71(3), 393–410.
Stock, J. H., Wright, J. H., & Yogo, M. (2002). A survey of weak instruments and weak identification in generalized method of moments. Journal of Business and Economic Statistics, 20(4), 518–529. https://doi.org/10.1198/073500102288618658
Stock, J. H., & Yogo, M. (2005). Testing for weak instruments in linear IV regression. Identification and Inference for Econometric Models: Essays in Honor of Thomas Rothenberg, 80–108.
Stodden, V., McNutt, M., Bailey, D. H., Deelman, E., Gil, Y., Hanson, B., Heroux, M. A., Ioannidis, J. P. A., & Taufer, M. (2016). Enhancing reproducibility for computational methods. Science, 354(6317), 1240–1241. https://doi.org/10.1126/science.aah6168
Stone, M. (1974). Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society. Series B (Methodological), 36(2), 111–147.
Strahan, P. E. (1999). Borrower risk and the price and nonprice terms of bank loans. Federal Reserve Bank of New York Staff Report, (90). https://www.newyorkfed.org/research/staff_reports/sr90.html
Štrumbelj, E., & Kononenko, I. (2014). Explaining prediction models and individual predictions with feature contributions. Knowledge and Information Systems, 41(3), 647–665. https://doi.org/10.1007/s10115-013-0679-x
Stulz, R. M. (2019). FinTech, BigTech, and the future of banks. Journal of Applied Corporate Finance, 31(4), 86–97. https://doi.org/10.1111/jacf.12378
Sugiyama, M., Krauledat, M., & Müller, K.-R. (2007). Covariate shift adaptation by importance weighted cross validation. Journal of Machine Learning Research, 8, 985–1005.
Sugiyama, M., Suzuki, T., Nakajima, S., Kashima, H., Bünau, P. von, & Kawanabe, M. (2008). Direct importance estimation for covariate shift adaptation. Annals of the Institute of Statistical Mathematics, 60(4), 699–746. https://doi.org/10.1007/s10463-008-0197-x
Sun, B., Liu, L., Miao, W., Wirth, K., Robins, J., & Tchetgen Tchetgen, E. J. (2018). Semiparametric estimation with data missing not at random using an instrumental variable. Statistica Sinica, 28(4), 1965–1983. https://doi.org/10.5705/ss.202016.0324
Sun, L., & Abraham, S. (2021). Estimating dynamic treatment effects in event studies with heterogeneous treatment effects. Journal of Econometrics, 225(2), 175–199. https://doi.org/10.1016/j.jeconom.2020.09.006
Sun, X., & Xu, W. (2014). Fast implementation of DeLong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Processing Letters, 21(11), 1389–1393. https://doi.org/10.1109/LSP.2014.2337313
Sundararajan, M., & Najmi, A. (2020). The many Shapley values for model explanation. Proceedings of the 37th International Conference on Machine Learning, 9269–9278.
Sundararajan, M., Taly, A., & Yan, Q. (2017). Axiomatic attribution for deep networks. Proceedings of the 34th International Conference on Machine Learning, 3319–3328.
Sundaresan, S. (2013). A review of Merton’s model of the firm’s capital structure with its wide applications. Annual Review of Financial Economics, 5, 21–41. https://doi.org/10.1146/annurev-financial-110112-120923
Suri, T. (2017). Mobile money. Annual Review of Economics, 9, 497–520. https://doi.org/10.1146/annurev-economics-063016-103638
Suri, T., & Jack, W. (2016). The long-run poverty and gender impacts of mobile money. Science, 354(6317), 1288–1292. https://doi.org/10.1126/science.aah5309
Suykens, J. A. K., & Vandewalle, J. (1999). Least squares support vector machine classifiers. Neural Processing Letters, 9(3), 293–300. https://doi.org/10.1023/A:1018628609742
Swaminathan, A., & Joachims, T. (2015). Counterfactual risk minimization: Learning from logged bandit feedback. International Conference on Machine Learning (ICML).
Sy, J. P., & Taylor, J. M. G. (2000). Estimation in a Cox proportional hazards cure model. Biometrics, 56(1), 227–236. https://doi.org/10.1111/j.0006-341X.2000.00227.x
Tang, H. (2019). Peer-to-peer lenders versus banks: Substitutes or complements? Review of Financial Studies, 32(5), 1900–1938. https://doi.org/10.1093/rfs/hhy137
Tax, D. M. J., & Duin, R. P. W. (2004). Support vector data description. Machine Learning, 54(1), 45–66. https://doi.org/10.1023/B:MACH.0000008084.60811.49
Tenney, I., Das, D., & Pavlick, E. (2019). BERT rediscovers the classical NLP pipeline. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 4593–4601. https://doi.org/10.18653/v1/P19-1452
Tetlock, P. C. (2007). Giving content to investor sentiment: The role of media in the stock market. The Journal of Finance, 62(3), 1139–1168. https://doi.org/10.1111/j.1540-6261.2007.01232.x
Tetlock, P. C., Saar-Tsechansky, M., & Macskassy, S. (2008). More than words: Quantifying language to measure firms’ fundamentals. The Journal of Finance, 63(3), 1437–1467. https://doi.org/10.1111/j.1540-6261.2008.01362.x
Thistlethwaite, D. L., & Campbell, D. T. (1960). Regression-discontinuity analysis: An alternative to the ex post facto experiment. Journal of Educational Psychology, 51(6), 309–317. https://doi.org/10.1037/h0044319
Thomas, L. C. (2000a). A survey of credit and behavioural scoring: Forecasting financial risk of lending to consumers. International Journal of Forecasting, 16(2), 149–172. https://doi.org/10.1016/S0169-2070(00)00034-0
Thomas, L. C. (2000b). A survey of credit and behavioural scoring: Forecasting financial risk of lending to consumers. International Journal of Forecasting, 16(2), 149–172. https://doi.org/10.1016/S0169-2070(00)00034-0
Thomas, L. C., Crook, J., & Edelman, D. (2017). Credit scoring and its applications (2nd ed.). Society for Industrial; Applied Mathematics (SIAM). https://doi.org/10.1137/1.9781611974560
Thomas, L. C., Ho, J., & Scherer, W. T. (2001). Time will tell: Behavioural scoring and the dynamics of consumer credit assessment. IMA Journal of Management Mathematics, 12(1), 89–103. https://doi.org/10.1093/imaman/12.1.89
Tian, S., Yu, Y., & Guo, H. (2015). Variable selection and corporate bankruptcy forecasts. Journal of Banking & Finance, 52, 89–100. https://doi.org/10.1016/j.jbankfin.2014.12.003
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological), 58(1), 267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
Tomek, I. (1976). Two modifications of CNN. IEEE Transactions on Systems, Man and Cybernetics, SMC-6(11), 769–772. https://doi.org/10.1109/TSMC.1976.4309452
Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei, Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., Bikel, D., Blecher, L., Ferrer, C. C., Chen, M., Cucurull, G., Esiobu, D., Fernandes, J., Fu, J., Fu, W., … Scialom, T. (2023). Llama 2: Open foundation and fine-tuned chat models. arXiv:2307.09288.
Townsend, R. M. (1979). Optimal contracts and competitive markets with costly state verification. Journal of Economic Theory, 21(2), 265–293. https://doi.org/10.1016/0022-0531(79)90031-0
Tran, K. Q., Duong, B. V., Tran, L. Q., Tran, A. L.-H., Nguyen, A. T., & Nguyen, K. V. (2021). Machine learning-based empirical investigation for credit scoring in vietnam’s banking. International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, 564–574.
Treacy, W. F., & Carey, M. (2000). Credit risk rating systems at large US banks. Journal of Banking & Finance, 24(1–2), 167–201. https://doi.org/10.1016/S0378-4266(99)00056-4
Trefethen, L. N., & Bau, D. (1997). Numerical linear algebra. SIAM. https://doi.org/10.1137/1.9780898719574
Trench, M. S., Pederson, S. P., Lau, E. T., Ma, L., Wang, H., & Nair, S. K. (2003). Managing credit lines and prices for Bank One credit cards. Interfaces, 33(5), 4–21. https://doi.org/10.1287/inte.33.5.4.19245
Truong, C., Oudre, L., & Vayatis, N. (2020). Selective review of offline change point detection methods. Signal Processing, 167, 107299. https://doi.org/10.1016/j.sigpro.2019.107299
Tsai, Y.-H. H., Bai, S., Yamada, M., Morency, L.-P., & Salakhutdinov, R. (2019). Transformer dissection: An unified understanding for transformer’s attention via the lens of kernel. Proceedings of EMNLP, 4335–4344. https://doi.org/10.18653/v1/D19-1443
Tsiatis, A. (1975). A nonidentifiability aspect of the problem of competing risks. Proceedings of the National Academy of Sciences, 72(1), 20–22. https://doi.org/10.1073/pnas.72.1.20
Tsiatis, A. A. (1981). A large sample study of cox’s regression model. The Annals of Statistics, 9(1), 93–108. https://doi.org/10.1214/aos/1176345335
Tsybakov, A. B. (2008). Introduction to nonparametric estimation. Springer Series in Statistics.
Turjeman, D., & Feinberg, F. M. (2024). When the data are out: Measuring behavioral changes following a data breach. Marketing Science, 43(2), 440–461. https://doi.org/10.1287/mksc.2019.0208
United Mexican States. (2002). Ley para regular las sociedades de información crediticia. Federal Official Gazette, 15 January 2002. https://www.diputados.gob.mx/
United Mexican States. (2010). Ley federal de protección de datos personales en posesión de los particulares (LFPDPPP). Federal Official Gazette, 5 July 2010. https://www.diputados.gob.mx/
United Mexican States. (2018). Ley para regular las instituciones de tecnología financiera (Fintech Law). Federal Official Gazette, 9 March 2018. https://www.diputados.gob.mx/
United States Congress. (1970). Fair credit reporting act, 15 u.s.c. §§ 1681 et seq. Public Law 91-508. https://www.consumer.ftc.gov/articles/pdf-0111-fair-credit-reporting-act.pdf
United States Congress. (1975). Home mortgage disclosure act of 1975. Public Law 94-200; 12 U.S.C. 2801 et seq.
Uno, H., Cai, T., Pencina, M. J., D’Agostino, R. B., & Wei, L. J. (2011). On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in Medicine, 30(10), 1105–1117. https://doi.org/10.1002/sim.4154
Upper, C. (2011). Simulation methods to assess the danger of contagion in interbank markets. Journal of Financial Stability, 7(3), 111–125. https://doi.org/10.1016/j.jfs.2010.12.001
U.S. Congress. (1974). Equal credit opportunity act, 15 u.s.c. §1691. United States Code.
U.S. Department of Housing and Urban Development. (2013). Implementation of the fair housing act’s discriminatory effects standard (24 CFR § 100.500). HUD. https://www.federalregister.gov/documents/2013/02/15/2013-03375/implementation-of-the-fair-housing-acts-discriminatory-effects-standard
U.S. Department of the Treasury. (2024). Managing AI-specific cybersecurity risks in the financial services sector. U.S. Department of the Treasury. https://home.treasury.gov/system/files/136/Managing-Artificial-Intelligence-Specific-Cybersecurity-Risks-In-The-Financial-Services-Sector.pdf
Ustun, B., Spangher, A., & Liu, Y. (2019). Actionable recourse in linear classification. Proceedings of the 2019 Conference on Fairness, Accountability, and Transparency, 10–19. https://doi.org/10.1145/3287560.3287566
Vaart, A. W. van der. (1998). Asymptotic statistics. Cambridge University Press. https://doi.org/10.1017/CBO9780511802256
Vallée, B., & Zeng, Y. (2019). Marketplace lending: A new banking paradigm? The Review of Financial Studies, 32(5), 1939–1982. https://doi.org/10.1093/rfs/hhz005
Vansteelandt, S., Rotnitzky, A., & Robins, J. M. (2007). Estimation of regression models for the mean of repeated outcomes under nonignorable nonmonotone nonresponse. Biometrika, 94(4), 841–860. https://doi.org/10.1093/biomet/asm070
Vapnik, V. N. (1999). An overview of statistical learning theory. IEEE Transactions on Neural Networks, 10(5), 988–999. https://doi.org/10.1109/72.788640
Vapnik, V. N., & Chervonenkis, A. Y. (1971). On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and Its Applications, 16(2), 264–280. https://doi.org/10.1137/1116025
Vasicek, O. A. (2002a). The distribution of loan portfolio value. Risk, 15(12), 160–162.
Vasicek, O. A. (2002b). The distribution of loan portfolio value. Risk Magazine, 15(12), 160–162.
Vassalou, M., & Xing, Y. (2004). Default risk in equity returns. The Journal of Finance, 59(2), 831–868. https://doi.org/10.1111/j.1540-6261.2004.00650.x
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems 30 (NeurIPS), 5998–6008.
Vaupel, J. W., Manton, K. G., & Stallard, E. (1979). The impact of heterogeneity in individual frailty on the dynamics of mortality. Demography, 16(3), 439–454. https://doi.org/10.2307/2061224
Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., & Bengio, Y. (2018). Graph attention networks. International Conference on Learning Representations (ICLR).
Vella, F. (1998). Estimating models with sample selection bias: A survey. Journal of Human Resources, 33(1), 127–169. https://doi.org/10.2307/146317
Verbraken, T., Bravo, C., Weber, R., & Baesens, B. (2014). Development and application of consumer credit scoring models using profit-based classification measures. European Journal of Operational Research, 238(2), 505–513. https://doi.org/10.1016/j.ejor.2014.04.001
Verbraken, T., Verbeke, W., & Baesens, B. (2013). A novel profit maximizing metric for measuring classification performance of customer churn prediction models. IEEE Transactions on Knowledge and Data Engineering, 25(5), 961–973. https://doi.org/10.1109/TKDE.2012.50
Vilcassim, N. J., & Jain, D. C. (1991). Modeling purchase-timing and brand-switching behavior incorporating explanatory variables and unobserved heterogeneity. Journal of Marketing Research, 28(1), 29–41. https://doi.org/10.1177/002224379102800103
Villani, C. (2009). Optimal transport: Old and new. Grundlehren Der Mathematischen Wissenschaften, 338. https://doi.org/10.1007/978-3-540-71050-9
Voigt, P., & Bussche, A. von dem. (2017). The EU general data protection regulation (GDPR): A practical guide. https://doi.org/10.1007/978-3-319-57959-7
Vovk, V., Gammerman, A., & Shafer, G. (2005). Algorithmic learning in a random world. Springer. https://doi.org/10.1007/b106715
VPBank SMBC Finance Company Limited (FE Credit). (2023). Annual report 2023. Ho Chi Minh City. https://fecredit.com.vn/
Vu, T., Nguyen, D. Q., Dras, M., Johnson, M., et al. (2018). VnCoreNLP: A vietnamese natural language processing toolkit. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, 56–60.
Wachter, S., Mittelstadt, B., & Floridi, L. (2017a). Why a right to explanation of automated decision-making does not exist in the general data protection regulation. International Data Privacy Law, 7(2), 76–99. https://doi.org/10.1093/idpl/ipx005
Wachter, S., Mittelstadt, B., & Floridi, L. (2017b). Why a right to explanation of automated decision-making does not exist in the general data protection regulation. International Data Privacy Law, 7(2), 76–99. https://doi.org/10.1093/idpl/ipx005
Wachter, S., Mittelstadt, B., & Russell, C. (2018). Counterfactual explanations without opening the black box: Automated decisions and the GDPR. Harvard Journal of Law and Technology, 31(2), 841–887.
Wager, S., & Athey, S. (2018). Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523), 1228–1242. https://doi.org/10.1080/01621459.2017.1319839
Wager, S., Wang, S., & Liang, P. (2013). Dropout training as adaptive regularization. Advances in Neural Information Processing Systems, 26.
Wang, S., Shao, J., & Kim, J. K. (2014). An instrumental variable approach for identification and estimation with nonignorable nonresponse. Statistica Sinica, 24(3), 1097–1116. https://doi.org/10.5705/ss.2012.074
Wang, X., Wei, J., Schuurmans, D., Le, Q. V., Chi, E. H., Narang, S., Chowdhery, A., & Zhou, D. (2023). Self-consistency improves chain of thought reasoning in language models. International Conference on Learning Representations (ICLR).
Wedel, M., Kamakura, W. A., DeSarbo, W. S., & Ter Hofstede, F. (1995). Implications for asymmetry, nonproportionality, and heterogeneity in brand switching from piece-wise exponential mixture hazard models. Journal of Marketing Research, 32(4), 457–462. https://doi.org/10.1177/002224379503200408
Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., Xia, F., Chi, E. H., Le, Q. V., & Zhou, D. (2022). Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems 35 (NeurIPS), 24824–24837.
Wei, Z., & Lin, M. (2017). Market mechanisms in online peer-to-peer lending. Management Science, 63(12), 4236–4257. https://doi.org/10.1287/mnsc.2016.2531
Wen, R., Torkkola, K., Narayanaswamy, B., & Madeka, D. (2017). A multi-horizon quantile recurrent forecaster. NeurIPS Time Series Workshop. https://arxiv.org/abs/1711.11053
West, D. (2000). Neural network credit scoring models. Computers & Operations Research, 27(11–12), 1131–1152. https://doi.org/10.1016/S0305-0548(99)00149-5
White, I. R., Royston, P., & Wood, A. M. (2011). Multiple imputation using chained equations: Issues and guidance for practice. Statistics in Medicine, 30(4), 377–399. https://doi.org/10.1002/sim.4067
Wiegreffe, S., & Pinter, Y. (2019). Attention is not not explanation. Proceedings of EMNLP, 11–20. https://doi.org/10.18653/v1/D19-1002
Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics Bulletin, 1(6), 80–83. https://doi.org/10.2307/3001968
Williams, C. K. I., & Seeger, M. (2000). Using the Nyström method to speed up kernel machines. Advances in Neural Information Processing Systems 13 (NIPS 2000).
Wilson, T. C. (1997a). Portfolio credit risk (i). Risk Magazine, 10(9), 111–117.
Wilson, T. C. (1997b). Portfolio credit risk (II). Risk Magazine, 10(10), 56–61.
Wolpert, D. H. (1992). Stacked generalization. Neural Networks, 5(2), 241–259. https://doi.org/10.1016/S0893-6080(05)80023-1
Woo, G., Liu, C., Kumar, A., Xiong, C., Savarese, S., & Sahoo, D. (2024). Unified training of universal time series forecasting transformers. Proceedings of the 41st International Conference on Machine Learning (ICML), PMLR 235. https://proceedings.mlr.press/v235/woo24a.html
World Bank. (2022a). The global findex database 2021. World Bank Group. https://www.worldbank.org/en/publication/globalfindex/Data
World Bank. (2022b). The global findex database 2021: Financial inclusion, digital payments, and resilience in the age of COVID-19. Washington, DC. https://www.worldbank.org/en/publication/globalfindex
World Bank. (2022c). Vietnam: Financial sector assessment. World Bank Group. https://www.worldbank.org/en/country/vietnam
World Bank. (2023). Vietnam: Digital economy policy note. World Bank. https://www.worldbank.org/en/country/vietnam
Wu, H., Hu, T., Liu, Y., Zhou, H., Wang, J., & Long, M. (2023). TimesNet: Temporal 2D-variation modeling for general time series analysis. Proceedings of the International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=ju_Uqw384Oq
Wu, H., Xu, J., Wang, J., & Long, M. (2021). Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Advances in Neural Information Processing Systems 34 (NeurIPS). https://arxiv.org/abs/2106.13008
Wu, S., Irsoy, O., Lu, S., Dabravolski, V., Dredze, M., Gehrmann, S., Kambadur, P., Rosenberg, D., & Mann, G. (2023). BloombergGPT: A large language model for finance. arXiv:2303.17564.
Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., & Yu, P. S. (2021). A comprehensive survey on graph neural networks. IEEE Transactions on Neural Networks and Learning Systems, 32(1), 4–24. https://doi.org/10.1109/TNNLS.2020.2978386
Wyner, A. J., Olson, M., Bleich, J., & Mease, D. (2017). Explaining the success of AdaBoost and random forests as interpolating classifiers. Journal of Machine Learning Research, 18, 1–33.
Xia, Y., Liu, C., Li, Y.-Y., & Liu, N. (2017). A boosted decision tree approach using Bayesian hyper-parameter optimization for credit scoring. Expert Systems with Applications, 78, 225–241. https://doi.org/10.1016/j.eswa.2017.02.017
Xu, K., Hu, W., Leskovec, J., & Jegelka, S. (2019). How powerful are graph neural networks? International Conference on Learning Representations (ICLR).
Xu, L., Skoularidou, M., Cuesta-Infante, A., & Veeramachaneni, K. (2019). Modeling tabular data using conditional GAN. Advances in Neural Information Processing Systems 32 (NeurIPS).
Yale Law Journal. (1979). Credit scoring and the ECOA: Applying the effects test. The Yale Law Journal, 88(7), 1450–1486. https://doi.org/10.2307/795759
Yang, H., Liu, X.-Y., & Wang, C. D. (2023). FinGPT: Open-source financial large language models. FinLLM Symposium at IJCAI.
Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine learning: Concept and applications. ACM Transactions on Intelligent Systems and Technology, 10(2), 1–19. https://doi.org/10.1145/3298981
Yang, Y., & Land, K. C. (2008). Age-period-cohort analysis of repeated cross-section surveys: Fixed or random effects? Sociological Methods & Research, 36(3), 297–326. https://doi.org/10.1177/0049124106292360
Yang, Y., Uy, M. C. S., & Huang, A. (2020). FinBERT: A pretrained language model for financial communications. arXiv Preprint.
Yeh, C.-K., Hsieh, C.-Y., Suggala, A. S., Inouye, D. I., & Ravikumar, P. (2019). On the (in)fidelity and sensitivity of explanations. Advances in Neural Information Processing Systems 32 (NeurIPS 2019).
Yeh, I.-C. (2016). Default of credit card clients. UCI Machine Learning Repository. https://doi.org/10.24432/C55S3H
Yeh, I.-C., & Lien, C.-H. (2009). The comparisons of data mining techniques for the predictive accuracy of probability of default of credit card clients. Expert Systems with Applications, 36(2), 2473–2480. https://doi.org/10.1016/j.eswa.2007.12.020
Yin, W., Hay, J., & Roth, D. (2019). Benchmarking zero-shot text classification: Datasets, evaluation and entailment approach. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3914–3923. https://doi.org/10.18653/v1/D19-1404
Ying, R., Bourgeois, D., You, J., Zitnik, M., & Leskovec, J. (2019). GNNExplainer: Generating explanations for graph neural networks. Advances in Neural Information Processing Systems 32 (NeurIPS).
Yoon, J., Jordon, J., & Schaar, M. van der. (2018). GAIN: Missing data imputation using generative adversarial nets. Proceedings of the 35th International Conference on Machine Learning (ICML).
Young, H. P. (1985). Monotonic solutions of cooperative games. International Journal of Game Theory, 14(2), 65–72. https://doi.org/10.1007/BF01769885
Yurdakul, B. (2018). Statistical properties of population stability index [Master’s thesis]. Western Michigan University.
Zadrozny, B., & Elkan, C. (2002). Transforming classifier scores into accurate multiclass probability estimates. 694–699. https://doi.org/10.1145/775047.775151
Zaharia, M., Chen, A., Davidson, A., Ghodsi, A., Hong, S. A., Konwinski, A., Murching, S., Nykodym, T., Ogilvie, P., Parkhe, M., Xie, F., & Zumar, C. (2018). Accelerating the machine learning lifecycle with MLflow. IEEE Data Engineering Bulletin, 41, 39–45.
Zaharia, M., Das, T., Li, H., Hunter, T., Shenker, S., & Stoica, I. (2013). Discretized streams: Fault-tolerant streaming computation at scale. Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP), 423–438. https://doi.org/10.1145/2517349.2522737
Zaharia, M., Xin, R. S., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M. J., et al. (2016). Apache Spark: A unified engine for big data processing. Communications of the ACM, 59(11), 56–65. https://doi.org/10.1145/2934664
Zaharia, M., Xin, R. S., Wendell, P., Das, T., Armbrust, M., Dave, A., Meng, X., Rosen, J., Venkataraman, S., Franklin, M. J., Ghodsi, A., Gonzalez, J., Shenker, S., & Stoica, I. (2016). Apache Spark: A unified engine for big data processing. Communications of the ACM, 59(11), 56–65. https://doi.org/10.1145/2934664
Zeiler, M. D., & Fergus, R. (2014). Visualizing and understanding convolutional networks. European Conference on Computer Vision (ECCV), 818–833. https://doi.org/10.1007/978-3-319-10590-1\_53
Zemel, R., Wu, Y., Swersky, K., Pitassi, T., & Dwork, C. (2013). Learning fair representations. Proceedings of the 30th International Conference on Machine Learning (ICML 2013), 325–333.
Zeng, H., Zhou, H., Srivastava, A., Kannan, R., & Prasanna, V. (2020). GraphSAINT: Graph sampling based inductive learning method. International Conference on Learning Representations (ICLR).
Zhang, B. H., Lemoine, B., & Mitchell, M. (2018). Mitigating unwanted biases with adversarial learning. Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society (AIES), 335–340. https://doi.org/10.1145/3278721.3278779
Zhang, T., & Yu, B. (2005). Boosting with early stopping: Convergence and consistency. The Annals of Statistics, 33(4), 1538–1579. https://doi.org/10.1214/009053605000000255
Zheng, M., & Klein, J. P. (1995). Estimates of marginal survival for dependent competing risks based on an assumed copula. Biometrika, 82(1), 127–138. https://doi.org/10.1093/biomet/82.1.127
Zhou, H., Zhang, S., Peng, J., Zhang, S., Li, J., Xiong, H., & Zhang, W. (2021). Informer: Beyond efficient transformer for long sequence time-series forecasting. Proceedings of the AAAI Conference on Artificial Intelligence, 35(12), 11106–11115. https://doi.org/10.1609/aaai.v35i12.17325
Zhu, X., & Goldberg, A. B. (2009). Introduction to semi-supervised learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 3(1), 1–130. https://doi.org/10.2200/S00196ED1V01Y200906AIM006
Zmijewski, M. E. (1984). Methodological issues related to the estimation of financial distress prediction models. Journal of Accounting Research, 22, 59–82. https://doi.org/10.2307/2490859
Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 67(2), 301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x