References
Aalen, O. (1978). Nonparametric inference for a family of counting
processes. The Annals of Statistics, 6(4), 701–726. https://doi.org/10.1214/aos/1176344247
Aas, K., Czado, C., Frigessi, A., & Bakken, H. (2009). Pair-copula
constructions of multiple dependence. Insurance: Mathematics and
Economics, 44(2), 182–198. https://doi.org/10.1016/j.insmatheco.2007.02.001
Aas, K., Jullum, M., & Løland, A. (2021). Explaining individual
predictions when features are dependent: More accurate approximations to
Shapley values. Artificial Intelligence,
298, 103502. https://doi.org/10.1016/j.artint.2021.103502
Abadi, M., Chu, A., Goodfellow, I., McMahan, H. B., Mironov, I., Talwar,
K., & Zhang, L. (2016). Deep learning with differential privacy.
Proceedings of the 2016 ACM SIGSAC Conference on Computer and
Communications Security (CCS), 308–318. https://doi.org/10.1145/2976749.2978318
Abnar, S., & Zuidema, W. (2020). Quantifying attention flow in
transformers. Proceedings of the 58th Annual Meeting of the
Association for Computational Linguistics (ACL), 4190–4197. https://doi.org/10.18653/v1/2020.acl-main.385
Acemoglu, D., Carvalho, V. M., Ozdaglar, A., & Tahbaz-Salehi, A.
(2012). The network origins of aggregate fluctuations.
Econometrica, 80(5), 1977–2016. https://doi.org/10.3982/ECTA9623
Acemoglu, D., Ozdaglar, A., & Tahbaz-Salehi, A. (2015). Systemic
risk and stability in financial networks. American Economic
Review, 105(2), 564–608. https://doi.org/10.1257/aer.20130456
Acharya, V. V., Berger, A. N., & Roman, R. A. (2018). Lending
implications of u.s. Bank stress tests: Costs or benefits? Journal
of Financial Intermediation, 34, 58–90. https://doi.org/10.1016/j.jfi.2018.01.004
Acharya, V. V., Berner, R., Engle, R. F., Jung, H., Stroebel, J., Zeng,
X., & Zhao, Y. (2023). Climate stress testing. Annual Review of
Financial Economics, 15, 291–326. https://doi.org/10.1146/annurev-financial-110921-101555
Acharya, V. V., Engle, R. F., & Pierret, D. (2014). Testing
macroprudential stress tests: The risk of regulatory risk weights.
Journal of Monetary Economics, 65, 36–53. https://doi.org/10.1016/j.jmoneco.2014.04.014
Acharya, V. V., Schnabl, P., & Suarez, G. (2013). Securitization
without risk transfer. Journal of Financial Economics,
107(3), 515–536. https://doi.org/10.1016/j.jfineco.2012.09.004
Acquisti, A., Brandimarte, L., & Loewenstein, G. (2015). Privacy and
human behavior in the age of information. Science,
347(6221), 509–514. https://doi.org/10.1126/science.aaa1465
Acquisti, A., Taylor, C., & Wagman, L. (2016). The economics of
privacy. Journal of Economic Literature, 54(2),
442–492. https://doi.org/10.1257/jel.54.2.442
Adams, P., Guttman-Kenney, B., Hayes, L., Hunt, S., Laibson, D., &
Stewart, N. (2022). Do nudges reduce borrowing and consumer confusion in
the credit card market? Economica, 89(S1), S178–S199.
https://doi.org/10.1111/ecca.12427
Adams, W., Einav, L., & Levin, J. (2009a). Liquidity constraints and
imperfect information in subprime lending. American Economic
Review, 99(1), 49–84. https://doi.org/10.1257/aer.99.1.49
Adams, W., Einav, L., & Levin, J. (2009b). Liquidity constraints and
imperfect information in subprime lending. American Economic
Review, 99(1), 49–84. https://doi.org/10.1257/aer.99.1.49
Agarwal, A., Beygelzimer, A., Dudı́k, M., Langford, J., & Wallach, H.
(2018). A reductions approach to fair classification. Proceedings of
the 35th International Conference on Machine Learning (ICML),
60–69.
Agarwal, S., Alok, S., Ghosh, P., & Gupta, S. (2020). Financial
inclusion and alternate credit scoring for the millennials: Role of big
data and machine learning in fintech. SSRN Working Paper,
(3507827). https://doi.org/10.2139/ssrn.3507827
Agarwal, S., Amromin, G., Ben-David, I., Chomsisengphet, S., Piskorski,
T., & Seru, A. (2017). Policy intervention in debt renegotiation:
Evidence from the Home Affordable
Modification Program. Journal of Political
Economy, 125(3), 654–712. https://doi.org/10.1086/691701
Agarwal, S., Chomsisengphet, S., Liu, C., Song, C., & Souleles, N.
S. (2018). Benefits of relationship banking: Evidence from consumer
credit markets. Journal of Monetary Economics, 96,
16–32. https://doi.org/10.1016/j.jmoneco.2018.02.005
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J. (2015).
Regulating consumer financial products: Evidence from credit cards.
The Quarterly Journal of Economics, 130(1), 111–164.
https://doi.org/10.1093/qje/qju037
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J.
(2018a). Do banks pass through credit expansions to consumers who want
to borrow? The Quarterly Journal of Economics, 133(1),
129–190. https://doi.org/10.1093/qje/qjx027
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J.
(2018b). Do banks pass through credit expansions to consumers who want
to borrow? The Quarterly Journal of Economics, 133(1),
129–190. https://doi.org/10.1093/qje/qjx027
Agarwal, S., Chomsisengphet, S., Mahoney, N., & Stroebel, J.
(2018c). Do banks pass through credit expansions to consumers who want
to borrow? Quarterly Journal of Economics, 133(1),
129–190. https://doi.org/10.1093/qje/qjx027
Agarwal, S., & Hauswald, R. (2010). Distance and private information
in lending. Review of Financial Studies, 23(7),
2757–2788. https://doi.org/10.1093/rfs/hhq001
Agarwal, S., Qian, W., Yeung, B. Y., & Zou, X. (2019). Mobile wallet
and entrepreneurial growth. AEA Papers and Proceedings,
109, 48–53. https://doi.org/10.1257/pandp.20191010
Agarwal, V., & Taffler, R. (2008). Comparing the performance of
market-based and accounting-based bankruptcy prediction models.
Journal of Banking & Finance, 32(8), 1541–1551. https://doi.org/10.1016/j.jbankfin.2007.07.014
Aguiar, M., & Gopinath, G. (2006). Defaultable debt, interest rates
and the current account. Journal of International Economics,
69(1), 64–83. https://doi.org/10.1016/j.jinteco.2005.05.005
Aker, J. C., & Mbiti, I. M. (2010). Mobile phones and economic
development in Africa. Journal of Economic
Perspectives, 24(3), 207–232. https://doi.org/10.1257/jep.24.3.207
Akerlof, G. A. (1970). The market for “lemons”: Quality
uncertainty and the market mechanism. The Quarterly Journal of
Economics, 84(3), 488–500. https://doi.org/10.2307/1879431
Akidau, T., Bradshaw, R., Chambers, C., Chernyak, S.,
Fernández-Moctezuma, R. J., Lax, R., McVeety, S., Mills, D., Perry, F.,
Schmidt, E., & Whittle, S. (2015). The dataflow model: A practical
approach to balancing correctness, latency, and cost in massive-scale,
unbounded, out-of-order data processing. Proceedings of the VLDB
Endowment, 8(12), 1792–1803. https://doi.org/10.14778/2824032.2824076
Allen, F., & Gale, D. (2000). Financial contagion. Journal of
Political Economy, 108(1), 1–33. https://doi.org/10.1086/262109
Allen, J., Clark, R., & Houde, J.-F. (2014). The effect of mergers
in search markets: Evidence from the Canadian mortgage
industry. American Economic Review, 104(10),
3365–3396. https://doi.org/10.1257/aer.104.10.3365
Allen, J., Clark, R., & Houde, J.-F. (2019). Search frictions and
market power in negotiated-price markets. Journal of Political
Economy, 127(4), 1550–1598. https://doi.org/10.1086/701684
Allison, P. D. (1982). Discrete-time methods for the analysis of event
histories. Sociological Methodology, 13, 61–98. https://doi.org/10.2307/270718
Alpaydin, E. (1999). Combined 5×2 CV
F test for comparing supervised classification learning
algorithms. Neural Computation, 11(8), 1885–1892. https://doi.org/10.1162/089976699300016007
Altman, E. I. (1968). Financial ratios, discriminant analysis and the
prediction of corporate bankruptcy. The Journal of Finance,
23(4), 589–609. https://doi.org/10.2307/2978933
Altman, E. I. (2000). Predicting financial distress of companies:
Revisiting the Z-score and ZETA models.
Stern School of Business, New York University Working Paper.
Altman, E. I. (2005). An emerging market credit scoring system for
corporate bonds. Emerging Markets Review, 6(4),
311–323. https://doi.org/10.1016/j.ememar.2005.09.007
Altman, E. I., Brady, B., Resti, A., & Sironi, A. (2005). The link
between default and recovery rates: Theory, empirical evidence, and
implications. The Journal of Business, 78(6),
2203–2228. https://doi.org/10.1086/497044
Altman, E. I., Haldeman, R. G., & Narayanan, P. (1977a).
ZETA analysis: A new model to identify bankruptcy risk of
corporations. Journal of Banking & Finance, 1(1),
29–54. https://doi.org/10.1016/0378-4266(77)90017-6
Altman, E. I., Haldeman, R. G., & Narayanan, P. (1977b).
ZETA analysis: A new model to identify bankruptcy risk of
corporations. Journal of Banking & Finance, 1(1),
29–54. https://doi.org/10.1016/0378-4266(77)90017-6
Altman, E. I., Iwanicz-Drozdowska, M., Laitinen, E. K., & Suvas, A.
(2017). Financial distress prediction in an international context: A
review and empirical analysis of Altman’s
Z-score model. Journal of International Financial
Management & Accounting, 28(2), 131–171. https://doi.org/10.1111/jifm.12053
Altman, E. I., & Sabato, G. (2007). Modelling credit risk for
SMEs: Evidence from the US market.
Abacus, 43(3), 332–357. https://doi.org/10.1111/j.1467-6281.2007.00234.x
Altmann, A., Toloşi, L., Sander, O., & Lengauer, T. (2010).
Permutation importance: A corrected feature importance measure.
Bioinformatics, 26(10), 1340–1347. https://doi.org/10.1093/bioinformatics/btq134
Alvarez-Melis, D., & Jaakkola, T. S. (2018). On the robustness
of interpretability methods.
Ambrose, B. W., & LaCour-Little, M. (2001). Prepayment risk in
adjustable rate mortgages subject to initial year discounts: Some new
evidence. Real Estate Economics, 29(2), 305–327. https://doi.org/10.1111/1080-8620.00012
Amershi, S., Begel, A., Bird, C., DeLine, R., Gall, H., Kamar, E.,
Nagappan, N., Nushi, B., & Zimmermann, T. (2019). Software
engineering for machine learning: A case study. IEEE/ACM
International Conference on Software Engineering (ICSE-SEIP),
291–300. https://doi.org/10.1109/ICSE-SEIP.2019.00042
An, X., Cordell, L., Smith, L., & Wang, K. (2022). Racial and ethnic
disparities in mortgage lending: New evidence from Expanded
HMDA data. Federal Reserve Bank of Philadelphia Working
Paper, (22-02). https://www.philadelphiafed.org/the-economy/banking-and-financial-markets/racial-and-ethnic-disparities-in-mortgage-lending
Andersen, P. K., & Gill, R. D. (1982). Cox’s regression model for
counting processes: A large sample study. The Annals of
Statistics, 10(4), 1100–1120. https://doi.org/10.1214/aos/1176345976
Anderson, R. (2007). The credit scoring toolkit: Theory and practice
for retail credit risk management and decision automation.
Anderson, T. W. (1951). Classification by multivariate analysis.
Psychometrika, 16(1), 31–50. https://doi.org/10.1007/BF02313425
Andrews, I., Stock, J. H., & Sun, L. (2019). Weak instruments in
instrumental variables regression: Theory and practice. Annual
Review of Economics, 11, 727–753. https://doi.org/10.1146/annurev-economics-080218-025643
Angelino, E., Larus-Stone, N., Alabi, D., Seltzer, M., & Rudin, C.
(2018). Learning certifiably optimal rule lists for categorical data.
Journal of Machine Learning Research, 18, 1–78.
Angelopoulos, A. N., & Bates, S. (2023). Conformal prediction: A
gentle introduction. Foundations and Trends in Machine
Learning, 16(4), 494–591. https://doi.org/10.1561/2200000101
Angelopoulos, A. N., Bates, S., Jordan, M., & Malik, J. (2021).
Uncertainty sets for image classifiers using conformal prediction.
International Conference on Learning Representations (ICLR).
Angrist, J. D., Imbens, G. W., & Rubin, D. B. (1996). Identification
of causal effects using instrumental variables. Journal of the
American Statistical Association, 91(434), 444–455. https://doi.org/10.2307/2291629
Ansari, A. F., Stella, L., Turkmen, C., Zhang, X., Mercado, P., Shen,
H., Shchur, O., Rangapuram, S. S., Arango, S. P., Kapoor, S.,
Zschiegner, J., Maddix, D. C., Wang, H., Mahoney, M. W., Torkkola, K.,
Wilson, A. G., Bohlke-Schneider, M., & Wang, Y. (2024).
Chronos: Learning the language of time series.
Transactions on Machine Learning Research; arXiv:2403.07815. https://arxiv.org/abs/2403.07815
Antweiler, W., & Frank, M. Z. (2004). Is all that talk just noise?
The information content of internet stock message boards. The
Journal of Finance, 59(3), 1259–1294. https://doi.org/10.1111/j.1540-6261.2004.00662.x
Apley, D. W., & Zhu, J. (2020). Visualizing the effects of predictor
variables in black box supervised learning models. Journal of the
Royal Statistical Society: Series B (Statistical Methodology),
82(4), 1059–1086. https://doi.org/10.1111/rssb.12377
Araci, D. (2019). FinBERT: Financial sentiment analysis
with pre-trained language models. arXiv:1908.10063.
Arellano, C. (2008). Default risk and income fluctuations in emerging
economies. American Economic Review, 98(3), 690–712.
https://doi.org/10.1257/aer.98.3.690
Argyle, B. S., Nadauld, T. D., & Palmer, C. J. (2020). Monthly
payment targeting and the demand for maturity. Review of Financial
Studies, 33(11), 5416–5462. https://doi.org/10.1093/rfs/hhaa004
Aridor, G., Che, Y.-K., & Salz, T. (2024). The effect of privacy
regulation on the data industry: Empirical evidence from
GDPR. RAND Journal of Economics, 55(4),
503–530. https://doi.org/10.1111/1756-2171.12586
Arik, S. Ö., & Pfister, T. (2021). TabNet: Attentive
interpretable tabular learning. Proceedings of the AAAI Conference
on Artificial Intelligence, 35, 6679–6687.
Arkhangelsky, D., Athey, S., Hirshberg, D. A., Imbens, G. W., &
Wager, S. (2021). Synthetic difference-in-differences. American
Economic Review, 111(12), 4088–4118. https://doi.org/10.1257/aer.20190159
Arlot, S., & Celisse, A. (2010). A survey of cross-validation
procedures for model selection. Statistics Surveys, 4,
40–79. https://doi.org/10.1214/09-SS054
Armbrust, M., Das, T., Torres, J., Yavuz, B., Zhu, S., Xin, R., Ghodsi,
A., Stoica, I., & Zaharia, M. (2018). Structured streaming: A
declarative API for real-time applications in
Apache Spark. Proceedings of the 2018 ACM
International Conference on Management of Data (SIGMOD), 601–613.
https://doi.org/10.1145/3183713.3190664
Arnold, D., Dobbie, W., & Yang, C. S. (2018). Racial bias in bail
decisions. The Quarterly Journal of Economics, 133(4),
1885–1932. https://doi.org/10.1093/qje/qjy012
Aronszajn, N. (1950). Theory of reproducing kernels. Transactions of
the American Mathematical Society, 68(3), 337–404. https://doi.org/10.2307/1990404
Arrieta, A. B., Dı́az-Rodrı́guez, N., Del Ser, J., Bennetot, A., Tabik,
S., Barbado, A., Garcı́a, S., Gil-López, S., Molina, D., Benjamins, R.,
Chatila, R., & Herrera, F. (2020). Explainable artificial
intelligence (XAI): Concepts, taxonomies, opportunities and
challenges toward responsible AI. Information
Fusion, 58, 82–115. https://doi.org/10.1016/j.inffus.2019.12.012
Ascarza, E. (2018). Retention futility: Targeting high-risk customers
might be ineffective. Journal of Marketing Research,
55(1), 80–98. https://doi.org/10.1509/jmr.16.0163
Asian Development Bank. (2022a). Fintech policy tool kit for
regulators and policy makers in Asia and the
Pacific. Asian Development Bank. https://www.adb.org/publications/fintech-policy-tool-kit-regulators-policy-makers-asia-pacific
Asian Development Bank. (2022b). Viet nam financial sector report:
Deepening financial inclusion. Asian Development Bank. https://www.adb.org/countries/viet-nam/main
Asian Development Bank. (2023). Digital financial inclusion in
Southeast Asia. Asian Development Bank. https://www.adb.org/publications/digital-financial-inclusion-southeast-asia
Assefa, S. A., Dervovic, D., Mahfouz, M., Tillman, R. E., Reddy, P.,
& Veloso, M. (2020). Generating synthetic data in finance:
Opportunities, challenges and pitfalls. Proceedings of the First ACM
International Conference on AI in Finance. https://doi.org/10.1145/3383455.3422554
Athey, S., & Imbens, G. (2016). Recursive partitioning for
heterogeneous causal effects. Proceedings of the National Academy of
Sciences, 113(27), 7353–7360. https://doi.org/10.1073/pnas.1510489113
Athey, S., Tibshirani, J., & Wager, S. (2019). Generalized random
forests. The Annals of Statistics, 47(2), 1148–1178.
https://doi.org/10.1214/18-AOS1709
Athey, S., & Wager, S. (2021). Policy learning with observational
data. Econometrica, 89(1), 133–161. https://doi.org/10.3982/ECTA15732
Atiya, A. F. (2001). Bankruptcy prediction for credit risk using neural
networks: A survey and new results. IEEE Transactions on Neural
Networks, 12(4), 929–935. https://doi.org/10.1109/72.935101
Avery, R. B., Brevoort, K. P., & Canner, G. B. (2007). The 2006
HMDA data. Federal Reserve Bulletin, 93,
A73–A109.
Avery, R. B., Brevoort, K. P., & Canner, G. B. (2009b). Credit
scoring and its effects on the availability and affordability of credit.
Journal of Consumer Affairs, 43(3), 516–537. https://doi.org/10.1111/j.1745-6606.2009.01151.x
Avery, R. B., Brevoort, K. P., & Canner, G. B. (2009a). Credit
scoring and its effects on the availability and affordability of credit.
Journal of Consumer Affairs, 43(3), 516–537. https://doi.org/10.1111/j.1745-6606.2009.01151.x
Avery, R. B., Calem, P. S., Canner, G. B., & Bostic, R. W. (2003).
An overview of consumer data and credit reporting. Federal Reserve
Bulletin, 89, 47–73.
Azizpour, S., Giesecke, K., & Schwenkler, G. (2018). Exploring the
sources of default clustering. Journal of Financial Economics,
129(1), 154–183. https://doi.org/10.1016/j.jfineco.2018.04.008
Ba, J. L., Kiros, J. R., & Hinton, G. E. (2016). Layer
normalization. arXiv Preprint arXiv:1607.06450.
Babaev, D., Ovsov, N., Kireev, I., Ivanova, M., Gusev, G., Nazarov, I.,
& Tuzhilin, A. (2022). CoLES: Contrastive learning
for event sequences with self-supervision. https://doi.org/10.1145/3514221.3526129
Babina, T., Bahaj, S. A., Buchak, G., De Marco, F., Foulis, A. K.,
Gornall, W., Mazzola, F., & Yu, T. (2024). Customer data access and
fintech entry: Early evidence from open banking. National Bureau of
Economic Research Working Paper, (32089). https://doi.org/10.3386/w32089
Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.-R., &
Samek, W. (2015). On pixel-wise explanations for non-linear classifier
decisions by layer-wise relevance propagation. PLOS ONE,
10(7), e0130140. https://doi.org/10.1371/journal.pone.0130140
Baesens, B., Van Gestel, T., Stepanova, M., Van den Poel, D., &
Vanthienen, J. (2005). Neural network survival analysis for personal
loan data. Journal of the Operational Research Society,
56(9), 1089–1098. https://doi.org/10.1057/palgrave.jors.2601990
Baesens, B., Van Gestel, T., Viaene, S., Stepanova, M., Suykens, J.,
& Vanthienen, J. (2003). Benchmarking state-of-the-art
classification algorithms for credit scoring. Journal of the
Operational Research Society, 54(6), 627–635. https://doi.org/10.1057/palgrave.jors.2601545
Baghai, R. P., Servaes, H., & Tamayo, A. (2014). Have rating
agencies become more conservative? Implications for capital
structure and debt pricing. Journal of Finance, 69(5),
1961–2005. https://doi.org/10.1111/jofi.12153
Bai, S., Kolter, J. Z., & Koltun, V. (2018). An empirical evaluation
of generic convolutional and recurrent networks for sequence modeling.
arXiv Preprint arXiv:1803.01271.
Bai, X., Tsiatis, A. A., & O’Brien, S. M. (2013). Doubly robust
estimators of treatment-specific survival distributions in observational
studies with stratified sampling. Biometrics, 69(4),
830–839. https://doi.org/10.1111/biom.12076
Bai, Y., Kadavath, S., Kundu, S., Askell, A., Kernion, J., Jones, A.,
Chen, A., Goldie, A., Mirhoseini, A., McKinnon, C., Chen, C., Olsson,
C., Olah, C., Hernandez, D., Drain, D., Ganguli, D., Li, D.,
Tran-Johnson, E., Perez, E., … Kaplan, J. (2022). Constitutional
AI: Harmlessness from AI feedback.
arXiv:2212.08073.
Baker, S. R. (2018). Debt and the response to household income shocks:
Validation and application of linked financial account data. Journal
of Political Economy, 126(4), 1504–1557. https://doi.org/10.1086/698106
Baker, S. R., Bloom, N., & Davis, S. J. (2016). Measuring economic
policy uncertainty. The Quarterly Journal of Economics,
131(4), 1593–1636. https://doi.org/10.1093/qje/qjw024
Baldauf, M., Garlappi, L., & Yannelis, C. (2020). Does climate
change affect real estate prices? Only if you believe in it. Review
of Financial Studies, 33(3), 1256–1295. https://doi.org/10.1093/rfs/hhz073
Balyuk, T., & Davydenko, S. A. (2024). Reintermediation in FinTech:
Evidence from online lending. Journal of Financial and Quantitative
Analysis, 59(5), 1997–2037. https://doi.org/10.1017/S0022109023000789
Bamber, D. (1975). The area above the ordinal dominance graph and the
area below the receiver operating characteristic graph. Journal of
Mathematical Psychology, 12(4), 387–415. https://doi.org/10.1016/0022-2496(75)90001-2
Banasik, J., & Crook, J. (2007). Reject inference, augmentation, and
sample selection. European Journal of Operational Research,
183(3), 1582–1594. https://doi.org/10.1016/j.ejor.2006.06.072
Banasik, J., Crook, J. N., & Thomas, L. C. (1999a). Not if but when
will borrowers default. Journal of the Operational Research
Society, 50(12), 1185–1190. https://doi.org/10.1057/palgrave.jors.2600851
Banasik, J., Crook, J. N., & Thomas, L. C. (1999b). Not if but when
will borrowers default. Journal of the Operational Research
Society, 50(12), 1185–1190. https://doi.org/10.1057/palgrave.jors.2600851
Banasik, J., Crook, J. N., & Thomas, L. C. (2003). Sample selection
bias in credit scoring models. Journal of the Operational Research
Society, 54(8), 822–832. https://doi.org/10.1057/palgrave.jors.2601578
Banco Central do Brasil. (2013). Circular no. 3.648:
IRB approach for credit risk capital requirement.
Banco Central do Brasil. https://www.bcb.gov.br/
Banco Central do Brasil. (2020). Joint resolution no. 1:
Implementation of Open Finance in brazil. Banco
Central do Brasil; Conselho Monetário Nacional. https://www.bcb.gov.br/estabilidadefinanceira/openfinance
Bangia, A., Diebold, F. X., Kronimus, A., Schagen, C., & Schuermann,
T. (2002). Ratings migration and the business cycle, with application to
credit portfolio stress testing. Journal of Banking &
Finance, 26(2–3), 445–474. https://doi.org/10.1016/S0378-4266(01)00229-1
Bank for International Settlements. (2020). Financial stability
considerations in emerging market economies: BIS papers no.
113. Bank for International Settlements. https://www.bis.org/publ/bppdf/bispap113.htm
Bank for International Settlements. (2022a). Big tech regulation: In
search of a new framework (FSI Occasional Paper 20). Bank for
International Settlements. https://www.bis.org/fsi/fsipapers20.htm
Bank for International Settlements. (2022b). Credit markets in
emerging market economies: Evolution and policy challenges (BIS
Papers 125). Bank for International Settlements. https://www.bis.org/publ/bppdf/bispap125.htm
Bank for International Settlements. (2023a). Big tech regulation: In
search of a new framework (BIS Papers 141). Bank for International
Settlements. https://www.bis.org/publ/bppdf/bispap141.htm
Bank for International Settlements. (2023b). Financial stability
risks from non-bank financial intermediation in emerging market
economies. BIS Papers. https://www.bis.org/
Bank for International Settlements, Financial Stability Institute.
(2024). Regulating AI in the financial sector: Recent developments
and main challenges (FSI insights no. 63). Bank for International
Settlements.
Bank of England. (2022). Stress testing the UK banking system: Key
elements of the 2022 annual cyclical scenario. Bank of England. https://www.bankofengland.co.uk/stress-testing/2022/key-elements-of-the-2022-annual-cyclical-scenario
Barber, R. F., Candès, E. J., Ramdas, A., & Tibshirani, R. J.
(2021). Predictive inference with the jackknife+. The Annals of
Statistics, 49(1), 486–507. https://doi.org/10.1214/20-AOS1965
Barboni, G., Cárdenas, J. C., & De Roux, N. (2026). Behavioral
messages and debt repayment. Review of Finance, rfag015.
Barboza, F., Kimura, H., & Altman, E. (2017). Machine learning
models and bankruptcy prediction. Expert Systems with
Applications, 83, 405–417. https://doi.org/10.1016/j.eswa.2017.04.006
Bardoscia, M., Barucca, P., Battiston, S., Caccioli, F., Cimini, G.,
Garlaschelli, D., Saracco, F., Squartini, T., & Caldarelli, G.
(2021). The physics of financial networks. Nature Reviews
Physics, 3(7), 490–507. https://doi.org/10.1038/s42254-021-00322-5
Barocas, S., & Selbst, A. D. (2016). Big data’s disparate impact.
California Law Review, 104(3), 671–732.
Barron, A. R. (1993). Universal approximation bounds for superpositions
of a sigmoidal function. IEEE Transactions on Information
Theory, 39(3), 930–945. https://doi.org/10.1109/18.256500
Barrot, J.-N., & Sauvagnat, J. (2016). Input specificity and the
propagation of idiosyncratic shocks in production networks.
Quarterly Journal of Economics, 131(3), 1543–1592. https://doi.org/10.1093/qje/qjw018
Bartlett, P. L., & Mendelson, S. (2002). Rademacher and
Gaussian complexities: Risk bounds and structural results.
Journal of Machine Learning Research, 3, 463–482.
Bartlett, R., Morse, A., Stanton, R., & Wallace, N. (2022b).
Consumer-lending discrimination in the FinTech era. Journal of
Financial Economics, 143(1), 30–56. https://doi.org/10.1016/j.jfineco.2021.05.047
Bartlett, R., Morse, A., Stanton, R., & Wallace, N. (2022a).
Consumer-lending discrimination in the FinTech era.
Journal of Financial Economics, 143(1), 30–56. https://doi.org/10.1016/j.jfineco.2021.05.047
Basel Committee on Banking Supervision. (2005b). An explanatory note
on the basel II IRB risk weight
functions. Bank for International Settlements. https://www.bis.org/bcbs/irbriskweight.htm
Basel Committee on Banking Supervision. (2005c). An explanatory note
on the basel II IRB risk weight functions. Bank for
International Settlements. https://www.bis.org/bcbs/irbriskweight.htm
Basel Committee on Banking Supervision. (2005a). An explanatory note
on the basel II IRB risk weight
functions. Bank for International Settlements. https://www.bis.org/bcbs/irbriskweight.htm
Basel Committee on Banking Supervision. (2005d). An explanatory note
on the basel II IRB risk weight functions. Bank for International
Settlements. https://www.bis.org/bcbs/irbriskweight.pdf
Basel Committee on Banking Supervision. (2005e). Studies on the
validation of internal rating systems (Working Paper 14). Bank for
International Settlements.
Basel Committee on Banking Supervision. (2006). International
convergence of capital measurement and capital standards: A revised
framework, comprehensive version [Technical Report]. https://www.bis.org/publ/bcbs128.htm
Basel Committee on Banking Supervision. (2010). Sound practices for
backtesting counterparty credit risk models. Bank for International
Settlements. https://www.bis.org/publ/bcbs185.htm
Basel Committee on Banking Supervision. (2013). Principles for
effective risk data aggregation and risk reporting (BCBS 239). Bank
for International Settlements. https://www.bis.org/publ/bcbs239.htm
Basel Committee on Banking Supervision. (2015). Guidance on credit
risk and accounting for expected credit losses (BCBS 350). Bank for
International Settlements. https://www.bis.org/bcbs/publ/d350.htm
Basel Committee on Banking Supervision. (2016). Guidelines on the
supervisory review and evaluation process and pillar 2 capital
(BCBS 355). Bank for International Settlements. https://www.bis.org/bcbs/publ/d355.htm
Basel Committee on Banking Supervision. (2017a). Basel III:
Finalising post-crisis reforms [Technical Report]. https://www.bis.org/bcbs/publ/d424.htm
Basel Committee on Banking Supervision. (2017b). Guidelines on
credit risk and accounting for expected credit losses (BCBS
Guidance d350). Bank for International Settlements. https://www.bis.org/bcbs/publ/d350.htm
Basel Committee on Banking Supervision. (2021). Principles for the
effective management of third-party risks (revisions in the context of
AI use). Bank for International Settlements.
Bastos, J. A. (2010). Forecasting bank loans loss-given-default.
Journal of Banking & Finance, 34(10), 2510–2517.
https://doi.org/10.1016/j.jbankfin.2010.04.011
Batista, G. E. A. P. A., Prati, R. C., & Monard, M. C. (2004). A
study of the behavior of several methods for balancing machine learning
training data. ACM SIGKDD Explorations Newsletter,
6(1), 20–29. https://doi.org/10.1145/1007730.1007735
Battiston, S., Puliga, M., Kaushik, R., Tasca, P., & Caldarelli, G.
(2012). DebtRank: Too central to fail? Financial networks,
the FED and systemic risk. Scientific Reports,
2, 541. https://doi.org/10.1038/srep00541
Baum, L. E., Petrie, T., Soules, G., & Weiss, N. (1970). A
maximization technique occurring in the statistical analysis of
probabilistic functions of Markov chains. The Annals of
Mathematical Statistics, 41(1), 164–171. https://doi.org/10.1214/aoms/1177697196
Bayer, P., Ferreira, F., & Ross, S. L. (2018). What drives racial
and ethnic differences in high-cost mortgages? The role of high-risk
lenders. The Review of Financial Studies, 31(1),
175–205. https://doi.org/10.1093/rfs/hhx035
Bazarbash, M. (2019). FinTech in financial inclusion: Machine
learning applications in assessing credit risk [IMF Working Paper].
(WP/19/109).
Bazot, G. (2018). Financial consumption and the cost of finance:
Measuring financial efficiency in europe (1950–2007). Journal of the
European Economic Association, 16(1), 123–160. https://doi.org/10.1093/jeea/jvx008
Beaver, W. H. (1966). Financial ratios as predictors of failure.
Journal of Accounting Research, 4, 71–111. https://doi.org/10.2307/2490171
Becker, B., & Milbourn, T. (2011). How did increased competition
affect credit ratings? Journal of Financial Economics,
101(3), 493–514. https://doi.org/10.1016/j.jfineco.2011.03.012
Begenau, J., Farboodi, M., & Veldkamp, L. (2018). Big data in
finance and the growth of large firms. Journal of Monetary
Economics, 97, 71–87. https://doi.org/10.1016/j.jmoneco.2018.05.013
Begley, J., Ming, J., & Watts, S. (1996). Bankruptcy classification
errors in the 1980s: An empirical analysis of Altman’s and Ohlson’s
models. Review of Accounting Studies, 1(4), 267–284.
https://doi.org/10.1007/BF00570833
Begley, T. A., & Purnanandam, A. (2017). Design of financial
securities: Empirical evidence from private-label RMBS
deals. Review of Financial Studies, 30(1), 120–161. https://doi.org/10.1093/rfs/hhw068
Begley, T. A., & Purnanandam, A. (2021). Color and credit: Race,
regulation, and the quality of financial services. Journal of
Financial Economics, 141(1), 48–65. https://doi.org/10.1016/j.jfineco.2021.02.013
Behn, M., Haselmann, R., & Vig, V. (2022). The limits of model-based
regulation. The Journal of Finance, 77(3), 1635–1684.
https://doi.org/10.1111/jofi.13124
Belkin, M., Hsu, D., Ma, S., & Mandal, S. (2019). Reconciling modern
machine-learning practice and the classical bias–variance trade-off.
Proceedings of the National Academy of Sciences,
116(32), 15849–15854. https://doi.org/10.1073/pnas.1903070116
Belloni, A., Chernozhukov, V., & Hansen, C. (2014). Inference on
treatment effects after selection among high-dimensional controls.
The Review of Economic Studies, 81(2), 608–650. https://doi.org/10.1093/restud/rdt044
Bellotti, T., & Crook, J. (2009b). Support vector machines for
credit scoring and discovery of significant features. Expert Systems
with Applications, 36(2), 3302–3308. https://doi.org/10.1016/j.eswa.2008.01.005
Bellotti, T., & Crook, J. (2009a). Support vector machines for
credit scoring and discovery of significant features. Expert Systems
with Applications, 36(2), 3302–3308. https://doi.org/10.1016/j.eswa.2008.01.005
Bellotti, T., & Crook, J. (2013). Forecasting and stress testing
credit card default using dynamic models. International Journal of
Forecasting, 29(4), 563–574. https://doi.org/10.1016/j.ijforecast.2013.04.003
Bengio, Y., Simard, P., & Frasconi, P. (1994). Learning long-term
dependencies with gradient descent is difficult. IEEE Transactions
on Neural Networks, 5(2), 157–166. https://doi.org/10.1109/72.279181
Benjamini, Y., & Hochberg, Y. (1995). Controlling the false
discovery rate: A practical and powerful approach to multiple testing.
Journal of the Royal Statistical Society: Series B
(Methodological), 57(1), 289–300. https://doi.org/10.1111/j.2517-6161.1995.tb02031.x
Benmelech, E., & Dlugosz, J. (2009). The alchemy of CDO
credit ratings. Journal of Monetary Economics, 56(5),
617–634. https://doi.org/10.1016/j.jmoneco.2009.04.007
Berg, T., Burg, V., Gombović, A., & Puri, M. (2020). On the rise of
FinTechs: Credit scoring using digital footprints. The Review of
Financial Studies, 33(7), 2845–2897. https://doi.org/10.1093/rfs/hhz099
Berg, T., Puri, M., & Rocholl, J. (2020). Loan officer incentives,
internal rating models, and default rates. Review of Finance,
24(3), 529–578. https://doi.org/10.1093/rof/rfz018
Berger, A. N., Miller, N. H., Petersen, M. A., Rajan, R. G., &
Stein, J. C. (2005). Does function follow organizational form?
Evidence from the lending practices of large and small
banks. Journal of Financial Economics, 76(2), 237–269.
https://doi.org/10.1016/j.jfineco.2004.06.003
Berger, A. N., & Udell, G. F. (2002). Small business credit
availability and relationship lending: The importance of bank
organisational structure. Economic Journal, 112(477),
F32–F53. https://doi.org/10.1111/1468-0297.00682
Berger, D. W., Milbradt, K., Tourre, F., & Vavra, J. (2021).
Mortgage prepayment and path-dependent effects of monetary policy.
American Economic Review, 111(9), 2829–2878. https://doi.org/10.1257/aer.20181857
Bergmeir, C., & Benı́tez, J. M. (2012). On the use of
cross-validation for time series predictor evaluation. Information
Sciences, 191, 192–213. https://doi.org/10.1016/j.ins.2011.12.028
Bergmeir, C., Hyndman, R. J., & Koo, B. (2018). A note on the
validity of cross-validation for evaluating autoregressive time series
prediction. Computational Statistics & Data Analysis,
120, 70–83. https://doi.org/10.1016/j.csda.2017.11.003
Berka, P. (1999). PKDD’99 discovery challenge financial
dataset. University of Economics, Prague.
Berkson, J. (1944). Application of the logistic function to bio-assay.
Journal of the American Statistical Association,
39(227), 357–365. https://doi.org/10.1080/01621459.1944.10500699
Berkson, J., & Gage, R. P. (1952). Survival curve for cancer
patients following treatment. Journal of the American Statistical
Association, 47(259), 501–515. https://doi.org/10.2307/2281318
Bernstein, A., Gustafson, M. T., & Lewis, R. (2019). Disaster on the
horizon: The price effect of sea level rise. Journal of Financial
Economics, 134(2), 253–272. https://doi.org/10.1016/j.jfineco.2019.03.013
Bertomeu, J., Cheynel, E., Floyd, E., & Pan, W. (2021). Using
machine learning to detect misstatements. Review of Accounting
Studies, 26(2), 468–519. https://doi.org/10.1007/s11142-020-09563-8
Bertrand, M., Duflo, E., & Mullainathan, S. (2004). How much should
we trust differences-in-differences estimates? The Quarterly Journal
of Economics, 119(1), 249–275. https://doi.org/10.1162/003355304772839588
Bertrand, M., & Morse, A. (2011). Information disclosure, cognitive
biases, and payday borrowing. Journal of Finance,
66(6), 1865–1893. https://doi.org/10.1111/j.1540-6261.2011.01698.x
Bertsimas, D., & Dunn, J. (2017). Optimal classification trees.
Machine Learning, 106(7), 1039–1082. https://doi.org/10.1007/s10994-017-5633-9
Beutel, A., Chen, J., Zhao, Z., & Chi, E. H. (2017). Data decisions
and theoretical implications when adversarially learning fair
representations. FAT/ML Workshop at KDD.
Bharadwaj, P., Jack, W., & Suri, T. (2021). Fintech and household
resilience to shocks: Evidence from digital loans in Kenya.
Journal of Development Economics, 153, 102697. https://doi.org/10.1016/j.jdeveco.2021.102697
Bharath, S. T., & Shumway, T. (2008). Forecasting default with the
Merton distance to default model. Review of Financial
Studies, 21(3), 1339–1369. https://doi.org/10.1093/rfs/hhn044
Bhatt, U., Weller, A., & Moura, J. M. F. (2020). Evaluating and
aggregating feature-based model explanations. Proceedings of the
29th International Joint Conference on Artificial Intelligence
(IJCAI), 3016–3022.
Bhutta, N., & Hizmo, A. (2021). Do minorities pay more for
mortgages? The Review of Financial Studies, 34(2),
763–789. https://doi.org/10.1093/rfs/hhaa047
Bhutta, N., Hizmo, A., & Ringo, D. (2022). How much does racial bias
affect mortgage lending? Evidence from human and
algorithmic credit decisions. Finance and Economics Discussion
Series, (2022-067). https://doi.org/10.17016/FEDS.2022.067
Bhutta, N., Skiba, P. M., & Tobacman, J. (2015b). Payday loan
choices and consequences. Journal of Money, Credit and Banking,
47(2-3), 223–260. https://doi.org/10.1111/jmcb.12175
Bhutta, N., Skiba, P. M., & Tobacman, J. (2015a). Payday loan
choices and consequences. Journal of Money, Credit and Banking,
47(2-3), 223–260. https://doi.org/10.1111/jmcb.12175
Bia, M., Huber, M., & Lafférs, L. (2024). Double machine learning
for sample selection models. Journal of Business and Economic
Statistics, 42(3), 958–969. https://doi.org/10.1080/07350015.2023.2271071
Biamonte, J., Wittek, P., Pancotti, N., Rebentrost, P., Wiebe, N., &
Lloyd, S. (2017). Quantum machine learning. Nature,
549(7671), 195–202. https://doi.org/10.1038/nature23474
Biau, G., & Scornet, E. (2016). A random forest guided tour.
TEST, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7
Bica, I., Alaa, A. M., Jordon, J., & Schaar, M. van der. (2020).
Estimating counterfactual treatment outcomes over time through
adversarially balanced representations. International Conference on
Learning Representations (ICLR).
Bickel, P. J., & Levina, E. (2004). Some theory for
Fisher’s linear discriminant function, “naive
Bayes,” and some alternatives when there are many
more variables than observations. Bernoulli, 10(6),
989–1010. https://doi.org/10.3150/bj/1106314847
Bickel, S., Brückner, M., & Scheffer, T. (2009). Discriminative
learning under covariate shift. Journal of Machine Learning
Research, 10, 2137–2155.
Bierman, H., & Hausman, W. H. (1970). The credit granting decision.
Management Science, 16(8), B519–B532. https://doi.org/10.1287/mnsc.16.8.B519
Bifet, A., & Gavalda, R. (2007). Learning from time-changing data
with adaptive windowing. Proceedings of the 2007 SIAM International
Conference on Data Mining (SDM), 443–448. https://doi.org/10.1137/1.9781611972771.42
Billingsley, P. (1995). Probability and measure (3rd ed.).
Wiley.
Björkegren, D., & Grissen, D. (2020). Behavior revealed in mobile
phone usage predicts credit repayment. The World Bank Economic
Review, 34(3), 618–634. https://doi.org/10.1093/wber/lhz006
Black, F., & Cox, J. C. (1976). Valuing corporate securities: Some
effects of bond indenture provisions. The Journal of Finance,
31(2), 351–367. https://doi.org/10.2307/2326607
Black, F., & Scholes, M. (1973). The pricing of options and
corporate liabilities. Journal of Political Economy,
81(3), 637–654. https://doi.org/10.1086/260062
Blanche, P., Dartigues, J.-F., & Jacqmin-Gadda, H. (2013).
Estimating and comparing time-dependent areas under receiver operating
characteristic curves for censored event times with competing risks.
Statistics in Medicine, 32(30), 5381–5397. https://doi.org/10.1002/sim.5958
Blattner, L., & Nelson, S. (2022). How costly is noise? Data and
disparities in consumer credit. SSRN Electronic Journal.
Bleier, A., Goldfarb, A., & Tucker, C. (2020). Consumer privacy and
the future of data-based innovation and marketing. International
Journal of Research in Marketing, 37(3), 466–480. https://doi.org/10.1016/j.ijresmar.2020.03.006
Blinder, A. S. (1973). Wage discrimination: Reduced form and structural
estimates. The Journal of Human Resources, 8(4),
436–455. https://doi.org/10.2307/144855
Blume, M. E., Lim, F., & MacKinlay, A. C. (1998). The declining
credit quality of U.S. Corporate debt: Myth or reality?
Journal of Finance, 53(4), 1389–1413. https://doi.org/10.1111/0022-1082.00057
Blumenstock, J., Cadamuro, G., & On, R. (2015). Predicting poverty
and wealth from mobile phone metadata. Science,
350(6264), 1073–1076. https://doi.org/10.1126/science.aac4420
Blundell, R., & Powell, J. L. (2003). Endogeneity in nonparametric
and semiparametric regression models. Advances in Economics and
Econometrics: Theory and Applications, Eighth World Congress,
2, 312–357.
Board of Governors of the Federal Reserve System. (2007). Report to
the congress on credit scoring and its effects on the availability and
affordability of credit. Federal Reserve. https://www.federalreserve.gov/boarddocs/rptcongress/creditscore/
Board of Governors of the Federal Reserve System. (2011).
Supervisory guidance on model risk management (SR 11-7).
Federal Reserve. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
Board of Governors of the Federal Reserve System. (2015a). Federal
reserve supervisory assessment of capital planning and positions for
large and noncomplex firms (SR 15-19). Federal Reserve. https://www.federalreserve.gov/supervisionreg/srletters/sr1519.htm
Board of Governors of the Federal Reserve System. (2015b). Federal
reserve supervisory assessment of capital planning and positions for
LISCC firms and large and complex firms (SR 15-18). Federal
Reserve. https://www.federalreserve.gov/supervisionreg/srletters/sr1518.htm
Board of Governors of the Federal Reserve System. (2023). 2023
supervisory stress test results. Federal Reserve. https://www.federalreserve.gov/publications/2023-june-dodd-frank-act-stress-test.htm
Board of Governors of the Federal Reserve System and Federal Deposit
Insurance Corporation and Office of the Comptroller of the Currency.
(2023). Interagency guidance on third-party relationships: Risk
management. 88 Federal Register 37920. https://www.federalregister.gov/documents/2023/06/09/2023-12340/
Board of Governors of the Federal Reserve System and Office of the
Comptroller of the Currency. (2011a). SR 11-7: Guidance
on model risk management. Federal Reserve System. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
Board of Governors of the Federal Reserve System and Office of the
Comptroller of the Currency. (2011b). Supervisory guidance on model
risk management (SR 11-7 / OCC 2011-12).
Federal Reserve Supervision and Regulation Letter SR 11-7.
Board of Governors of the Federal Reserve System and Office of the
Comptroller of the Currency. (2011c). Supervisory guidance on model
risk management (SR 11-7).
Board of Governors of the Federal Reserve System, & Office of the
Comptroller of the Currency. (2011). Supervisory guidance on model
risk management (SR 11-7 / OCC 2011-12) (SR 11-7). Board of
Governors of the Federal Reserve System. https://www.federalreserve.gov/supervisionreg/srletters/sr1107.htm
Bodnaruk, A., Loughran, T., & McDonald, B. (2015). Using
10-K text to gauge financial constraints. Journal of
Financial and Quantitative Analysis, 50(4), 623–646. https://doi.org/10.1017/S0022109015000411
Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017).
Enriching word vectors with subword information. Transactions of the
Association for Computational Linguistics, 5, 135–146. https://doi.org/10.1162/tacl_a_00051
Bolton, P., & Kacperczyk, M. (2021). Do investors care about carbon
risk? Journal of Financial Economics, 142(2), 517–549.
https://doi.org/10.1016/j.jfineco.2021.05.008
Bolton, R. J., & Hand, D. J. (2002). Statistical fraud detection: A
review. Statistical Science, 17(3), 235–249. https://doi.org/10.1214/ss/1042727940
Bonacich, P. (1972). Factoring and weighting approaches to status scores
and clique identification. Journal of Mathematical Sociology,
2(1), 113–120. https://doi.org/10.1080/0022250X.1972.9989806
Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H. B.,
Patel, S., Ramage, D., Segal, A., & Seth, K. (2017). Practical
secure aggregation for privacy-preserving machine learning.
Proceedings of the 2017 ACM SIGSAC Conference on Computer and
Communications Security (CCS), 1175–1191. https://doi.org/10.1145/3133956.3133982
Bonsall, S. B., Koharki, K., & Neamtiu, M. (2017). The disciplining
effect of credit default swap trading on the quality of credit rating
agencies. Journal of Accounting and Economics,
63(2–3), 182–208. https://doi.org/10.1016/j.jacceco.2016.12.002
Bonvini, M., & Kennedy, E. H. (2022). Sensitivity analysis via the
proportion of unmeasured confounding. Journal of the American
Statistical Association, 117(539), 1540–1550. https://doi.org/10.1080/01621459.2020.1864382
Boot, A. W. A. (2000). Relationship banking: What do we know?
Journal of Financial Intermediation, 9(1), 7–25. https://doi.org/10.1006/jfin.2000.0282
Boot, A., Hoffmann, P., Laeven, L., & Ratnovski, L. (2021). Fintech:
What’s old, what’s new? Journal of Financial Stability,
53, 100836. https://doi.org/10.1016/j.jfs.2020.100836
Borisov, V., Leemann, T., Seßler, K., Haug, J., Pawelczyk, M., &
Kasneci, G. (2024). Deep neural networks and tabular data: A survey.
IEEE Transactions on Neural Networks and Learning Systems,
35(6), 7499–7519. https://doi.org/10.1109/TNNLS.2022.3229161
Borri, N., & Verdelhan, A. (2023). Sovereign risk premia and global
macroeconomic conditions. Journal of Financial Economics,
147(1), 172–197. https://doi.org/10.1016/j.jfineco.2022.10.001
Borusyak, K., Jaravel, X., & Spiess, J. (2024). Revisiting
event-study designs: Robust and efficient estimation. Review of
Economic Studies, 91(6), 3253–3285. https://doi.org/10.1093/restud/rdae007
Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992). A training
algorithm for optimal margin classifiers. 144–152. https://doi.org/10.1145/130385.130401
Bowen, D., & Ungar, L. (2020). Generalized SHAP: Generating multiple
types of explanations in machine learning. arXiv Preprint
arXiv:2006.07155.
Boyd, S., & Vandenberghe, L. (2004). Convex optimization.
Cambridge University Press. https://doi.org/10.1017/CBO9780511804441
Bracke, P., Datta, A., Jung, C., & Sen, S. (2019). Machine learning
explainability in finance: An application to default risk analysis.
Bank of England Staff Working Paper, (816).
Braun, M., & Schweidel, D. A. (2011). Modeling customer lifetimes
with multiple causes of churn. Marketing Science,
30(5), 881–902. https://doi.org/10.1287/mksc.1110.0665
Breck, E., Cai, S., Nielsen, E., Salib, M., & Sculley, D. (2017).
The ML test score: A rubric for ML production
readiness and technical debt reduction. IEEE International
Conference on Big Data, 1123–1132. https://doi.org/10.1109/BigData.2017.8258038
Breeden, J. L. (2007a). Modeling data with multiple time dimensions.
Computational Statistics & Data Analysis, 51(9),
4761–4785. https://doi.org/10.1016/j.csda.2007.01.023
Breeden, J. L. (2007b). Modeling data with multiple time dimensions.
Computational Statistics and Data Analysis, 51(9),
4761–4785. https://doi.org/10.1016/j.csda.2006.07.026
Breeden, J. L. (2020). A survey of machine learning in credit risk.
Journal of Credit Risk, 16(1), 1–62.
Breiman, L. (1996a). Bagging predictors. Machine Learning,
24(2), 123–140. https://doi.org/10.1007/BF00058655
Breiman, L. (1996b). Bagging predictors. Machine Learning,
24(2), 123–140. https://doi.org/10.1007/BF00058655
Breiman, L. (1996c). Heuristics of instability and stabilization in
model selection. The Annals of Statistics, 24(6),
2350–2383. https://doi.org/10.1214/aos/1032181158
Breiman, L. (1996d). Stacked regressions. Machine Learning,
24(1), 49–64. https://doi.org/10.1007/BF00117832
Breiman, L. (2001). Random forests. Machine Learning,
45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984).
Classification and regression trees. Wadsworth.
Breslow, N. E. (1974). Covariance analysis of censored survival data.
Biometrics, 30(1), 89–99. https://doi.org/10.2307/2529620
Brevoort, K. P., Grimm, P., & Kambara, M. (2016). Credit invisibles
and the unscored. Cityscape, 18(2), 9–34.
Bricken, T., Templeton, A., Batson, J., Chen, B., Jermyn, A., Conerly,
T., Turner, N., Anil, C., Denison, C., Askell, A., et al. (2023).
Towards monosemanticity: Decomposing language models with dictionary
learning. Transformer Circuits Thread. https://transformer-circuits.pub/2023/monosemantic-features/index.html
Brier, G. W. (1950). Verification of forecasts expressed in terms of
probability. Monthly Weather Review, 78(1), 1–3.
Broder, A. Z. (1997). On the resemblance and containment of
documents. 21–29. https://doi.org/10.1109/SEQUEN.1997.666900
Brodersen, K. H., Gallusser, F., Koehler, J., Remy, N., & Scott, S.
L. (2015). Inferring causal impact using Bayesian
structural time-series models. Annals of Applied Statistics,
9(1), 247–274. https://doi.org/10.1214/14-AOAS788
Bronstein, M. M., Bruna, J., LeCun, Y., Szlam, A., & Vandergheynst,
P. (2017). Geometric deep learning: Going beyond Euclidean
data. IEEE Signal Processing Magazine, 34(4), 18–42.
https://doi.org/10.1109/MSP.2017.2693418
Brown, I., & Mues, C. (2012). An experimental comparison of
classification algorithms for imbalanced credit scoring data sets.
Expert Systems with Applications, 39(3), 3446–3453. https://doi.org/10.1016/j.eswa.2011.09.033
Brown, T. B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal,
P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., Agarwal, S.,
Herbert-Voss, A., Krueger, G., Henighan, T., Child, R., Ramesh, A.,
Ziegler, D. M., Wu, J., Winter, C., … Amodei, D. (2020). Language models
are few-shot learners. Advances in Neural Information Processing
Systems 33 (NeurIPS), 1877–1901.
Buchak, G., Matvos, G., Piskorski, T., & Seru, A. (2018). Fintech,
regulatory arbitrage, and the rise of shadow banks. Journal of
Financial Economics, 130(3), 453–483. https://doi.org/10.1016/j.jfineco.2018.03.011
Bücker, M., Kampen, M. van, & Krämer, W. (2013). Reject inference in
consumer credit scoring with nonignorable missing data. Journal of
Banking & Finance, 37(3), 1040–1045. https://doi.org/10.1016/j.jbankfin.2012.11.002
Bühlmann, P., & Hothorn, T. (2007). Boosting algorithms:
Regularization, prediction and model fitting. Statistical
Science, 22(4), 477–505. https://doi.org/10.1214/07-STS242
Buja, A., & Stuetzle, W. (2006). Observations on bagging.
Statistica Sinica, 16(2), 323–351.
Bumacov, V., Ashta, A., & Singh, P. (2014). The use of credit
scoring in microfinance institutions and their outreach. Strategic
Change, 23(7-8), 401–413. https://doi.org/10.1002/jsc.1985
Bursztyn, L., Fiorin, S., Gottlieb, D., & Kanz, M. (2019). Moral
incentives in credit card debt repayment: Evidence from a field
experiment. Journal of Political Economy, 127(4),
1641–1683. https://doi.org/10.1086/701605
Bussmann, N., Giudici, P., Marinelli, D., & Papenbrock, J. (2021).
Explainable AI in fintech risk management. Frontiers in
Artificial Intelligence, 3, 26. https://doi.org/10.3389/frai.2020.00026
Butaru, F., Chen, Q., Clark, B., Das, S., Lo, A. W., & Siddique, A.
(2016). Risk and risk management in the credit card industry.
Journal of Banking and Finance, 72, 218–239. https://doi.org/10.1016/j.jbankfin.2016.07.015
Buuren, S. van, & Groothuis-Oudshoorn, K. (2011). mice: Multivariate imputation by chained equations
in R. Journal of Statistical Software,
45(3), 1–67. https://doi.org/10.18637/jss.v045.i03
Cadena, X., & Schoar, A. (2011). Remembering to pay? Reminders
vs. Financial incentives for loan payments (NBER Working Paper
17020). National Bureau of Economic Research. https://doi.org/10.3386/w17020
Calabrese, R. (2014). Downturn loss given default: Mixture distribution
estimation. European Journal of Operational Research,
237(1), 271–277. https://doi.org/10.1016/j.ejor.2014.01.043
Calabrese, R., Osmetti, S. A., & Zanin, L. (2024). Sample selection
bias in non-traditional lending: A copula-based approach for imbalanced
data. Socio-Economic Planning Sciences, 95, 102045. https://doi.org/10.1016/j.seps.2024.102045
Calabrese, R., & Zenga, M. (2010). Bank loan recovery rates:
Measuring and nonparametric density estimation. Journal of Banking
& Finance, 34(5), 903–911. https://doi.org/10.1016/j.jbankfin.2009.10.001
Callaway, B., & Sant’Anna, P. H. C. (2021).
Difference-in-differences with multiple time periods. Journal of
Econometrics, 225(2), 200–230. https://doi.org/10.1016/j.jeconom.2020.12.001
Calonico, S., Cattaneo, M. D., & Titiunik, R. (2014). Robust
nonparametric confidence intervals for regression-discontinuity designs.
Econometrica, 82(6), 2295–2326. https://doi.org/10.3982/ECTA11757
Calzolari, G., & Nardotto, M. (2017). Effective reminders.
Management Science, 63(9), 2915–2932. https://doi.org/10.1287/mnsc.2016.2499
Cameron, A. C., Gelbach, J. B., & Miller, D. L. (2008).
Bootstrap-based improvements for inference with clustered errors.
Review of Economics and Statistics, 90(3), 414–427. https://doi.org/10.1162/rest.90.3.414
Cameron, A. C., & Miller, D. L. (2015). A practitioner’s guide to
cluster-robust inference. Journal of Human Resources,
50(2), 317–372. https://doi.org/10.3368/jhr.50.2.317
Campbell, J. L., Chen, H., Dhaliwal, D. S., Lu, H., & Steele, L. B.
(2014). The information content of mandatory risk factor disclosures in
corporate filings. Review of Accounting Studies,
19(1), 396–455. https://doi.org/10.1007/s11142-013-9258-3
Campbell, J. Y., & Cocco, J. F. (2015). A model of mortgage default.
The Journal of Finance, 70(4), 1495–1554. https://doi.org/10.1111/jofi.12252
Campbell, J. Y., Hilscher, J., & Szilagyi, J. (2008). In search of
distress risk. The Journal of Finance, 63(6),
2899–2939. https://doi.org/10.1111/j.1540-6261.2008.01416.x
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., &
Tzoumas, K. (2015). Apache Flink: Stream and batch
processing in a single engine. IEEE Data Engineering Bulletin,
38(4), 28–38.
Card, D., & Krueger, A. B. (1994). Minimum wages and employment: A
case study of the fast-food industry in new jersey and pennsylvania.
American Economic Review, 84(4), 772–793.
Carlehed, M., & Petrov, A. (2012). A methodology for
point-in-time-through-the-cycle probability of default decomposition in
risk classification systems. Journal of Risk Model Validation,
6(3), 3–25. https://doi.org/10.21314/JRMV.2012.091
Caruana, R., Lou, Y., Gehrke, J., Koch, P., Sturm, M., & Elhadad, N.
(2015). Intelligible models for healthcare: Predicting pneumonia risk
and hospital 30-day readmission. Proceedings of the 21st ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining,
1721–1730. https://doi.org/10.1145/2783258.2788613
Carvalho, V. M., Nirei, M., Saito, Y. U., & Tahbaz-Salehi, A.
(2021). Supply chain disruptions: Evidence from the Great
East Japan earthquake. Quarterly Journal
of Economics, 136(2), 1255–1321. https://doi.org/10.1093/qje/qjaa044
Casella, G., & Berger, R. L. (2002). Statistical inference
(2nd ed.). Duxbury.
Castrén, O., Dées, S., & Zaher, F. (2010). Stress-testing euro area
corporate default probabilities using a global macroeconomic model.
Journal of Financial Stability, 6(2), 64–78. https://doi.org/10.1016/j.jfs.2009.10.002
Cattaneo, M. D., Jansson, M., & Ma, X. (2020). Simple local
polynomial density estimators. Journal of the American Statistical
Association, 115(531), 1449–1455. https://doi.org/10.1080/01621459.2019.1635480
Cellini, S. R., Ferreira, F., & Rothstein, J. (2010). The value of
school facility investments: Evidence from a dynamic regression
discontinuity design. Quarterly Journal of Economics,
125(1), 215–261. https://doi.org/10.1162/qjec.2010.125.1.215
Central Bank of Kenya. (2013). Prudential guidelines for
institutions licensed under the banking act (CBK/PG/04 risk
management). Central Bank of Kenya. https://www.centralbank.go.ke/
Central Bank of Kenya. (2020). Banking (credit reference bureau)
regulations. Legal Notice, as amended 2020. https://www.centralbank.go.ke/credit-reference-bureaus/
Central Bank of Kenya. (2022). Digital credit providers regulations,
2022. Central Bank of Kenya. https://www.centralbank.go.ke/digital-credit-providers/
Cerezo, M., Arrasmith, A., Babbush, R., Benjamin, S. C., Endo, S.,
Fujii, K., McClean, J. R., Mitarai, K., Yuan, X., Cincio, L., &
Coles, P. J. (2021). Variational quantum algorithms. Nature Reviews
Physics, 3(9), 625–644. https://doi.org/10.1038/s42254-021-00348-9
Cessie, S. le, & Houwelingen, J. C. van. (1992). Ridge estimators in
logistic regression. Journal of the Royal Statistical Society.
Series C (Applied Statistics), 41(1), 191–201. https://doi.org/10.2307/2347628
CGAP. (2019). Digital credit market monitoring in
Tanzania. Consultative Group to Assist the Poor. https://www.cgap.org/research/publication/digital-credit-market-monitoring-tanzania
Chaisemartin, C. de, & D’Haultfœuille, X. (2020). Two-way fixed
effects estimators with heterogeneous treatment effects. American
Economic Review, 110(9), 2964–2996. https://doi.org/10.1257/aer.20181169
Challu, C., Olivares, K. G., Oreshkin, B. N., Garza Ramirez, F.,
Mergenthaler-Canseco, M., & Dubrawski, A. (2023).
NHITS: Neural hierarchical interpolation for time series
forecasting. Proceedings of the AAAI Conference on Artificial
Intelligence, 37(6), 6989–6997. https://doi.org/10.1609/aaai.v37i6.25854
Chan, K. C. G., & Yam, S. C. P. (2014). Oracle, multiple robust and
multipurpose calibration in a missing response problem. Statistical
Science, 29(3), 380–396. https://doi.org/10.1214/14-STS486
Chandrashekaran, M., & Sinha, R. K. (1995). Isolating the
determinants of innovativeness: A split-population tobit
(SPOT) duration model. Journal of Marketing
Research, 32(4), 444–456. https://doi.org/10.1177/002224379503200407
Chang, C.-C., & Lin, C.-J. (2011). LIBSVM: A library
for support vector machines. ACM Transactions on Intelligent Systems
and Technology, 2(3), 1–27. https://doi.org/10.1145/1961189.1961199
Chapelle, O., Schölkopf, B., & Zien, A. (2006). Semi-supervised
learning. MIT Press.
Chava, S., & Jarrow, R. A. (2004). Bankruptcy prediction with
industry effects. Review of Finance, 8(4), 537–569. https://doi.org/10.1093/rof/8.4.537
Chava, S., Paradkar, N., & Zhang, Y. (2021). Winners and losers of
marketplace lending: Evidence from borrower credit dynamics. Journal
of Financial Economics, 142(3), 1186–1208. https://doi.org/10.1016/j.jfineco.2021.05.027
Chava, S., Stefanescu, C., & Turnbull, S. (2011). Modeling the loss
distribution. Management Science, 57(7), 1267–1287. https://doi.org/10.1287/mnsc.1110.1345
Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmeyer, W. P.
(2002). SMOTE: Synthetic minority over-sampling technique.
Journal of Artificial Intelligence Research, 16,
321–357. https://doi.org/10.1613/jair.953
Chefer, H., Gur, S., & Wolf, L. (2021). Transformer interpretability
beyond attention visualization. Proceedings of the IEEE/CVF
Conference on Computer Vision and Pattern Recognition (CVPR),
782–791. https://doi.org/10.1109/CVPR46437.2021.00084
Chen, C., Li, O., Tao, D., Barnett, A., Rudin, C., & Su, J. K.
(2019). This looks like that: Deep learning for interpretable image
recognition. Advances in Neural Information Processing Systems 32
(NeurIPS 2019).
Chen, H. (2010). Macroeconomic conditions and the puzzles of credit
spreads and capital structure. The Journal of Finance,
65(6), 2171–2212. https://doi.org/10.1111/j.1540-6261.2010.01613.x
Chen, H., Janizek, J. D., Lundberg, S., & Lee, S.-I. (2020). True to
the model or true to the data? ICML Workshop on Human
Interpretability in Machine Learning.
Chen, L., Jia, N., Jiao, Z., Zhao, H., Cui, R., & Wang, H. (2025). A
semi-supervised reject inference framework with hierarchical
heterogeneous networks for credit scoring. International Journal of
Forecasting, 41(3), 920–939. https://doi.org/10.1016/j.ijforecast.2024.07.011
Chen, M. A., Wu, Q., & Yang, B. (2019). How valuable is FinTech
innovation? The Review of Financial Studies, 32(5),
2062–2106. https://doi.org/10.1093/rfs/hhy130
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable
tree boosting system. Proceedings of the 22nd ACM
SIGKDD International Conference on Knowledge Discovery and Data
Mining, 785–794. https://doi.org/10.1145/2939672.2939785
Cheng, D., Tu, Y., Ma, Z., Niu, Z., & Zhang, L. (2019). Risk
assessment for networked-guarantee loans using high-order graph
attention representation. Proceedings of the 28th International
Joint Conference on Artificial Intelligence (IJCAI), 5822–5828. https://doi.org/10.24963/ijcai.2019/807
Cheng, K., Fan, T., Jin, Y., Liu, Y., Chen, T., Papadopoulos, D., &
Yang, Q. (2021). SecureBoost: A lossless federated learning
framework. IEEE Intelligent Systems, 36, 87–98. https://doi.org/10.1109/MIS.2021.3082561
Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C.,
Newey, W., & Robins, J. (2018). Double/debiased machine learning for
treatment and structural parameters. The Econometrics Journal,
21(1), C1–C68. https://doi.org/10.1111/ectj.12097
Chernozhukov, V., Escanciano, J. C., Ichimura, H., Newey, W. K., &
Robins, J. M. (2022). Locally robust semiparametric estimation.
Econometrica, 90(4), 1501–1535. https://doi.org/10.3982/ECTA16294
Chernozhukov, V., Fernández-Val, I., & Galichon, A. (2010). Quantile
and probability curves without crossing. Econometrica,
78(3), 1093–1125. https://doi.org/10.3982/ECTA7880
Chiang, W.-L., Liu, X., Si, S., Li, Y., Bengio, S., & Hsieh, C.-J.
(2019). Cluster-GCN: An efficient algorithm for training
deep and large graph convolutional networks. Proceedings of the 25th
ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining, 257–266. https://doi.org/10.1145/3292500.3330925
Chiburis, R. C., Das, J., & Lokshin, M. (2012). A practical
comparison of the bivariate probit and linear IV estimators.
Economics Letters, 117(3), 762–766. https://doi.org/10.1016/j.econlet.2012.08.037
Chouldechova, A. (2017). Fair prediction with disparate impact: A study
of bias in recidivism prediction instruments. Big Data,
5(2), 153–163. https://doi.org/10.1089/big.2016.0047
Chow, G. C. (1960). Tests of equality between sets of coefficients in
two linear regressions. Econometrica, 28(3), 591–605.
https://doi.org/10.2307/1910133
Christen, P. (2012). A survey of indexing techniques for scalable record
linkage and deduplication. IEEE Transactions on Knowledge and Data
Engineering, 24(9), 1537–1555. https://doi.org/10.1109/TKDE.2011.127
Chung, F. R. K. (1997). Spectral graph theory. CBMS Regional
Conference Series in Mathematics, 92.
Ciampi, F. (2015). Corporate governance characteristics and default
prediction modeling for small enterprises: An empirical analysis of
Italian firms. Journal of Business Research,
68(5), 1012–1025. https://doi.org/10.1016/j.jbusres.2014.10.003
Clark, K., Khandelwal, U., Levy, O., & Manning, C. D. (2019). What
does BERT look at? An analysis of BERT’s
attention. Proceedings of the 2019 ACL Workshop BlackboxNLP,
276–286. https://doi.org/10.18653/v1/W19-4828
Clayton, D., & Cuzick, J. (1985). Multivariate generalizations of
the proportional hazards model. Journal of the Royal Statistical
Society. Series A (General), 148(2), 82–117. https://doi.org/10.2307/2981943
Cohen, L., & Frazzini, A. (2008). Economic links and predictable
returns. The Journal of Finance, 63(4), 1977–2011. https://doi.org/10.1111/j.1540-6261.2008.01379.x
Cohen, L., Malloy, C., & Nguyen, Q. (2020). Lazy prices. The
Journal of Finance, 75(3), 1371–1415. https://doi.org/10.1111/jofi.12885
Collin-Dufresne, P., Goldstein, R. S., & Martin, J. S. (2001). The
determinants of credit spread changes. The Journal of Finance,
56(6), 2177–2207. https://doi.org/10.1111/0022-1082.00402
Comisión Nacional Bancaria y de Valores. (2024). Disposiciones de
carácter general aplicables a las instituciones de crédito (Circular Única de Bancos). As amended through
2024. https://www.cnbv.gob.mx/
Conley, T. G., Hansen, C. B., & Rossi, P. E. (2012). Plausibly
exogenous. Review of Economics and Statistics, 94(1),
260–272. https://doi.org/10.1162/REST_a_00139
Conselho Monetário Nacional. (2017). Resolução no. 4.557: Integrated
risk management and capital management structure. Conselho
Monetário Nacional, Banco Central do Brasil. https://www.bcb.gov.br/
Consumer Financial Protection Bureau. (2011). Regulation b: Equal
credit opportunity act. 12 C.F.R. Part 1002.
Consumer Financial Protection Bureau. (2013a). Equal credit
opportunity act (ECOA) examination procedures. CFPB Supervision and
Examination Manual. https://www.consumerfinance.gov/compliance/supervision-examinations/
Consumer Financial Protection Bureau. (2013b). Regulation
B, 12 CFR § 1002.9:
notifications. https://www.consumerfinance.gov/rules-policy/regulations/1002/9/
Consumer Financial Protection Bureau. (2013c). Regulation
B: Equal credit opportunity (12 CFR part
1002). https://www.consumerfinance.gov/rules-policy/regulations/1002/
Consumer Financial Protection Bureau. (2014). Using publicly
available information to proxy for unidentified race and ethnicity: A
methodology and assessment. CFPB Research Report. https://www.consumerfinance.gov/data-research/research-reports/
Consumer Financial Protection Bureau. (2017). List of consumer
reporting companies. CFPB. https://www.consumerfinance.gov/consumer-tools/credit-reports-and-scores/consumer-reporting-companies/
Consumer Financial Protection Bureau. (2022b). Circular 2022-03:
Adverse action notification requirements in connection with credit
decisions based on complex algorithms. CFPB. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/
Consumer Financial Protection Bureau. (2022a). Circular 2022-03:
Adverse action notification requirements in connection with credit
decisions based on complex algorithms. U.S. Consumer Financial
Protection Bureau. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/
Consumer Financial Protection Bureau. (2022f). Consumer financial
protection circular 2022-03: Adverse action notification
requirements in connection with credit decisions based on complex
algorithms. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03-adverse-action-notification-requirements-in-connection-with-credit-decisions-based-on-complex-algorithms/
Consumer Financial Protection Bureau. (2022d). Consumer financial
protection circular 2022-03: Adverse action notification requirements in
connection with credit decisions based on complex algorithms
[Circular]. CFPB.
Consumer Financial Protection Bureau. (2022e). Consumer financial
protection circular 2022-03: Adverse action notification requirements in
connection with credit decisions based on complex algorithms.
Consumer Financial Protection Bureau.
Consumer Financial Protection Bureau. (2022c). Consumer financial
protection circular 2022-03: Adverse action notification requirements in
connection with credit decisions based on complex algorithms. https://www.consumerfinance.gov/compliance/circulars/circular-2022-03/
Consumer Financial Protection Bureau. (2023a). Chatbots in consumer
finance. CFPB. https://www.consumerfinance.gov/data-research/research-reports/chatbots-in-consumer-finance/
Consumer Financial Protection Bureau. (2023b). Consumer financial
protection circular 2023-03: Adverse action notification
requirements and the proper use of the CFPB’s sample forms
provided in regulation B. https://www.consumerfinance.gov/compliance/circulars/circular-2023-03-adverse-action-notification-requirements-and-the-proper-use-of-the-cfpbs-sample-forms-provided-in-regulation-b/
Consumer Financial Protection Bureau. (2024a). Home mortgage
disclosure act (HMDA) public loan/application
register. FFIEC and CFPB Public Data Platform.
Consumer Financial Protection Bureau. (2024b). Required rulemaking
on personal financial data rights (section 1033) [Final Rule, 12
CFR Part 1033]. https://www.consumerfinance.gov/rules-policy/final-rules/personal-financial-data-rights/
Cont, R., Moussa, A., & Santos, E. B. (2013). Network structure and
systemic risk in banking systems. Handbook on Systemic Risk,
327–368. https://doi.org/10.1017/CBO9781139151184.018
Copas, J. B., & Li, H. G. (1997). Inference for non-random samples.
Journal of the Royal Statistical Society. Series B
(Methodological), 59(1), 55–95. https://doi.org/10.1111/1467-9868.00055
Corbett-Davies, S., Gaebler, J. D., Nilforoshan, H., Shroff, R., &
Goel, S. (2023). The measure and mismeasure of fairness. Journal of
Machine Learning Research, 24(312), 1–117.
Corbett-Davies, S., Pierson, E., Feller, A., Goel, S., & Huq, A.
(2017). Algorithmic decision making and the cost of fairness.
Proceedings of the 23rd ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, 797–806. https://doi.org/10.1145/3097983.3098095
Corcoran, A. W. (1978). The use of exponentially-smoothed transition
matrices to improve forecasting of cash flows from accounts receivable.
Management Science, 24(7), 732–739. https://doi.org/10.1287/mnsc.24.7.732
Cornaggia, J., & Cornaggia, K. J. (2013). Estimating the costs of
issuer-paid credit ratings. Review of Financial Studies,
26(9), 2229–2269. https://doi.org/10.1093/rfs/hht041
Cornelli, G., Frost, J., Gambacorta, L., Rau, P. R., Wardrop, R., &
Ziegler, T. (2023a). Fintech and big tech credit: Drivers of the growth
of digital lending. Journal of Banking and Finance,
148, 106742. https://doi.org/10.1016/j.jbankfin.2022.106742
Cornelli, G., Frost, J., Gambacorta, L., Rau, P. R., Wardrop, R., &
Ziegler, T. (2023b). Fintech and big tech credit: Drivers of the
growth of digital lending (BIS Working Paper 1028). Bank for
International Settlements. https://www.bis.org/publ/work1028.htm
Cortes, C., & Vapnik, V. (1995). Support-vector networks.
Machine Learning, 20(3), 273–297. https://doi.org/10.1007/BF00994018
Costello, A. M., Down, A. K., & Mehta, M. N. (2020). Machine + man:
A field experiment on the role of discretion in augmenting AI-based
lending models. Journal of Accounting and Economics,
70(2–3), 101360. https://doi.org/10.1016/j.jacceco.2020.101360
Cover, T. M., & Thomas, J. A. (2006a). Elements of information
theory.
Cover, T. M., & Thomas, J. A. (2006b). Elements of information
theory. Wiley Series in Telecommunications and Signal Processing,
2nd Ed. https://doi.org/10.1002/047174882X
Covert, I., Lundberg, S. M., & Lee, S.-I. (2020). Understanding
global feature contributions with additive importance measures.
Advances in Neural Information Processing Systems 33 (NeurIPS
2020).
Covert, I., Lundberg, S. M., & Lee, S.-I. (2021). Explaining by
removing: A unified framework for model explanation. Journal of
Machine Learning Research, 22(209), 1–90.
Cox, D. R. (1958). The regression analysis of binary sequences.
Journal of the Royal Statistical Society. Series B
(Methodological), 20(2), 215–242.
Cox, D. R. (1972). Regression models and life-tables. Journal of the
Royal Statistical Society. Series B (Methodological),
34(2), 187–220.
Cox, D. R. (1975). Partial likelihood. Biometrika,
62(2), 269–276. https://doi.org/10.1093/biomet/62.2.269
Crankshaw, D., Wang, X., Zhou, G., Franklin, M. J., Gonzalez, J. E.,
& Stoica, I. (2017). Clipper: A low-latency online prediction
serving system. USENIX Symposium on Networked Systems Design and
Implementation (NSDI), 613–627.
Crawford, G. S., Pavanini, N., & Schivardi, F. (2018). Asymmetric
information and imperfect competition in lending markets. American
Economic Review, 108(7), 1659–1701. https://doi.org/10.1257/aer.20150487
Credit Fusion, & Will Cukierski. (2011). Give me some
credit. Kaggle Competition.
Credit Information Center of Vietnam. (2023). Annual report on
credit information activities. CIC, State Bank of Vietnam. https://cic.gov.vn/
Crook, J. N., & Banasik, J. (2004). Does reject inference really
improve the performance of application scoring models? Journal of
Banking & Finance, 28(4), 857–874. https://doi.org/10.1016/j.jbankfin.2003.10.010
Crook, J. N., & Bellotti, T. (2010). Time varying and dynamic models
for default risk in consumer loans. Journal of the Royal Statistical
Society: Series A, 173(2), 283–305. https://doi.org/10.1111/j.1467-985X.2009.00617.x
Crook, J. N., Edelman, D. B., & Thomas, L. C. (2007). Recent
developments in consumer credit risk assessment. European Journal of
Operational Research, 183(3), 1447–1465. https://doi.org/10.1016/j.ejor.2006.09.100
Crouhy, M., Galai, D., & Mark, R. (2001). Prototype risk rating
system. Journal of Banking & Finance, 25(1),
47–95. https://doi.org/10.1016/S0378-4266(00)00117-5
Cybenko, G. (1989). Approximation by superpositions of a sigmoidal
function. Mathematics of Control, Signals and Systems,
2(4), 303–314. https://doi.org/10.1007/BF02551274
Cyert, R. M., Davidson, H. J., & Thompson, G. L. (1962). Estimation
of the allowance for doubtful accounts by Markov chains.
Management Science, 8(3), 287–303. https://doi.org/10.1287/mnsc.8.3.287
D’Haultfoeuille, X. (2010). A new instrumental method for dealing with
endogenous selection. Journal of Econometrics, 154(1),
1–15. https://doi.org/10.1016/j.jeconom.2009.06.003
Dal Pozzolo, A., Caelen, O., Johnson, R. A., & Bontempi, G. (2015).
Calibrating probability with undersampling for unbalanced
classification. 159–166. https://doi.org/10.1109/SSCI.2015.33
Daniel, K., Titman, S., & Wei, K. J. (2001). Explaining the
cross-section of stock returns in japan: Factors or characteristics?
The Journal of Finance, 56(2), 743–766.
Daniels, M. J., & Hogan, J. W. (2008). Missing data in
longitudinal studies: Strategies for bayesian modeling and sensitivity
analysis. Chapman; Hall/CRC. https://doi.org/10.1201/9781420011180
Das, S. R., & Chen, M. Y. (2007). Yahoo! For
Amazon: Sentiment extraction from small talk on the web.
Management Science, 53(9), 1375–1388. https://doi.org/10.1287/mnsc.1070.0704
Das, S. R., Duffie, D., Kapadia, N., & Saita, L. (2007). Common
failings: How corporate defaults are correlated. Journal of
Finance, 62(1), 93–117. https://doi.org/10.1111/j.1540-6261.2007.01202.x
Dastile, X., Celik, T., & Potsane, M. (2020). Statistical and
machine learning models in credit scoring: A systematic literature
survey. Applied Soft Computing, 91, 106263. https://doi.org/10.1016/j.asoc.2020.106263
Davis, J., & Goadrich, M. (2006). The relationship between
precision-recall and ROC curves. 233–240. https://doi.org/10.1145/1143844.1143874
Dawid, A. P. (1982). The well-calibrated bayesian. Journal of the
American Statistical Association, 77(379), 605–610. https://doi.org/10.2307/2287720
Defferrard, M., Bresson, X., & Vandergheynst, P. (2016).
Convolutional neural networks on graphs with fast localized spectral
filtering. Advances in Neural Information Processing Systems 29
(NIPS 2016).
DeFusco, A. A., & Paciorek, A. (2017). The interest rate elasticity
of mortgage demand: Evidence from bunching at the conforming loan limit.
American Economic Journal: Economic Policy, 9(1),
210–240. https://doi.org/10.1257/pol.20140108
DeGroot, M. H., & Fienberg, S. E. (1983). The comparison and
evaluation of forecasters. The Statistician, 32(1/2),
12–22. https://doi.org/10.2307/2987588
DellaVigna, S., & Linos, E. (2022). RCTs to scale:
Comprehensive evidence from two nudge units. Econometrica,
90(1), 81–116. https://doi.org/10.3982/ECTA18709
DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988a).
Comparing the areas under two or more correlated receiver operating
characteristic curves. Biometrics, 44(3), 837–845. https://doi.org/10.2307/2531595
DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988b).
Comparing the areas under two or more correlated receiver operating
characteristic curves: A nonparametric approach. Biometrics,
44(3), 837–845. https://doi.org/10.2307/2531595
Demarta, S., & McNeil, A. J. (2005). The t copula and related copulas.
International Statistical Review, 73(1), 111–129. https://doi.org/10.1111/j.1751-5823.2005.tb00254.x
Demirgüç-Kunt, A., Klapper, L., Singer, D., & Ansar, S. (2022).
The global findex database 2021: Financial inclusion, digital
payments, and resilience in the age of COVID-19. https://www.worldbank.org/en/publication/globalfindex
Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum
likelihood from incomplete data via the EM algorithm. Journal of the
Royal Statistical Society. Series B (Methodological),
39(1), 1–38.
Demšar, J. (2006). Statistical comparisons of classifiers over multiple
data sets. Journal of Machine Learning Research, 7,
1–30.
Demyanyk, Y., & Van Hemert, O. (2011). Understanding the subprime
mortgage crisis. The Review of Financial Studies,
24(6), 1848–1880. https://doi.org/10.1093/rfs/hhp033
Deng, Y., Quigley, J. M., & Van Order, R. (2000). Mortgage
terminations, heterogeneity and the exercise of mortgage options.
Econometrica, 68(2), 275–307. https://doi.org/10.1111/1468-0262.00110
Dettmers, T., Pagnoni, A., Holtzman, A., & Zettlemoyer, L. (2023).
QLoRA: Efficient finetuning of quantized LLMs.
Advances in Neural Information Processing Systems 36
(NeurIPS), 10088–10115.
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019).
BERT: Pre-training of deep bidirectional transformers for
language understanding. Proceedings of the 2019 Conference of the
North American Chapter of the Association for Computational Linguistics
(NAACL), 4171–4186. https://doi.org/10.18653/v1/N19-1423
DeYoung, R., Glennon, D., & Nigro, P. (2008). Borrower-lender
distance, credit scoring, and loan performance: Evidence from
informational-opaque small business borrowers. Journal of Financial
Intermediation, 17(1), 113–143. https://doi.org/10.1016/j.jfi.2007.07.002
Dhurandhar, A., Chen, P.-Y., Luss, R., Tu, C.-C., Ting, P., Shanmugam,
K., & Das, P. (2018). Explanations based on the missing: Towards
contrastive explanations with pertinent negatives. Advances in
Neural Information Processing Systems 31 (NeurIPS 2018).
Diamond, D. W. (1984). Financial intermediation and delegated
monitoring. The Review of Economic Studies, 51(3),
393–414. https://doi.org/10.2307/2297430
Diamond, D. W. (1991). Monitoring and reputation: The choice between
bank loans and directly placed debt. Journal of Political
Economy, 99(4), 689–721. https://doi.org/10.1086/261775
Dietterich, T. G. (1998). Approximate statistical tests for comparing
supervised classification learning algorithms. Neural
Computation, 10(7), 1895–1923. https://doi.org/10.1162/089976698300017197
Dirick, L., Claeskens, G., & Baesens, B. (2017). Time to default in
credit scoring using survival analysis: A benchmark study. Journal
of the Operational Research Society, 68(6), 652–665. https://doi.org/10.1057/s41274-016-0128-9
Djeundje, V. B., & Crook, J. (2018). Dynamic survival models with
varying coefficients for credit risks. International Journal of
Forecasting, 34(4), 636–649. https://doi.org/10.1016/j.ijforecast.2018.04.006
Dobbie, W., Liberman, A., Paravisini, D., & Pathania, V. (2021).
Measuring bias in consumer lending. Review of Economic Studies,
88(6), 2799–2832. https://doi.org/10.1093/restud/rdaa078
Dobbie, W., & Song, J. (2015). Debt relief and debtor outcomes:
Measuring the effects of consumer bankruptcy protection. American
Economic Review, 105(3), 1272–1311. https://doi.org/10.1257/aer.20130612
Doerr, S., Frost, J., Gambacorta, L., & Qiu, H. (2022). Fintech and
the digital transformation of financial services. BIS Working
Papers, (1008). https://www.bis.org/publ/work1008.htm
Dorfleitner, G., Priberny, C., Schuster, S., Stoiber, J., Weber, M.,
Castro, I. de, & Kammler, J. (2016). Description-text related soft
information in peer-to-peer lending: Evidence from two leading
European platforms. Journal of Banking and
Finance, 64, 169–187. https://doi.org/10.1016/j.jbankfin.2015.11.009
Doshi-Velez, F., & Kim, B. (2017). Towards a rigorous science of
interpretable machine learning. arXiv Preprint
arXiv:1702.08608.
Drineas, P., & Mahoney, M. W. (2005). On the
Nyström method for approximating a
Gram matrix for improved kernel-based learning. Journal
of Machine Learning Research, 6, 2153–2175.
Drummond, C., & Holte, R. C. (2003). C4.5, class
imbalance, and cost sensitivity: Why under-sampling beats
over-sampling.
Drummond, C., & Holte, R. C. (2006). Cost curves: An improved method
for visualizing classifier performance. Machine Learning,
65(1), 95–130. https://doi.org/10.1007/s10994-006-8199-5
Druz, M., Petzev, I., Wagner, A. F., & Zeckhauser, R. J. (2020).
When managers change their tone, analysts and investors change their
tune. Financial Analysts Journal, 76(2), 47–69. https://doi.org/10.1080/0015198X.2019.1707592
Duan, J.-C. (1994). Maximum likelihood estimation using price data of
the derivative contract. Mathematical Finance, 4(2),
155–167. https://doi.org/10.1111/j.1467-9965.1994.tb00055.x
Duan, J.-C., Gauthier, G., & Simonato, J.-G. (2004). On the
equivalence of the KMV and maximum likelihood methods for
structural credit risk models. Finance Research Letters,
1(3), 167–181. https://doi.org/10.1016/j.frl.2004.04.003
Duan, J.-C., Sun, J., & Wang, T. (2012). Multiperiod corporate
default prediction: A forward intensity approach. Journal of
Econometrics, 170(1), 191–209. https://doi.org/10.1016/j.jeconom.2012.05.002
Duarte, J., Siegel, S., & Young, L. (2012). Trust and credit: The
role of appearance in peer-to-peer lending. The Review of Financial
Studies, 25(8), 2455–2484. https://doi.org/10.1093/rfs/hhs071
Duffie, D., Eckner, A., Horel, G., & Saita, L. (2009b). Frailty
correlated default. The Journal of Finance, 64(5),
2089–2123. https://doi.org/10.1111/j.1540-6261.2009.01495.x
Duffie, D., Eckner, A., Horel, G., & Saita, L. (2009a). Frailty
correlated default. The Journal of Finance, 64(5),
2089–2123. https://doi.org/10.1111/j.1540-6261.2009.01495.x
Duffie, D., & Lando, D. (2001). Term structures of credit spreads
with incomplete accounting information. Econometrica,
69(3), 633–664. https://doi.org/10.1111/1468-0262.00208
Duffie, D., Saita, L., & Wang, K. (2007). Multi-period corporate
default prediction with stochastic covariates. Journal of Financial
Economics, 83(3), 635–665. https://doi.org/10.1016/j.jfineco.2005.10.011
Duffie, D., & Singleton, K. J. (1999a). Modeling term structures of
defaultable bonds. The Review of Financial Studies,
12(4), 687–720. https://doi.org/10.1093/rfs/12.4.687
Duffie, D., & Singleton, K. J. (1999b). Modeling term structures of
defaultable bonds. The Review of Financial Studies,
12(4), 687–720. https://doi.org/10.1093/rfs/12.4.687
Dumitrescu, E., Hué, S., Hurlin, C., & Tokpavi, S. (2022). Machine
learning for credit scoring: Improving logistic regression with
non-linear decision-tree effects. European Journal of Operational
Research, 297(3), 1178–1192. https://doi.org/10.1016/j.ejor.2021.06.053
Durand, D. (1941). Risk elements in consumer instalment
financing [NBER Studies in Consumer Instalment Financing]. (8). https://www.nber.org/books-and-chapters/risk-elements-consumer-instalment-financing
Dwork, C., Hardt, M., Pitassi, T., Reingold, O., & Zemel, R. (2012).
Fairness through awareness. Proceedings of the 3rd Innovations in
Theoretical Computer Science Conference, 214–226. https://doi.org/10.1145/2090236.2090255
Dwork, C., McSherry, F., Nissim, K., & Smith, A. (2006). Calibrating
noise to sensitivity in private data analysis. Proceedings of the
Third Conference on Theory of Cryptography (TCC), 265–284. https://doi.org/10.1007/11681878_14
Dwork, C., & Roth, A. (2014). The algorithmic foundations of
differential privacy. Foundations and Trends in Theoretical Computer
Science, 9(3-4), 211–407. https://doi.org/10.1561/0400000042
Dyer, T., Lang, M., & Stice-Lawrence, L. (2017). The evolution of
10-K textual disclosure: Evidence from Latent
Dirichlet Allocation. Journal of
Accounting and Economics, 64(2–3), 221–245. https://doi.org/10.1016/j.jacceco.2017.07.002
Eagle, N., Macy, M., & Claxton, R. (2010). Network diversity and
economic development. Science, 328(5981), 1029–1031.
https://doi.org/10.1126/science.1186605
Edelberg, W. (2006). Risk-based pricing of interest rates for consumer
loans. Journal of Monetary Economics, 53(8),
2283–2298. https://doi.org/10.1016/j.jmoneco.2005.10.018
Efron, B. (1975). The efficiency of logistic regression compared to
normal discriminant analysis. Journal of the American Statistical
Association, 70(352), 892–898. https://doi.org/10.2307/2285453
Efron, B. (1977). The efficiency of cox’s likelihood function for
censored data. Journal of the American Statistical Association,
72(359), 557–565. https://doi.org/10.2307/2286217
Efron, B. (1979). Bootstrap methods: Another look at the jackknife.
The Annals of Statistics, 7(1), 1–26. https://doi.org/10.1214/aos/1176344552
Efron, B. (1987). Better bootstrap confidence intervals. Journal of
the American Statistical Association, 82(397), 171–185. https://doi.org/10.2307/2289144
Efron, B., & Petrosian, V. (1999). Nonparametric methods for doubly
truncated data. Journal of the American Statistical
Association, 94(447), 824–834. https://doi.org/10.1080/01621459.1999.10474187
Efron, B., & Tibshirani, R. J. (1993). An introduction to the
bootstrap. Chapman; Hall/CRC. https://doi.org/10.1201/9780429246593
Efron, B., & Tibshirani, R. J. (1994). An introduction to the
bootstrap. Chapman; Hall/CRC. https://doi.org/10.1201/9780429246593
Egger, D. J., Gambella, C., Marecek, J., McFaddin, S., Mevissen, M.,
Raymond, R., Simonetto, A., Woerner, S., & Yndurain, E. (2020).
Quantum computing for finance: State-of-the-art and future prospects.
IEEE Transactions on Quantum Engineering, 1, 1–24. https://doi.org/10.1109/TQE.2020.3030314
Eisenberg, L., & Noe, T. H. (2001). Systemic risk in financial
systems. Management Science, 47(2), 236–249. https://doi.org/10.1287/mnsc.47.2.236.9835
Elhage, N., Nanda, N., Olsson, C., Henighan, T., Joseph, N., Mann, B.,
Askell, A., Bai, Y., Chen, A., Conerly, T., DasSarma, N., Drain, D.,
Ganguli, D., Hatfield-Dodds, Z., Hernandez, D., Jones, A., Kernion, J.,
Lovitt, L., Ndousse, K., … Olah, C. (2021). A mathematical framework for
transformer circuits. Transformer Circuits Thread. https://transformer-circuits.pub/2021/framework/index.html
Elkan, C. (2001). The foundations of cost-sensitive learning.
973–978.
Elkan, C. (2008). The foundations of cost-sensitive learning: An
overview. Invited Survey, UCSD Technical Report.
Elliott, M. N., Morrison, P. A., Fremont, A., McCaffrey, D. F., Pantoja,
P., & Lurie, N. (2009). Using the census bureau’s surname list to
improve estimates of race/ethnicity and associated disparities.
Health Services and Outcomes Research Methodology,
9(2), 69–83.
Elliott, M., Golub, B., & Jackson, M. O. (2014). Financial networks
and contagion. American Economic Review, 104(10),
3115–3153. https://doi.org/10.1257/aer.104.10.3115
Embrechts, P., McNeil, A. J., & Straumann, D. (2002).
Correlation and dependence in risk management: Properties and
pitfalls. 176–223. https://doi.org/10.1017/CBO9780511615337.008
Eom, Y. H., Helwege, J., & Huang, J.-Z. (2004). Structural models of
corporate bond pricing: An empirical analysis. The Review of
Financial Studies, 17(2), 499–544. https://doi.org/10.1093/rfs/hhg053
Equal Employment Opportunity Commission and others. (1978). Uniform
guidelines on employee selection procedures. 29 C.F.R. Part 1607.
European Banking Authority. (2017a). Guidelines on credit
institutions’ credit risk management practices and accounting for
expected credit losses (EBA/GL/2017/06). European Banking
Authority. https://www.eba.europa.eu/regulation-and-policy/accounting-and-auditing/guidelines-on-credit-institutions-credit-risk-management-practices-and-accounting-for-expected-credit-losses
European Banking Authority. (2017b). Guidelines on PD
estimation, LGD estimation and the treatment of defaulted
exposures (EBA/GL/2017/16). European Banking
Authority. https://www.eba.europa.eu/sites/default/files/documents/10180/2033363/6b062012-45d6-4655-af04-801d26493ed0/Guidelines\%20on\%20PD\%20and\%20LGD\%20estimation\%20\%28EBA-GL-2017-16\%29.pdf
European Banking Authority. (2017d). Guidelines on PD
estimation, LGD estimation and the treatment of defaulted
exposures (EBA/GL/2017/16).
European Banking Authority. (2017c). Guidelines on PD
estimation, LGD estimation and the treatment of defaulted
exposures (EBA/GL/2017/16). European Banking
Authority. https://www.eba.europa.eu/regulation-and-policy/credit-risk/guidelines-on-pd-estimation-lgd-estimation-and-treatment-of-defaulted-assets
European Banking Authority. (2019). Guidelines for the estimation of
LGD appropriate for an economic downturn
(EBA/GL/2019/03). European Banking Authority. https://www.eba.europa.eu/sites/default/files/documents/10180/2551996/Final\%20Report\%20on\%20Guidelines\%20on\%20the\%20estimation\%20of\%20LGD\%20appropriate\%20for\%20an\%20economic\%20downturn.pdf
European Banking Authority. (2021). Report on machine learning for
IRB models. European Banking Authority. https://www.eba.europa.eu/sites/default/files/document_library/Publications/Discussions/2021/Discussion\%20on\%20machine\%20learning\%20for\%20IRB\%20models/1023883/Discussion\%20paper\%20on\%20machine\%20learning\%20for\%20IRB\%20models.pdf
European Banking Authority. (2022). Report on the 2022 review of the
IRB approach (regulatory products). European Banking
Authority.
European Banking Authority. (2023a). 2023 EU-wide stress test
results. European Banking Authority. https://www.eba.europa.eu/risk-and-data-analysis/risk-analysis/eu-wide-stress-testing
European Banking Authority. (2023b). Follow-up report on the use of
machine learning for IRB models. EBA.
European Central Bank. (2019a). ECB guide to internal models
(TRIM). European Central Bank. https://www.bankingsupervision.europa.eu/ecb/pub/pdf/ssm.guidetointernalmodels_consolidated_201910.en.pdf
European Central Bank. (2019b). Guide to internal models: Credit
risk. European Central Bank. https://www.bankingsupervision.europa.eu/ecb/pub/pdf/ssm.guidetointernalmodels_consolidated_201910.en.pdf
European Central Bank. (2024). Supervisory expectations on the use
of artificial intelligence and machine learning in internal models.
European Central Bank.
European Data Protection Board. (2022). Guidelines 04/2022 on the
calculation of administrative fines under the GDPR. https://edpb.europa.eu/
European Parliament and Council. (2016a). Regulation
(EU) 2016/679 (GDPR). https://eur-lex.europa.eu/eli/reg/2016/679/oj
European Parliament and Council. (2016b). Regulation
(EU) 2016/679 (general data protection regulation).
Official Journal of the European Union L 119/1.
European Parliament and Council. (2024a). Regulation (EU) 2024/1689
laying down harmonised rules on artificial intelligence (artificial
intelligence act). Official Journal of the European Union. https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council. (2024b). Regulation
(EU) 2024/1689 laying down harmonised rules on artificial
intelligence (EU AI act).
European Parliament and Council. (2024c). Regulation (EU) 2024/1689
laying down harmonised rules on artificial intelligence (EU AI
Act). Official Journal of the European Union. https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council. (2024d). Regulation
(EU) 2024/1689 of 13 June 2024 laying down
harmonised rules on artificial intelligence (artificial intelligence
act). https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council. (2024e). Regulation
(EU) 2024/1689 on artificial intelligence (EU
AI act). https://eur-lex.europa.eu/eli/reg/2024/1689/oj
European Parliament and Council of the European Union. (2015).
Directive (EU) 2015/2366 on payment services in the internal market
(PSD2). Official Journal of the European Union. https://eur-lex.europa.eu/eli/dir/2015/2366/oj
Fader, P. S., & Hardie, B. G. S. (2007). How to project customer
retention. Journal of Interactive Marketing, 21(1),
76–90. https://doi.org/10.1002/dir.20074
Fader, P. S., & Hardie, B. G. S. (2010). Customer-base valuation in
a contractual setting: The perils of ignoring heterogeneity.
Marketing Science, 29(1), 85–93. https://doi.org/10.1287/mksc.1090.0507
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., & Lin, C.-J.
(2008). LIBLINEAR: A library for large linear
classification. Journal of Machine Learning Research,
9, 1871–1874.
Farewell, V. T. (1982). The use of mixture models for the analysis of
survival data with long-term survivors. Biometrics,
38(4), 1041–1046. https://doi.org/10.2307/2529885
Fayyad, U. M., & Irani, K. B. (1993). Multi-interval
discretization of continuous-valued attributes for classification
learning. 1022–1027.
Fedaseyeu, V. (2020). Debt collection agencies and the supply of
consumer credit. Journal of Financial Economics,
138(1), 193–221. https://doi.org/10.1016/j.jfineco.2020.04.009
Federal Home Loan Mortgage Corporation. (2024). Single-family
loan-level dataset. Freddie Mac Public Data Release.
Federal Housing Finance Agency. (2023). Fannie Mae and
Freddie Mac public single-family loan-level
datasets. Federal Housing Finance Agency. https://www.fhfa.gov/DataTools/Downloads
Federal National Mortgage Association. (2024). Single-family loan
performance data. Fannie Mae Data Dynamics.
Federal Republic of Brazil. (2011). Lei no. 12.414: Cadastro
positivo. Federal Law, as amended by Complementary Law 166/2019. https://www.planalto.gov.br/ccivil_03/_ato2011-2014/2011/lei/l12414.htm
Federal Republic of Brazil. (2018). Lei geral de protecao de dados
pessoais (LGPD), federal law no. 13,709. Presidency of
the Republic. https://www.gov.br/cidadania/pt-br/acesso-a-informacao/lgpd
Federal Trade Commission. (2024). Operation AI comply: Actions
against deceptive AI claims. FTC.
Feelders, A., & Pardoel, M. (2003). Pruning for monotone
classification trees. Lecture Notes in Computer Science,
2810, 1–12. https://doi.org/10.1007/978-3-540-45231-7_1
Feldman, M., Friedler, S. A., Moeller, J., Scheidegger, C., &
Venkatasubramanian, S. (2015). Certifying and removing disparate impact.
Proceedings of the 21st ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, 259–268. https://doi.org/10.1145/2783258.2783311
Fellegi, I. P., & Sunter, A. B. (1969). A theory for record linkage.
Journal of the American Statistical Association,
64(328), 1183–1210. https://doi.org/10.1080/01621459.1969.10501049
Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014).
Do we need hundreds of classifiers to solve real world classification
problems? Journal of Machine Learning Research, 15,
3133–3181.
Figlewski, S., Frydman, H., & Liang, W. (2012). Modeling the effect
of macroeconomic factors on corporate default and credit rating
transitions. International Review of Economics and Finance,
21(1), 87–105. https://doi.org/10.1016/j.iref.2011.05.004
Financial Accounting Standards Board. (2016). Financial instruments
- credit losses (topic 326). FASB.
Financial Conduct Authority. (2023). Recommendations for the next
phase of open banking in the UK. Financial Conduct
Authority. https://www.fca.org.uk/publications/corporate-documents/recommendations-next-phase-open-banking-uk
Fine, J. P., & Gray, R. J. (1999). A proportional hazards model for
the subdistribution of a competing risk. Journal of the American
Statistical Association, 94(446), 496–509. https://doi.org/10.1080/01621459.1999.10474144
Finlay, S. (2011). Multiple classifier architectures and their
application to credit risk assessment. European Journal of
Operational Research, 210(2), 368–378. https://doi.org/10.1016/j.ejor.2010.09.029
Firth, D. (1993). Bias reduction of maximum likelihood estimates.
Biometrika, 80(1), 27–38. https://doi.org/10.1093/biomet/80.1.27
Fisher, A., Rudin, C., & Dominici, F. (2019). All models are wrong,
but many are useful: Learning a variable’s importance by studying an
entire class of prediction models simultaneously. Journal of Machine
Learning Research, 20(177), 1–81.
Fisher, R. A. (1936). The use of multiple measurements in taxonomic
problems. Annals of Eugenics, 7(2), 179–188. https://doi.org/10.1111/j.1469-1809.1936.tb02137.x
Flesch, R. (1948). A new readability yardstick. Journal of Applied
Psychology, 32(3), 221–233. https://doi.org/10.1037/h0057532
Fok, D., Paap, R., & Franses, P. H. (2012). Modeling dynamic effects
of promotion on interpurchase times. Computational Statistics and
Data Analysis, 56(11), 3055–3069. https://doi.org/10.1016/j.csda.2011.02.004
Foote, C. L., Gerardi, K., & Willen, P. S. (2008). Negative equity
and foreclosure: Theory and evidence. Journal of Urban
Economics, 64(2), 234–245. https://doi.org/10.1016/j.jue.2008.07.006
Fortin, N., Lemieux, T., & Firpo, S. (2011). Decomposition methods
in economics. Handbook of Labor Economics, 4A, 1–102.
https://doi.org/10.1016/S0169-7218(11)00407-2
Frame, W. S., Srinivasan, A., & Woosley, L. (2001). The effect of
credit scoring on small-business lending. Journal of Money, Credit
and Banking, 33(3), 813–825. https://doi.org/10.2307/2673896
Franks, J., Serrano-Velarde, N., & Sussman, O. (2021). Marketplace
lending, information aggregation, and liquidity. The Review of
Financial Studies, 34(5), 2318–2361. https://doi.org/10.1093/rfs/hhaa101
Fredrikson, M., Jha, S., & Ristenpart, T. (2015). Model inversion
attacks that exploit confidence information and basic countermeasures.
Proceedings of the 22nd ACM SIGSAC Conference on Computer and
Communications Security (CCS), 1322–1333. https://doi.org/10.1145/2810103.2813677
Freedman, S., & Jin, G. Z. (2017). The information value of online
social networks: Lessons from peer-to-peer lending. International
Journal of Industrial Organization, 51, 185–222. https://doi.org/10.1016/j.ijindorg.2016.09.002
Freeman, L. C. (1977). A set of measures of centrality based on
betweenness. Sociometry, 40(1), 35–41. https://doi.org/10.2307/3033543
Freixas, X., Parigi, B. M., & Rochet, J.-C. (2000). Systemic risk,
interbank relations, and liquidity provision by the central bank.
Journal of Money, Credit and Banking, 32(3), 611–638.
https://doi.org/10.2307/2601198
Freund, Y., & Schapire, R. E. (1997a). A decision-theoretic
generalization of on-line learning and an application to boosting.
Journal of Computer and System Sciences, 55(1),
119–139. https://doi.org/10.1006/jcss.1997.1504
Freund, Y., & Schapire, R. E. (1997b). A decision-theoretic
generalization of on-line learning and an application to boosting.
Journal of Computer and System Sciences, 55(1),
119–139. https://doi.org/10.1006/jcss.1997.1504
Friedman, J. H. (1989). Regularized discriminant analysis. Journal
of the American Statistical Association, 84(405), 165–175.
https://doi.org/10.2307/2289860
Friedman, J. H. (2001). Greedy function approximation: A gradient
boosting machine. The Annals of Statistics, 29(5),
1189–1232. https://doi.org/10.1214/aos/1013203451
Friedman, J. H. (2002). Stochastic gradient boosting. Computational
Statistics and Data Analysis, 38(4), 367–378. https://doi.org/10.1016/S0167-9473(01)00065-2
Friedman, J. H., Hastie, T., & Tibshirani, R. (2000). Additive
logistic regression: A statistical view of boosting. The Annals of
Statistics, 28(2), 337–407. https://doi.org/10.1214/aos/1016218223
Friedman, J. H., & Popescu, B. E. (2008). Predictive learning via
rule ensembles. The Annals of Applied Statistics,
2(3), 916–954. https://doi.org/10.1214/07-AOAS148
Friedman, J., Hastie, T., & Tibshirani, R. (2010). Regularization
paths for generalized linear models via coordinate descent. Journal
of Statistical Software, 33(1), 1–22. https://doi.org/10.18637/jss.v033.i01
Friedman, M. (1937). The use of ranks to avoid the assumption of
normality implicit in the analysis of variance. Journal of the
American Statistical Association, 32(200), 675–701. https://doi.org/10.2307/2279372
Frost, J. (2020). The economic forces driving FinTech adoption
across countries (BIS Working Paper 838). Bank for International
Settlements. https://www.bis.org/publ/work838.htm
Frost, J., Gambacorta, L., Huang, Y., Shin, H. S., & Zbinden, P.
(2019). BigTech and the changing structure of financial intermediation.
Economic Policy, 34(100), 761–799. https://doi.org/10.1093/epolic/eiaa003
Frye, C., Rowat, C., & Feige, I. (2020). Asymmetric
Shapley values: Incorporating causal knowledge into
model-agnostic explainability.
Frye, J. (2000). Depressing recoveries. Risk Magazine,
13(11), 108–111.
Fuster, A., Goldsmith-Pinkham, P., Ramadorai, T., & Walther, A.
(2022b). Predictably unequal? The effects of machine learning on credit
markets. Journal of Finance, 77(1), 5–47. https://doi.org/10.1111/jofi.13090
Fuster, A., Goldsmith-Pinkham, P., Ramadorai, T., & Walther, A.
(2022a). Predictably unequal? The effects of machine learning on credit
markets. Journal of Finance, 77(1), 5–47. https://doi.org/10.1111/jofi.13090
Fuster, A., Hizmo, A., Lambie-Hanson, L., Vickery, J., & Willen, P.
S. (2021). How resilient is mortgage credit supply? Evidence from the
COVID-19 pandemic. Journal of Financial Economics,
143(2), 735–757. https://doi.org/10.1016/j.jfineco.2021.09.004
Fuster, A., Hizmo, A., Lambie-Hanson, L., Vickery, J., & Willen, P.
S. (2024). How resilient is mortgage credit supply? Evidence from the
COVID-19 pandemic. Journal of Finance. https://doi.org/10.3386/w28843
Fuster, A., Plosser, M., Schnabl, P., & Vickery, J. (2019a). The
role of technology in mortgage lending. The Review of Financial
Studies, 32(5), 1854–1899. https://doi.org/10.1093/rfs/hhz018
Fuster, A., Plosser, M., Schnabl, P., & Vickery, J. (2019b). The
role of technology in mortgage lending. Review of Financial
Studies, 32(5), 1854–1899. https://doi.org/10.1093/rfs/hhz018
Fuster, A., & Willen, P. S. (2017). Payment size, negative equity,
and mortgage default. American Economic Journal: Economic
Policy, 9(4), 167–191. https://doi.org/10.1257/pol.20150007
Gagliardini, P., & Gourieroux, C. (2013). Granularity adjustment for
risk measures: Systematic vs unsystematic risks. International
Journal of Approximate Reasoning, 54(6), 717–747. https://doi.org/10.1016/j.ijar.2013.02.001
Gai, P., & Kapadia, S. (2010). Contagion in financial networks.
Proceedings of the Royal Society A, 466(2120),
2401–2423. https://doi.org/10.1098/rspa.2009.0410
Gal, Y., & Ghahramani, Z. (2016). Dropout as a Bayesian
approximation: Representing model uncertainty in deep learning.
Proceedings of the 33rd International Conference on Machine Learning
(ICML), 1050–1059.
Gama, J., Medas, P., Castillo, G., & Rodrigues, P. (2004). Learning
with drift detection. Advances in Artificial Intelligence (SBIA
2004), Lecture Notes in Computer Science, 3171, 286–295.
https://doi.org/10.1007/978-3-540-28645-5_29
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia,
A. (2014a). A survey on concept drift adaptation. ACM Computing
Surveys, 46(4), 44. https://doi.org/10.1145/2523813
Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., & Bouchachia,
A. (2014b). A survey on concept drift adaptation. ACM Computing
Surveys, 46(4), 44:1–44:37. https://doi.org/10.1145/2523813
Gambacorta, L., Huang, Y., Li, Z., Qiu, H., & Chen, S. (2020).
Data vs collateral (BIS Working Paper 881). Bank for
International Settlements. https://www.bis.org/publ/work881.htm
Gambacorta, L., Huang, Y., Qiu, H., & Wang, J. (2024). How do
machine learning and non-traditional data affect credit scoring? New
evidence from a chinese fintech firm. Journal of Financial
Stability, 73, 101284. https://doi.org/10.1016/j.jfs.2024.101284
Ganin, Y., & Lempitsky, V. (2015). Unsupervised domain adaptation by
backpropagation. Proceedings of the 32nd International Conference on
Machine Learning (ICML), 1180–1189.
Ganong, P., & Noel, P. (2019). Consumer spending during
unemployment: Positive and normative implications. American Economic
Review, 109(7), 2383–2424. https://doi.org/10.1257/aer.20170537
Ganong, P., & Noel, P. (2020). Liquidity versus wealth in household
debt obligations: Evidence from housing policy in the Great
Recession. American Economic Review,
110(10), 3100–3138. https://doi.org/10.1257/aer.20181243
Gao, L., Madaan, A., Zhou, S., Alon, U., Liu, P., Yang, Y., Callan, J.,
& Neubig, G. (2023). PAL: Program-aided language
models. Proceedings of the 40th International Conference on Machine
Learning (ICML), 10764–10799.
Gao, Q., Lin, M., & Sias, R. (2023). Words matter: The role of texts
in online credit markets. Journal of Financial and Quantitative
Analysis, 58(1), 1–28. https://doi.org/10.1017/S0022109021000697
Garcia, D. (2013). Sentiment during recessions. The Journal of
Finance, 68(3), 1267–1300. https://doi.org/10.1111/jofi.12027
Garcı́a, S., & Herrera, F. (2008). An extension on “statistical
comparisons of classifiers over multiple data sets” for all
pairwise comparisons. Journal of Machine Learning Research,
9, 2677–2694.
Garza, A., Challu, C., & Mergenthaler-Canseco, M. (2024).
TimeGPT-1. arXiv:2310.03589. https://arxiv.org/abs/2310.03589
Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J. W., Wallach, H.,
Daumé III, H., & Crawford, K. (2021). Datasheets for datasets.
Communications of the ACM, 64, 86–92. https://doi.org/10.1145/3458723
Gelman, A., Jakulin, A., Pittau, M. G., & Su, Y.-S. (2008). A weakly
informative default prior distribution for logistic and other regression
models. The Annals of Applied Statistics, 2(4),
1360–1383. https://doi.org/10.1214/08-AOAS191
Genest, C., & Favre, A.-C. (2007). Everything you always wanted to
know about copula modeling but were afraid to ask. Journal of
Hydrologic Engineering, 12(4), 347–368. https://doi.org/10.1061/(ASCE)1084-0699(2007)12:4(347)
Gentzkow, M., Kelly, B., & Taddy, M. (2019). Text as data.
Journal of Economic Literature, 57(3), 535–574. https://doi.org/10.1257/jel.20181020
Gerardi, K., Herkenhoff, K. F., Ohanian, L. E., & Willen, P. S.
(2018). Can’t pay or won’t pay? Unemployment, negative equity, and
strategic default. The Review of Financial Studies,
31(3), 1098–1131. https://doi.org/10.1093/rfs/hhx115
Gerds, T. A., & Schumacher, M. (2006). Consistent estimation of the
expected Brier score in general survival models with
right-censored event times. Biometrical Journal,
48(6), 1029–1040. https://doi.org/10.1002/bimj.200610301
Geske, R. (1977). The valuation of corporate liabilities as compound
options. Journal of Financial and Quantitative Analysis,
12(4), 541–552. https://doi.org/10.2307/2330330
Geskus, R. B. (2011). Cause-specific cumulative incidence estimation and
the fine and gray model under both left truncation and right censoring.
Biometrics, 67(1), 39–49. https://doi.org/10.1111/j.1541-0420.2010.01420.x
Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized
trees. Machine Learning, 63(1), 3–42. https://doi.org/10.1007/s10994-006-6226-1
Ghent, A. C., & Kudlyak, M. (2011). Recourse and residential
mortgage default: Evidence from US states. The Review
of Financial Studies, 24(9), 3139–3186. https://doi.org/10.1093/rfs/hhr055
Ghorbani, A., Abid, A., & Zou, J. (2019). Interpretation of neural
networks is fragile. Proceedings of the AAAI Conference on
Artificial Intelligence, 33, 3681–3688. https://doi.org/10.1609/aaai.v33i01.33013681
Gibbs, I., & Candès, E. J. (2021). Adaptive conformal inference
under distribution shift. Advances in Neural Information Processing
Systems 34 (NeurIPS 2021).
Gillis, T. B. (2022). The input fallacy. Minnesota Law Review,
106, 1175–1263.
Gilmer, J., Schoenholz, S. S., Riley, P. F., Vinyals, O., & Dahl, G.
E. (2017). Neural message passing for quantum chemistry. Proceedings
of the 34th International Conference on Machine Learning (ICML),
1263–1272.
Glasserman, P., & Young, H. P. (2016). Contagion in financial
networks. Journal of Economic Literature, 54(3),
779–831. https://doi.org/10.1257/jel.20151228
Gneiting, T., & Raftery, A. E. (2007). Strictly proper scoring
rules, prediction, and estimation. Journal of the American
Statistical Association, 102(477), 359–378. https://doi.org/10.1198/016214506000001437
Goldfarb, A., & Tucker, C. (2011). Privacy regulation and online
advertising. Management Science, 57(1), 57–71. https://doi.org/10.1287/mnsc.1100.1246
Goldfarb, A., & Tucker, C. (2019). Digital economics. Journal of
Economic Literature, 57(1), 3–43. https://doi.org/10.1257/jel.20171452
Goldstein, A., Kapelner, A., Bleich, J., & Pitkin, E. (2015).
Peeking inside the black box: Visualizing statistical learning with
plots of individual conditional expectation. Journal of
Computational and Graphical Statistics, 24(1), 44–65. https://doi.org/10.1080/10618600.2014.907095
Goldstein, I., Jiang, W., & Karolyi, G. A. (2019). To
FinTech and beyond. Review of Financial Studies,
32(5), 1647–1661. https://doi.org/10.1093/rfs/hhz025
Golub, G. H., & Van Loan, C. F. (2013). Matrix computations
(4th ed.). Johns Hopkins University Press.
Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley,
D., Ozair, S., Courville, A., & Bengio, Y. (2020). Generative
adversarial nets. Communications of the ACM, 63(11),
139–144. https://doi.org/10.1145/3422622
Goodfellow, I. J., Shlens, J., & Szegedy, C. (2015). Explaining and
harnessing adversarial examples. International Conference on
Learning Representations (ICLR).
Goodman, B., & Flaxman, S. (2017). European Union
regulations on algorithmic decision-making and a “right to
explanation.” AI Magazine, 38(3), 50–57. https://doi.org/10.1609/aimag.v38i3.2741
Goodman-Bacon, A. (2021). Difference-in-differences with variation in
treatment timing. Journal of Econometrics, 225(2),
254–277. https://doi.org/10.1016/j.jeconom.2021.03.014
Gordy, M. B. (2003). A risk-factor model foundation for ratings-based
bank capital rules. Journal of Financial Intermediation,
12(3), 199–232. https://doi.org/10.1016/S1042-9573(03)00040-8
Gordy, M. B., & Lütkebohmert, E. (2013). Granularity adjustment for
regulatory capital assessment. International Journal of Central
Banking, 9(3), 38–77.
Gorishniy, Y., Rubachev, I., Khrulkov, V., & Babenko, A. (2021).
Revisiting deep learning models for tabular data. Advances in Neural
Information Processing Systems 34 (NeurIPS 2021).
Gourieroux, C., Monfort, A., Renault, E., & Trognon, A. (1987).
Generalised residuals. Journal of Econometrics,
34(1–2), 5–32. https://doi.org/10.1016/0304-4076(87)90065-0
Government of India. (2005). Credit information companies
(regulation) act, 2005 and CIC regulations, 2006. Act
No. 30 of 2005. https://www.rbi.org.in/
Government of India. (2023). Digital personal data protection act,
2023. Act No. 22 of 2023. https://www.meity.gov.in/content/digital-personal-data-protection-act-2023
Government of Vietnam. (2021a). Decision no. 942/QD-TTg
on e-government development and the research and pilot of virtual
currency based on blockchain technology. Prime Minister of Vietnam.
https://vanban.chinhphu.vn/
Government of Vietnam. (2021b). Decree
80/2021/ND-CP detailing and guiding
implementation of several articles of the law on support for small and
medium-sized enterprises. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2022). Decree
53/2022/ND-CP detailing the law on
cybersecurity. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2023a). Decree
13/2023/ND-CP on personal data
protection. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2023b). Decree no. 13/2023/ND-CP on personal
data protection. Government of the Socialist Republic of Vietnam.
https://vanbanphapluat.co/decree-13-2023-nd-cp-personal-data-protection
Government of Vietnam. (2023c). Resolution
33/NQ-CP on solutions to remove difficulties
for the real estate market and promote its safe, healthy, and
sustainable development. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2025a). Decree
94/2025/ND-CP on the controlled testing
mechanism for fintech activities in the banking sector. Hanoi. https://vanbanphapluat.co/
Government of Vietnam. (2025b). Decree no. 94/2025/ND-CP on the
controlled testing mechanism (Regulatory Sandbox) in the
banking sector. Official Gazette of the Socialist Republic of
Vietnam. https://vanban.chinhphu.vn/
Graf, E., Schmoor, C., Sauerbrei, W., & Schumacher, M. (1999).
Assessment and comparison of prognostic classification schemes for
survival data. Statistics in Medicine, 18(17-18),
2529–2545. https://doi.org/10.1002/(SICI)1097-0258(19990915/30)18:17/18<2529::AID-SIM274>3.0.CO;2-5
Grambsch, P. M., & Therneau, T. M. (1994). Proportional hazards
tests and diagnostics based on weighted residuals. Biometrika,
81(3), 515–526.
Gray, R. J. (1988). A class of K-sample tests for comparing the
cumulative incidence of a competing risk. Annals of Statistics,
16(3), 1141–1154. https://doi.org/10.1214/aos/1176350951
Green, P. J. (1984). Iteratively reweighted least squares for maximum
likelihood estimation, and some robust and resistant alternatives.
Journal of the Royal Statistical Society. Series B
(Methodological), 46(2), 149–192.
Greene, W. H. (2003). Econometric analysis (5th ed.). Prentice
Hall.
Greenwood, R., Hanson, S. G., Shleifer, A., & Sørensen, J. A.
(2022). Predictable financial crises. Journal of Finance,
77(2), 863–921. https://doi.org/10.1111/jofi.13105
Greer, C. C. (1967). The optimal credit acceptance policy. Journal
of Financial and Quantitative Analysis, 2(4), 399–415. https://doi.org/10.2307/2329825
Grembi, V., Nannicini, T., & Troiano, U. (2016). Do fiscal rules
matter? American Economic Journal: Applied Economics,
8(3), 1–30. https://doi.org/10.1257/app.20150076
Griffin, J. M., & Tang, D. Y. (2012). Did subjectivity play a role
in CDO credit ratings? Journal of Finance,
67(4), 1293–1328. https://doi.org/10.1111/j.1540-6261.2012.01748.x
Grinsztajn, L., Oyallon, E., & Varoquaux, G. (2022b). Why do
tree-based models still outperform deep learning on typical tabular
data?
Grinsztajn, L., Oyallon, E., & Varoquaux, G. (2022a). Why do
tree-based models still outperform deep learning on typical tabular
data? Advances in Neural Information Processing Systems 35
(NeurIPS), 507–520.
Gross, D. B., & Souleles, N. S. (2002). Do liquidity constraints and
interest rates matter for consumer behavior? Evidence from credit card
data. Quarterly Journal of Economics, 117(1), 149–185.
https://doi.org/10.1162/003355302753399472
Grover, A., & Leskovec, J. (2016). node2vec: Scalable feature
learning for networks. Proceedings of the 22nd ACM SIGKDD
International Conference on Knowledge Discovery and Data Mining,
855–864. https://doi.org/10.1145/2939672.2939754
GSMA. (2023). The state of the industry report on mobile money
2023. GSM Association. https://www.gsma.com/sotir/
Gunnarsson, B. R., Broucke, S. vanden, Baesens, B., Óskarsdóttir, M.,
& Lemahieu, W. (2021). Deep learning for credit scoring: Do or
don’t? European Journal of Operational Research,
295(1), 292–305. https://doi.org/10.1016/j.ejor.2021.03.006
Gunning, R. (1952). The technique of clear writing.
McGraw-Hill.
Guo, C., Pleiss, G., Sun, Y., & Weinberger, K. Q. (2017). On
calibration of modern neural networks. 1321–1330.
Gupton, G. M., Finger, C. C., & Bhatia, M. (1997).
CreditMetrics technical document. J.P. Morgan &
Co. https://www.msci.com/documents/1296102/1636401/CreditMetricsTechnicalDoc.pdf
Guyon, I., & Elisseeff, A. (2003). An introduction to variable and
feature selection. Journal of Machine Learning Research,
3, 1157–1182.
Hahn, J., Todd, P., & Klaauw, W. van der. (2001). Identification and
estimation of treatment effects with a regression-discontinuity design.
Econometrica, 69(1), 201–209. https://doi.org/10.1111/1468-0262.00183
Haldane, A. G., & May, R. M. (2011). Systemic risk in banking
ecosystems. Nature, 469(7330), 351–355. https://doi.org/10.1038/nature09659
Hale, G., Kapan, T., & Minoiu, C. (2020). Shock transmission through
cross-border bank lending: Credit and real effects. The Review of
Financial Studies, 33(10), 4839–4882. https://doi.org/10.1093/rfs/hhz147
Hall, P. (1988). On symmetric bootstrap confidence intervals.
Journal of the Royal Statistical Society: Series B
(Methodological), 50(1), 35–45.
Hamilton, W. L., Ying, R., & Leskovec, J. (2017). Inductive
representation learning on large graphs. Advances in Neural
Information Processing Systems 30 (NIPS 2017).
Han, H., Wang, W.-Y., & Mao, B.-H. (2005).
Borderline-SMOTE: A new over-sampling method in imbalanced
data sets learning. Advances in Intelligent Computing (ICIC 2005),
Lecture Notes in Computer Science, 3644, 878–887. https://doi.org/10.1007/11538059_91
Han, P. (2014). Multiply robust estimation in regression analysis with
missing data. Journal of the American Statistical Association,
109(507), 1159–1173. https://doi.org/10.1080/01621459.2014.880058
Han, P., & Wang, L. (2013). Estimation with missing data: Beyond
double robustness. Biometrika, 100(2), 417–430. https://doi.org/10.1093/biomet/ass087
Hand, D. J. (2006). Classifier technology and the illusion of progress.
Statistical Science, 21(1), 1–14. https://doi.org/10.1214/088342306000000060
Hand, D. J. (2009). Measuring classifier performance: A coherent
alternative to the area under the ROC curve. Machine
Learning, 77(1), 103–123. https://doi.org/10.1007/s10994-009-5119-5
Hand, D. J., & Adams, N. M. (2000). Defining attributes for
scorecard construction in credit scoring. Journal of Applied
Statistics, 27(5), 527–540. https://doi.org/10.1080/02664760050076371
Hand, D. J., & Anagnostopoulos, C. (2013). When is the area under
the receiver operating characteristic curve an appropriate measure of
classifier performance? Pattern Recognition Letters,
34(5), 492–495. https://doi.org/10.1016/j.patrec.2012.12.004
Hand, D. J., & Henley, W. E. (1997a). Statistical classification
methods in consumer credit scoring: A review. Journal of the Royal
Statistical Society. Series A (Statistics in Society),
160(3), 523–541. https://doi.org/10.1111/j.1467-985X.1997.00078.x
Hand, D. J., & Henley, W. E. (1997b). Statistical classification
methods in consumer credit scoring: A review. Journal of the Royal
Statistical Society: Series A, 160(3), 523–541. https://doi.org/10.1111/j.1467-985X.1997.00078.x
Hand, D. J., & Till, R. J. (2001). A simple generalisation of the
area under the ROC curve for multiple class classification
problems. Machine Learning, 45(2), 171–186. https://doi.org/10.1023/A:1010920819831
Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the
area under a receiver operating characteristic (ROC) curve.
Radiology, 143(1), 29–36. https://doi.org/10.1148/radiology.143.1.7063747
Hansen, S., McMahon, M., & Prat, A. (2018). Transparency and
deliberation within the FOMC: A computational linguistics
approach. The Quarterly Journal of Economics, 133(2),
801–870. https://doi.org/10.1093/qje/qjx045
Hardt, M., Price, E., & Srebro, N. (2016). Equality of opportunity
in supervised learning. Advances in Neural Information Processing
Systems 29 (NIPS 2016).
Hardy, S., Henecka, W., Ivey-Law, H., Nock, R., Patrini, G., Smith, G.,
& Thorne, B. (2017). Private federated learning on vertically
partitioned data via entity resolution and additively homomorphic
encryption. NeurIPS Workshop on Privacy-Preserving Machine
Learning.
Harrell, F. E., Califf, R. M., Pryor, D. B., Lee, K. L., & Rosati,
R. A. (1982). Evaluating the yield of medical tests. Journal of the
American Medical Association, 247(18), 2543–2546. https://doi.org/10.1001/jama.1982.03320430047030
Harrell, F. E., Lee, K. L., & Mark, D. B. (1996). Multivariable
prognostic models: Issues in developing models, evaluating assumptions
and adequacy, and measuring and reducing errors. Statistics in
Medicine, 15(4), 361–387. https://doi.org/10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4
Harris, T. (2013). Quantitative credit risk assessment using support
vector machines: Broad versus narrow default definitions. Expert
Systems with Applications, 40(11), 4404–4413. https://doi.org/10.1016/j.eswa.2013.01.044
Harrison, J. M., & Kreps, D. M. (1979). Martingales and arbitrage in
multiperiod securities markets. Journal of Economic Theory,
20(3), 381–408. https://doi.org/10.1016/0022-0531(79)90043-7
Harrison, J. M., & Pliska, S. R. (1981). Martingales and stochastic
integrals in the theory of continuous trading. Stochastic Processes
and Their Applications, 11(3), 215–260. https://doi.org/10.1016/0304-4149(81)90026-0
Hart, P. E. (1968). The condensed nearest neighbor rule. IEEE
Transactions on Information Theory, 14(3), 515–516. https://doi.org/10.1109/TIT.1968.1054155
Hashimoto, T. B., Srivastava, M., Namkoong, H., & Liang, P. (2018).
Fairness without demographics in repeated loss minimization.
Proceedings of the 35th International Conference on Machine Learning
(ICML).
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements
of statistical learning. https://doi.org/10.1007/978-0-387-84858-7
Hau, H., Huang, Y., Lin, C., Shan, H., Sheng, Z., & Wei, L. (2024).
FinTech credit and entrepreneurial growth. Journal of
Finance, 79(5), 3309–3359. https://doi.org/10.1111/jofi.13384
Hau, H., Huang, Y., Shan, H., & Sheng, Z. (2019). How
FinTech enters China’s credit market. AEA
Papers and Proceedings, 109, 60–64. https://doi.org/10.1257/pandp.20191012
Hausman, C., & Rapson, D. S. (2018). Regression discontinuity in
time: Considerations for empirical applications. Annual Review of
Resource Economics, 10, 533–552. https://doi.org/10.1146/annurev-resource-121517-033306
Hauswald, R., & Marquez, R. (2006). Competition and strategic
information acquisition in credit markets. The Review of Financial
Studies, 19(3), 967–1000. https://doi.org/10.1093/rfs/hhj021
Havlı́ček, V., Córcoles, A. D., Temme, K., Harrow, A. W., Kandala, A.,
Chow, J. M., & Gambetta, J. M. (2019). Supervised learning with
quantum-enhanced feature spaces. Nature, 567(7747),
209–212. https://doi.org/10.1038/s41586-019-0980-2
Havrylchyk, O., Mariotto, C., Rahim, T., & Verdier, M. (2020). The
expansion of peer-to-peer lending. The Review of Network
Economics, 19(3), 145–187. https://doi.org/10.1515/rne-2020-0033
He, H., Bai, Y., Garcia, E. A., & Li, S. (2008).
ADASYN: Adaptive synthetic sampling approach for imbalanced
learning. Proceedings of the IEEE International Joint Conference on
Neural Networks (IJCNN), 1322–1328. https://doi.org/10.1109/IJCNN.2008.4633969
He, H., & Garcia, E. A. (2009). Learning from imbalanced data.
IEEE Transactions on Knowledge and Data Engineering,
21(9), 1263–1284. https://doi.org/10.1109/TKDE.2008.239
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning
for image recognition. Proceedings of the IEEE Conference on
Computer Vision and Pattern Recognition (CVPR), 770–778. https://doi.org/10.1109/CVPR.2016.90
He, Z., Huang, J., & Zhou, J. (2023). Open banking: Credit market
competition when borrowers own the data. Journal of Financial
Economics, 147(2), 449–474. https://doi.org/10.1016/j.jfineco.2022.12.003
Heckman, J. J. (1974). Shadow prices, market wages, and labor supply.
Econometrica, 42(4), 679–694. https://doi.org/10.2307/1913937
Heckman, J. J. (1976). The common structure of statistical models of
truncation, sample selection and limited dependent variables and a
simple estimator for such models. Annals of Economic and Social
Measurement, 5(4), 475–492.
Heckman, J. J. (1979). Sample selection bias as a specification error.
Econometrica, 47(1), 153–161. https://doi.org/10.2307/1912352
Helsen, K., & Schmittlein, D. C. (1993). Analyzing duration times in
marketing: Evidence for the effectiveness of hazard rate models.
Marketing Science, 12(4), 395–414. https://doi.org/10.1287/mksc.12.4.395
Hi! PARIS Center. (2024). XPER: eXplainable
PERformance (Python package). https://github.com/hi-paris/XPER
Hillegeist, S. A., Keating, E. K., Cram, D. P., & Lundstedt, K. G.
(2004). Assessing the probability of bankruptcy. Review of
Accounting Studies, 9(1), 5–34. https://doi.org/10.1023/B:RAST.0000013627.90884.b7
Hinkley, D. V. (1971). Inference about the change-point from cumulative
sum tests. Biometrika, 58(3), 509–523. https://doi.org/10.2307/2334386
Hirano, K., Imbens, G. W., & Ridder, G. (2003). Efficient estimation
of average treatment effects using the estimated propensity score.
Econometrica, 71(4), 1161–1189. https://doi.org/10.1111/1468-0262.00442
Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion
probabilistic models. Advances in Neural Information Processing
Systems 33 (NeurIPS).
Hoberg, G., & Phillips, G. (2016). Text-based network industries and
endogenous product differentiation. Journal of Political
Economy, 124(5), 1423–1465. https://doi.org/10.1086/688176
Hobson, J. L., Mayew, W. J., & Venkatachalam, M. (2012). Analyzing
speech to detect financial misreporting. Journal of Accounting
Research, 50(2), 349–392. https://doi.org/10.1111/j.1475-679X.2011.00433.x
Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory.
Neural Computation, 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Hodges, J. L., & Lehmann, E. L. (1962). Rank methods for combination
of independent experiments in analysis of variance. The Annals of
Mathematical Statistics, 33(2), 482–497. https://doi.org/10.1214/aoms/1177704575
Hoeting, J. A., Madigan, D., Raftery, A. E., & Volinsky, C. T.
(1999). Bayesian model averaging: A tutorial. Statistical
Science, 14(4), 382–417. https://doi.org/10.1214/ss/1009212519
Hofert, M., Kojadinovic, I., Mächler, M., & Yan, J. (2018).
Elements of copula modeling with R. Use
R! https://doi.org/10.1007/978-3-319-89635-9
Hofmann, H. (1994). Statlog (german credit data). UCI Machine
Learning Repository. https://doi.org/10.24432/C5NC77
Holford, T. R. (1983). The estimation of age, period and cohort effects
for vital rates. Biometrics, 39(2), 311–324. https://doi.org/10.2307/2531004
Holland, P. W. (1986). Statistics and causal inference. Journal of
the American Statistical Association, 81(396), 945–960. https://doi.org/10.2307/2289064
Holm, S. (1979). A simple sequentially rejective multiple test
procedure. Scandinavian Journal of Statistics, 6(2),
65–70.
Holmstrom, B. (1979). Moral hazard and observability. The Bell
Journal of Economics, 10(1), 74–91. https://doi.org/10.2307/3003320
Home Credit Group. (2018). Home credit default risk. Kaggle
Competition.
Home Credit Vietnam Finance Company Limited. (2023). Annual report
2023. Ho Chi Minh City. https://www.homecredit.vn/
Hooker, S., Erhan, D., Kindermans, P.-J., & Kim, B. (2019). A
benchmark for interpretability methods in deep neural networks.
Advances in Neural Information Processing Systems 32 (NeurIPS
2019).
Horn, R. A., & Johnson, C. R. (2012). Matrix analysis (2nd
ed.). Cambridge University Press. https://doi.org/10.1017/CBO9781139020411
Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer
feedforward networks are universal approximators. Neural
Networks, 2(5), 359–366. https://doi.org/10.1016/0893-6080(89)90020-8
Horvitz, D. G., & Thompson, D. J. (1952). A generalization of
sampling without replacement from a finite universe. Journal of the
American Statistical Association, 47(260), 663–685. https://doi.org/10.1080/01621459.1952.10483446
Hosmer, D. W., & Lemesbow, S. (1980). Goodness of fit tests for the
multiple logistic regression model. Communications in
Statistics-Theory and Methods, 9(10), 1043–1069.
Hosmer, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied
logistic regression.
Hothorn, T., Hornik, K., & Zeileis, A. (2006). Unbiased recursive
partitioning: A conditional inference framework. Journal of
Computational and Graphical Statistics, 15(3), 651–674. https://doi.org/10.1198/106186006X133933
Houlsby, N., Giurgiu, A., Jastrzebski, S., Morrone, B., De Laroussilhe,
Q., Gesmundo, A., Attariyan, M., & Gelly, S. (2019).
Parameter-efficient transfer learning for NLP.
Proceedings of the 36th International Conference on Machine Learning
(ICML), 2790–2799.
Howell, S. T., Kuchler, T., Snitkof, D., Stroebel, J., & Wong, J.
(2024). Lender automation and racial disparities in credit access.
The Journal of Finance, 79(2), 1457–1512. https://doi.org/10.1111/jofi.13303
Hsia, D. C. (1978). Credit scoring and the equal credit opportunity act.
Hastings Law Journal, 30(2), 371–448.
Hsieh, C.-J., Chang, K.-W., Lin, C.-J., Keerthi, S. S., &
Sundararajan, S. (2008). A dual coordinate descent method for
large-scale linear SVM. 408–415. https://doi.org/10.1145/1390156.1390208
Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang,
L., & Chen, W. (2022). LoRA: Low-rank adaptation of
large language models. International Conference on Learning
Representations (ICLR).
Hu, X., Rudin, C., & Seltzer, M. (2019). Optimal sparse decision
trees.
Huang, A. H., Wang, H., & Yang, Y. (2023). FinBERT: A
large language model for extracting information from financial text.
Contemporary Accounting Research, 40(2), 806–841. https://doi.org/10.1111/1911-3846.12832
Huang, C.-L., Chen, M.-C., & Wang, C.-J. (2007). Credit scoring with
a data mining approach based on support vector machines. Expert
Systems with Applications, 33(4), 847–856. https://doi.org/10.1016/j.eswa.2006.07.007
Huang, H.-Y., Broughton, M., Cotler, J., Chen, S., Li, J., Mohseni, M.,
Neven, H., Babbush, R., Kueng, R., Preskill, J., & McClean, J. R.
(2022). Quantum advantage in learning from experiments.
Science, 376(6598), 1182–1186. https://doi.org/10.1126/science.abn7293
Huang, J., Smola, A. J., Gretton, A., Borgwardt, K. M., & Schölkopf,
B. (2007). Correcting sample selection bias by unlabeled data.
Advances in Neural Information Processing Systems (NeurIPS),
19.
Huang, J.-Z., & Huang, M. (2012). How much of the corporate-treasury
yield spread is due to credit risk? The Review of Asset Pricing
Studies, 2(2), 153–202. https://doi.org/10.1093/rapstu/ras011
Huang, X., Khetan, A., Cvitkovic, M., & Karnin, Z. (2020).
TabTransformer: Tabular data modeling using contextual
embeddings. arXiv Preprint arXiv:2012.06678.
Huang, Y., Zhang, L., Li, Z., Qiu, H., Sun, T., & Wang, X. (2020).
Fintech credit risk assessment for SMEs: Evidence from
China. IMF Working Paper, (20/193). https://www.imf.org/en/Publications/WP/Issues/2020/09/25/Fintech-Credit-Risk-Assessment-for-SMEs-Evidence-from-China-49742
Hué, S., Hurlin, C., Pérignon, C., & Saurin, S. (2022). Measuring
the driving forces of predictive performance: Application to credit
scoring. arXiv Preprint arXiv:2212.05866.
Hull, J. C., & White, A. (2013). LIBOR vs.
OIS: The derivatives discounting dilemma. Journal of
Investment Management, 11(3), 14–27.
Hurley, M., & Adebayo, J. (2016). Credit scoring in the era of big
data. Yale Journal of Law and Technology, 18, 148–216.
Hurlin, C., Pérignon, C., & Saurin, S. (2026). The fairness of
credit scoring models. Management Science, 72(1),
406–425.
IEEE Computational Intelligence Society, & Corporation, V. (2019).
IEEE-CIS fraud detection. Kaggle Competition.
Iman, R. L., & Davenport, J. M. (1980). Approximations of the
critical region of the Friedman statistic.
Communications in Statistics - Theory and Methods,
9(6), 571–595. https://doi.org/10.1080/03610928008827904
Imbens, G. W., & Angrist, J. D. (1994). Identification and
estimation of local average treatment effects. Econometrica,
62(2), 467–475. https://doi.org/10.2307/2951620
Imbens, G. W., & Lemieux, T. (2008). Regression discontinuity
designs: A guide to practice. Journal of Econometrics,
142(2), 615–635. https://doi.org/10.1016/j.jeconom.2007.05.001
Imbens, G. W., & Rubin, D. B. (2015). Causal inference for
statistics, social, and biomedical sciences: An introduction.
Cambridge University Press. https://doi.org/10.1017/CBO9781139025751
Indarte, S. (2023). Moral hazard versus liquidity in household
bankruptcy. Journal of Finance, 78(5), 2421–2464. https://doi.org/10.1111/jofi.13263
International Accounting Standards Board. (2014). IFRS
9: Financial instruments. IFRS Foundation.
International Finance Corporation. (2019). MSME finance
gap: Viet nam country profile. International Finance Corporation.
https://www.ifc.org/en/what-we-do/sector-expertise/financial-institutions/msme-finance
International Monetary Fund. (2019). Vietnam: Financial sector
assessment program, technical note on systemic risk analysis and stress
testing (IMF Country Report 19/373). International Monetary Fund.
https://www.imf.org/en/Publications/CR/Issues/2019/12/13/Vietnam-Financial-Sector-Assessment-Program-48885
International Monetary Fund. (2023a). Fintech and financial
inclusion in low-income countries (IMF Departmental Paper
DP/2023/004). International Monetary Fund. https://www.imf.org/en/Publications/Departmental-Papers-Policy-Papers/Issues/2023/06/23/Fintech-and-Financial-Inclusion-in-Low-Income-Countries-534832
International Monetary Fund. (2023b). Vietnam: 2023 article
IV consultation, IMF country report no.
23/352. International Monetary Fund. https://www.imf.org/en/Publications/CR/Issues/2023/10/10/Vietnam-2023-Article-IV-Consultation
International Monetary Fund. (2024). Vietnam: 2024 article
IV consultation – press release; staff report; and
statement by the executive director for vietnam, IMF
country report no. 24/306. International Monetary Fund. https://www.imf.org/en/publications/cr/issues/2024/09/27/vietnam-2024-article-iv-consultation-press-release-staff-report-and-statement-by-the-555679
Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating
deep network training by reducing internal covariate shift.
Proceedings of the 32nd International Conference on Machine Learning
(ICML), 448–456.
Ishwaran, H., Kogalur, U. B., Blackwell, E. H., & Lauer, M. S.
(2008). Random survival forests. The Annals of Applied
Statistics, 2(3), 841–860. https://doi.org/10.1214/08-AOAS169
Israel, R. B., Rosenthal, J. S., & Wei, J. Z. (2001). Finding
generators for Markov chains via empirical transition
matrices, with applications to credit ratings. Mathematical
Finance, 11(2), 245–265. https://doi.org/10.1111/1467-9965.00114
Ivanov, I. T., Kruttli, M. S., & Watugala, S. W. (2024). Banking on
carbon: Corporate lending and cap-and-trade policy. Review of
Financial Studies, 37(5), 1640–1684. https://doi.org/10.1093/rfs/hhad080
Iyer, R., Khwaja, A. I., Luttmer, E. F. P., & Shue, K. (2016).
Screening peers softly: Inferring the quality of small borrowers.
Management Science, 62(6), 1554–1577. https://doi.org/10.1287/mnsc.2015.2181
Iyer, R., & Peydro, J.-L. (2011). Interbank contagion at work:
Evidence from a natural experiment. The Review of Financial
Studies, 24(4), 1337–1377. https://doi.org/10.1093/rfs/hhp105
Jack, W., & Suri, T. (2014). Risk sharing and transactions costs:
Evidence from Kenya’s mobile money revolution. American
Economic Review, 104(1), 183–223. https://doi.org/10.1257/aer.104.1.183
Jaffee, D. M., & Russell, T. (1976). Imperfect information,
uncertainty, and credit rationing. The Quarterly Journal of
Economics, 90(4), 651–666. https://doi.org/10.2307/1885327
Jäger, S., Allhorn, A., & Bießmann, F. (2021). A benchmark for data
imputation methods. Frontiers in Big Data, 4, 693674.
https://doi.org/10.3389/fdata.2021.693674
Jagtiani, J., & Lemieux, C. (2019). The roles of alternative data
and machine learning in fintech lending: Evidence from the
LendingClub consumer platform. Financial
Management, 48(4), 1009–1029. https://doi.org/10.1111/fima.12295
Jain, D. C., & Vilcassim, N. J. (1991). Investigating household
purchase timing decisions: A conditional hazard function approach.
Marketing Science, 10(1), 1–23. https://doi.org/10.1287/mksc.10.1.1
Jain, S., & Wallace, B. C. (2019). Attention is not explanation.
Proceedings of NAACL-HLT, 3543–3556. https://doi.org/10.18653/v1/N19-1357
Janakiraman, R., Lim, J. H., & Rishika, R. (2018). The effect of a
data breach announcement on customer behavior: Evidence from a
multichannel retailer. Journal of Marketing, 82(2),
85–105. https://doi.org/10.1509/jm.16.0124
Janzing, D., Minorics, L., & Blöbaum, P. (2020). Feature relevance
quantification in explainable AI: A causal problem.
Proceedings of the 23rd International Conference on Artificial
Intelligence and Statistics (AISTATS), 2907–2916.
Jarrett, D., Cebere, B. C., Liu, T., Curth, A., & Schaar, M. van
der. (2022). HyperImpute: Generalized iterative imputation
with automatic model selection. Proceedings of the 39th
International Conference on Machine Learning (ICML).
Jarrow, R. A., Lando, D., & Turnbull, S. M. (1997). A
Markov model for the term structure of credit risk spreads.
The Review of Financial Studies, 10(2), 481–523. https://doi.org/10.1093/rfs/10.2.481
Jarrow, R. A., & Turnbull, S. M. (1995). Pricing derivatives on
financial securities subject to credit risk. The Journal of
Finance, 50(1), 53–85. https://doi.org/10.1111/j.1540-6261.1995.tb05167.x
Jegadeesh, N., & Wu, D. (2013). Word power: A new approach for
content analysis. Journal of Financial Economics,
110(3), 712–729. https://doi.org/10.1016/j.jfineco.2013.08.018
Jethani, N., Sudarshan, M., Covert, I., Lee, S.-I., & Ranganath, R.
(2022). FastSHAP: Real-time Shapley value
estimation. International Conference on Learning Representations
(ICLR).
Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., Ishii, E., Bang,
Y., Madotto, A., & Fung, P. (2023). Survey of hallucination in
natural language generation. ACM Computing Surveys,
55, 1–38. https://doi.org/10.1145/3571730
Joe, H. (2014). Dependence modeling with copulas. Chapman;
Hall/CRC. https://doi.org/10.1201/b17116
Johnson, G. A., Shriver, S. K., & Goldberg, S. G. (2023). Privacy
and market concentration: Intended and unintended consequences of the
GDPR. Management Science, 69(10),
5695–5721. https://doi.org/10.1287/mnsc.2023.4709
Jones, C. I., & Tonetti, C. (2020). Nonrivalry and the economics of
data. American Economic Review, 110(9), 2819–2858. https://doi.org/10.1257/aer.20191330
Jones, E. P., Mason, S. P., & Rosenfeld, E. (1984). Contingent
claims analysis of corporate capital structures: An empirical
investigation. The Journal of Finance, 39(3), 611–625.
https://doi.org/10.2307/2327919
Jordon, J., Szpruch, L., Houssiau, F., Bottarelli, M., Cherubin, G.,
Maple, C., Cohen, S. N., & Weller, A. (2022). Synthetic data - what,
why and how? The Royal Society Report (Commissioned by The Alan
Turing Institute).
Jordon, J., Yoon, J., & Schaar, M. van der. (2019).
PATE-GAN: Generating synthetic data with differential
privacy guarantees. International Conference on Learning
Representations (ICLR).
Kairouz, P., McMahan, H. B., Avent, B., Bellet, A., Bennis, M., Bhagoji,
A. N., Bonawitz, K., Charles, Z., Cormode, G., Cummings, R., et al.
(2021). Advances and open problems in federated learning.
Foundations and Trends in Machine Learning, 14(1-2),
1–210. https://doi.org/10.1561/2200000083
Kalemli-Özcan, Ş., Di Giovanni, J., Silva, Á., & Yildirim, M. A.
(2022). Global supply chain pressures, international trade, and
inflation. NBER Working Paper, (30240). https://www.nber.org/papers/w30240
Kamiran, F., & Calders, T. (2012). Data preprocessing techniques for
classification without discrimination. Knowledge and Information
Systems, 33, 1–33. https://doi.org/10.1007/s10115-011-0463-8
Kang, J. D. Y., & Schafer, J. L. (2007). Demystifying double
robustness: A comparison of alternative strategies for estimating a
population mean from incomplete data. Statistical Science,
22(4), 523–539. https://doi.org/10.1214/07-STS227
Kantorovich, L. V. (1960). Mathematical methods of organizing and
planning production. Management Science, 6(4),
366–422. https://doi.org/10.1287/mnsc.6.4.366
Kaplan, E. L., & Meier, P. (1958). Nonparametric estimation from
incomplete observations. Journal of the American Statistical
Association, 53(282), 457–481. https://doi.org/10.2307/2281868
Karakoulas, G. (2004). Empirical validation of retail credit-scoring
models. RMA Journal, 87(1), 56–60.
Karimi, A.-H., Barthe, G., Balle, B., & Valera, I. (2020).
Model-agnostic counterfactual explanations for consequential decisions.
Proceedings of the 23rd International Conference on Artificial
Intelligence and Statistics (AISTATS), 895–905.
Karimi, A.-H., Barthe, G., Schölkopf, B., & Valera, I. (2022). A
survey of algorithmic recourse: Contrastive explanations and
consequential recommendations. ACM Computing Surveys,
55(5), 1–29. https://doi.org/10.1145/3527848
Karlan, D., McConnell, M., Mullainathan, S., & Zinman, J. (2016).
Getting to the top of mind: How reminders increase saving.
Management Science, 62(12), 3393–3411. https://doi.org/10.1287/mnsc.2015.2296
Karlan, D., Mobius, M., Rosenblat, T., & Szeidl, A. (2009). Trust
and social collateral. The Quarterly Journal of Economics,
124(3), 1307–1361. https://doi.org/10.1162/qjec.2009.124.3.1307
Karlan, D., & Zinman, J. (2009). Observing unobservables:
Identifying information asymmetries with a consumer credit field
experiment. Econometrica, 77(6), 1993–2008. https://doi.org/10.3982/ECTA5781
Karlan, D., & Zinman, J. (2010). Expanding credit access: Using
randomized supply decisions to estimate the impacts. Review of
Financial Studies, 23(1), 433–464. https://doi.org/10.1093/rfs/hhp092
Katz, L. (1953). A new status index derived from sociometric analysis.
Psychometrika, 18(1), 39–43. https://doi.org/10.1007/BF02289026
Katzman, J. L., Shaham, U., Cloninger, A., Bates, J., Jiang, T., &
Kluger, Y. (2018). DeepSurv: Personalized treatment
recommender system using a Cox proportional hazards deep
neural network. BMC Medical Research Methodology,
18(1), 24. https://doi.org/10.1186/s12874-018-0482-1
Kau, J. B., Keenan, D. C., Muller, W. J., & Epperson, J. F. (1992).
A generalized valuation model for fixed-rate residential mortgages.
Journal of Financial and Quantitative Analysis, 27(3),
279–299. https://doi.org/10.2307/2331201
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., &
Liu, T.-Y. (2017). LightGBM: A highly efficient gradient
boosting decision tree. Advances in Neural Information Processing
Systems 30 (NIPS 2017).
Keane, M. P., & Neal, T. (2024). A practical guide to weak
instruments. Annual Review of Economics, 16, 185–212.
https://doi.org/10.1146/annurev-economics-092123-111021
Kearns, M., & Valiant, L. (1994). Cryptographic limitations on
learning Boolean formulae and finite automata. Journal
of the ACM, 41(1), 67–95. https://doi.org/10.1145/174644.174647
Keerthi, S. S., & Lin, C.-J. (2003). Asymptotic behaviors of support
vector machines with Gaussian kernel. Neural
Computation, 15(7), 1667–1689. https://doi.org/10.1162/089976603321891855
Kennedy, E. H. (2024). Semiparametric doubly robust targeted double
machine learning: A review. https://arxiv.org/abs/2203.06469
Keys, B. J., Mukherjee, T., Seru, A., & Vig, V. (2010). Did
securitization lead to lax screening? Evidence from subprime loans.
The Quarterly Journal of Economics, 125(1), 307–362.
https://doi.org/10.1162/qjec.2010.125.1.307
Khandani, A. E., Kim, A. J., & Lo, A. W. (2010). Consumer
credit-risk models via machine-learning algorithms. Journal of
Banking and Finance, 34(11), 2767–2787. https://doi.org/10.1016/j.jbankfin.2010.06.001
Khattab, O., & Zaharia, M. (2020). ColBERT: Efficient
and effective passage search via contextualized late interaction over
BERT. Proceedings of the 43rd International ACM SIGIR
Conference on Research and Development in Information Retrieval,
39–48. https://doi.org/10.1145/3397271.3401075
Khieu, H. D., Mullineaux, D. J., & Yi, H.-C. (2012). The
determinants of bank loan recovery rates. Journal of Banking and
Finance, 36(4), 923–933. https://doi.org/10.1016/j.jbankfin.2011.10.005
Kilbertus, N., Rojas-Carulla, M., Parascandolo, G., Hardt, M., Janzing,
D., & Schölkopf, B. (2017). Avoiding discrimination through causal
reasoning. Advances in Neural Information Processing Systems 30
(NIPS 2017).
Kim, B., Khanna, R., & Koyejo, O. O. (2016). Examples are not
enough, learn to criticize! Criticism for interpretability. Advances
in Neural Information Processing Systems 29 (NeurIPS 2016).
Kim, B., Wattenberg, M., Gilmer, J., Cai, C., Wexler, J., Viegas, F.,
& Sayres, R. (2018). Interpretability beyond feature attribution:
Quantitative testing with concept activation vectors
(TCAV). Proceedings of the 35th International
Conference on Machine Learning (ICML), 2668–2677.
Kimeldorf, G., & Wahba, G. (1971). Some results on
Tchebycheffian spline functions. Journal of
Mathematical Analysis and Applications, 33(1), 82–95. https://doi.org/10.1016/0022-247X(71)90184-3
King, G., & Zeng, L. (2001). Logistic regression in rare events
data. Political Analysis, 9(2), 137–163. https://doi.org/10.1093/oxfordjournals.pan.a004868
Kingma, D. P., & Ba, J. (2015). Adam: A method for stochastic
optimization. International Conference on Learning Representations
(ICLR).
Kingma, D. P., & Welling, M. (2014). Auto-encoding variational
Bayes. International Conference on Learning
Representations (ICLR).
Kipf, T. N., & Welling, M. (2017). Semi-supervised classification
with graph convolutional networks. International Conference on
Learning Representations (ICLR).
Kiryo, R., Niu, G., Plessis, M. C. du, & Sugiyama, M. (2017).
Positive-unlabeled learning with non-negative risk estimator.
Advances in Neural Information Processing Systems (NeurIPS),
30.
Kisgen, D. J. (2006). Credit ratings and capital structure. Journal
of Finance, 61(3), 1035–1072. https://doi.org/10.1111/j.1540-6261.2006.00866.x
Klaise, J., Van Looveren, A., Cox, C., Vacanti, G., & Coca, A.
(2020). Monitoring and explainability of models in production.
USENIX Conference on Operational Machine Learning (OpML).
Klein, J. P., & Moeschberger, M. L. (2003). Survival analysis:
Techniques for censored and truncated data (2nd ed.). Springer. https://doi.org/10.1007/b97377
Kleinberg, J., Ludwig, J., Mullainathan, S., & Rambachan, A. (2018).
Algorithmic fairness. AEA Papers and Proceedings, 108,
22–27. https://doi.org/10.1257/pandp.20181018
Kleinberg, J., Mullainathan, S., & Raghavan, M. (2017). Inherent
trade-offs in the fair determination of risk scores. 8th Innovations
in Theoretical Computer Science Conference (ITCS 2017), 43:1–43:23.
https://doi.org/10.4230/LIPIcs.ITCS.2017.43
Klinger, B., Khwaja, A. I., & Carpio, C. del. (2013). Enterprising
psychometrics and poverty reduction. SpringerBriefs in
Psychology. https://doi.org/10.1007/978-1-4614-7227-8
Koenker, R., & Bassett, G. (1978). Regression quantiles.
Econometrica, 46(1), 33–50. https://doi.org/10.2307/1913643
Koh, K., Kim, S.-J., & Boyd, S. (2007). An interior-point method for
large-scale L1-regularized logistic regression. Journal of Machine
Learning Research, 8, 1519–1555.
Koh, P. W., Sagawa, S., Marklund, H., Xie, S. M., Zhang, M.,
Balsubramani, A., Hu, W., Yasunaga, M., Phillips, R. L., Beery, S., et
al. (2021). WILDS: A benchmark of in-the-wild distribution
shifts. Proceedings of the 38th International Conference on Machine
Learning (ICML).
Kojima, T., Gu, S. S., Reid, M., Matsuo, Y., & Iwasawa, Y. (2022).
Large language models are zero-shot reasoners. Advances in Neural
Information Processing Systems 35 (NeurIPS),
22199–22213.
Kokhlikyan, N., Miglani, V., Martin, M., Wang, E., Alsallakh, B.,
Reynolds, J., Melnikov, A., Kliushkina, N., Araya, C., Yan, S., &
Reblitz-Richardson, O. (2020). Captum: A unified and generic model
interpretability library for PyTorch. arXiv Preprint
arXiv:2009.07896.
Kolen, M. J., & Brennan, R. L. (2014). Test equating, scaling,
and linking: Methods and practices (3rd ed.). Springer. https://doi.org/10.1007/978-1-4939-0317-7
Kolmogorov, A. (1933). Sulla determinazione empirica di una legge di
distribuzione. Giornale Dell’Istituto Italiano Degli Attuari,
4, 83–91.
Koopman, S. J., Lucas, A., & Monteiro, A. (2008). The multi-state
latent factor intensity model for credit rating transitions. Journal
of Econometrics, 142(1), 399–424. https://doi.org/10.1016/j.jeconom.2007.07.001
Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits
and attributes are predictable from digital records of human behavior.
Proceedings of the National Academy of Sciences,
110(15), 5802–5805. https://doi.org/10.1073/pnas.1218772110
Kotelnikov, A., Baranchuk, D., Rubachev, I., & Babenko, A. (2023).
TabDDPM: Modelling tabular data with diffusion models.
Proceedings of the 40th International Conference on Machine Learning
(ICML), 17564–17579.
Kou, G., Xu, Y., Peng, Y., Shen, F., Chen, Y., Chang, K., & Kou, S.
(2021). Bankruptcy prediction for SMEs using transactional
data and two-stage multiobjective feature selection. Decision
Support Systems, 140, 113429. https://doi.org/10.1016/j.dss.2020.113429
Kozodoi, N., Lessmann, S., Alamgir, M., Moreira-Matias, L., &
Papakonstantinou, K. (2025). Fighting sampling bias: A framework for
training and evaluating credit scoring models. European Journal of
Operational Research, 324(2), 616–628.
Kraus, S., & Feuerriegel, S. (2017). Decision support from financial
disclosures with deep neural networks and transfer learning.
Decision Support Systems, 104, 38–48. https://doi.org/10.1016/j.dss.2017.10.001
Krawczyk, B. (2016). Learning from imbalanced data: Open challenges and
future directions. Progress in Artificial Intelligence,
5(4), 221–232. https://doi.org/10.1007/s13748-016-0094-0
Kreps, J., Narkhede, N., & Rao, J. (2011). Kafka: A distributed
messaging system for log processing. Proceedings of the 6th
International Workshop on Networking Meets Databases (NetDB).
Krishna, S., Han, T., Gu, A., Pombra, J., Jabbari, S., Wu, S., &
Lakkaraju, H. (2024). The disagreement problem in explainable machine
learning: A practitioner’s perspective. Transactions on Machine
Learning Research.
Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2017).
ImageNet classification with deep convolutional neural
networks. Communications of the ACM, 60(6), 84–90. https://doi.org/10.1145/3065386
Kuk, A. Y. C., & Chen, C.-H. (1992). A mixture model combining
logistic regression with proportional hazards regression.
Biometrika, 79(3), 531–541. https://doi.org/10.1093/biomet/79.3.531
Kull, M., Silva Filho, T. M., & Flach, P. (2017). Beyond sigmoids:
How to obtain well-calibrated probabilities from binary classifiers with
beta calibration. Electronic Journal of Statistics,
11(2), 5052–5080. https://doi.org/10.1214/17-EJS1338SI
Kullback, S., & Leibler, R. A. (1951). On information and
sufficiency. The Annals of Mathematical Statistics,
22(1), 79–86. https://doi.org/10.1214/aoms/1177729694
Kumar, A., Liang, P. S., & Ma, T. (2019). Verified uncertainty
calibration. Advances in Neural Information Processing Systems,
32.
Kumar, I. E., Venkatasubramanian, S., Scheidegger, C., & Friedler,
S. (2020). Problems with Shapley-value-based explanations
as feature importance measures. Proceedings of the 37th
International Conference on Machine Learning, 5491–5500.
Künzel, S. R., Sekhon, J. S., Bickel, P. J., & Yu, B. (2019a).
Metalearners for estimating heterogeneous treatment effects using
machine learning. Proceedings of the National Academy of
Sciences, 116(10), 4156–4165. https://doi.org/10.1073/pnas.1804597116
Künzel, S. R., Sekhon, J. S., Bickel, P. J., & Yu, B. (2019b).
Metalearners for estimating heterogeneous treatment effects using
machine learning. Proceedings of the National Academy of
Sciences, 116(10), 4156–4165. https://doi.org/10.1073/pnas.1804597116
Kupiec, P. H. (2018). On the accuracy of alternative approaches for
calibrating bank stress test models. Journal of Financial
Stability, 38, 132–146. https://doi.org/10.1016/j.jfs.2018.04.002
Kursa, M. B., & Rudnicki, W. R. (2010). Feature selection with the
Boruta package. Journal of Statistical Software,
36(11), 1–13. https://doi.org/10.18637/jss.v036.i11
Kusner, M. J., Loftus, J. R., Russell, C., & Silva, R. (2017).
Counterfactual fairness. Advances in Neural Information Processing
Systems 30 (NIPS 2017).
Kvamme, H., Sellereite, N., Aas, K., & Sjursen, S. (2018).
Predicting mortgage default using convolutional neural networks.
Expert Systems with Applications, 102, 207–217. https://doi.org/10.1016/j.eswa.2018.02.029
Lagakos, S. W., Barraj, L. M., & De Gruttola, V. (1988).
Nonparametric analysis of truncated survival data, with application to
AIDS. Biometrika, 75(3), 515–523. https://doi.org/10.1093/biomet/75.3.515
Lando, D. (1998). On Cox processes and credit risky
securities. Review of Derivatives Research, 2(2-3),
99–120. https://doi.org/10.1007/BF01531332
Lando, D., & Nielsen, M. S. (2010). Correlation in corporate
defaults: Contagion or conditional independence? Journal of
Financial Intermediation, 19(3), 355–372. https://doi.org/10.1016/j.jfi.2010.03.002
Lando, D., & Skødeberg, T. M. (2002). Analyzing rating transitions
and rating drift with continuous observations. Journal of Banking
and Finance, 26(2-3), 423–444. https://doi.org/10.1016/S0378-4266(01)00228-X
Larcker, D. F., & Zakolyukina, A. A. (2012). Detecting deceptive
discussions in conference calls. Journal of Accounting
Research, 50(2), 495–540. https://doi.org/10.1111/j.1475-679X.2012.00450.x
Lauer, J. (2017). Creditworthy: A history of consumer surveillance
and financial identity in america.
Laugel, T., Lesot, M.-J., Marsala, C., Renard, X., & Detyniecki, M.
(2018). Comparison-based inverse classification for interpretability in
machine learning. Communications in Computer and Information
Science, 853, 100–111. https://doi.org/10.1007/978-3-319-91473-2\_9
Le Morvan, M., Josse, J., Scornet, E., & Varoquaux, G. (2021).
What’s a good imputation to predict with missing values? Advances in
Neural Information Processing Systems (NeurIPS), 34.
LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning.
Nature, 521(7553), 436–444. https://doi.org/10.1038/nature14539
LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998).
Gradient-based learning applied to document recognition. Proceedings
of the IEEE, 86(11), 2278–2324. https://doi.org/10.1109/5.726791
Lee, D. S., & Lemieux, T. (2010). Regression discontinuity designs
in economics. Journal of Economic Literature, 48(2),
281–355. https://doi.org/10.1257/jel.48.2.281
Lee, D. S., McCrary, J., Moreira, M. J., & Porter, J. (2022). Valid
t-ratio inference for IV. American Economic
Review, 112(10), 3260–3290. https://doi.org/10.1257/aer.20211063
Lee, D.-H. (2013). Pseudo-label: The simple and efficient
semi-supervised learning method for deep neural networks. ICML
Workshop on Challenges in Representation Learning.
Lee, L.-F. (1983). Generalized econometric models with selectivity.
Econometrica, 51(2), 507–512. https://doi.org/10.2307/1912003
Lehmann, E. L., & Casella, G. (1998). Theory of point
estimation (2nd ed.). Springer. https://doi.org/10.1007/b98854
Lei, J., G’Sell, M., Rinaldo, A., Tibshirani, R. J., & Wasserman, L.
(2018). Distribution-free predictive inference for regression.
Journal of the American Statistical Association,
113(523), 1094–1111. https://doi.org/10.1080/01621459.2017.1307116
Leland, H. E. (1994). Corporate debt value, bond covenants, and optimal
capital structure. The Journal of Finance, 49(4),
1213–1252. https://doi.org/10.2307/2329184
Leland, H. E., & Toft, K. B. (1996). Optimal capital structure,
endogenous bankruptcy, and the term structure of credit spreads. The
Journal of Finance, 51(3), 987–1019. https://doi.org/10.2307/2329229
Lemaître, G., Nogueira, F., & Aridas, C. K. (2017).
Imbalanced-learn: A Python toolbox to tackle the curse of
imbalanced datasets in machine learning. Journal of Machine Learning
Research, 18(17), 1–5.
Lemmens, A., & Gupta, S. (2020). Managing churn to maximize profits.
Marketing Science, 39(5), 956–973. https://doi.org/10.1287/mksc.2020.1229
Lending Club. (2019). Lending club loan data (2007–2018).
Kaggle Dataset Mirror.
Leow, M., & Crook, J. (2014). Intensity models and transition
probabilities for credit card loan delinquencies. European Journal
of Operational Research, 236(2), 685–694. https://doi.org/10.1016/j.ejor.2013.12.026
Lessmann, S., Baesens, B., Seow, H.-V., & Thomas, L. C. (2015b).
Benchmarking state-of-the-art classification algorithms for credit
scoring: An update of research. European Journal of Operational
Research, 247(1), 124–136. https://doi.org/10.1016/j.ejor.2015.05.030
Lessmann, S., Baesens, B., Seow, H.-V., & Thomas, L. C. (2015a).
Benchmarking state-of-the-art classification algorithms for credit
scoring: An update of research. European Journal of Operational
Research, 247(1), 124–136. https://doi.org/10.1016/j.ejor.2015.05.030
Letham, B., Rudin, C., McCormick, T. H., & Madigan, D. (2015).
Interpretable classifiers using rules and Bayesian
analysis: Building a better stroke prediction model. The Annals of
Applied Statistics, 9(3), 1350–1371. https://doi.org/10.1214/15-AOAS848
Letizia, E., & Lillo, F. (2019). Corporate payments networks and
credit risk rating. EPJ Data Science, 8(1), 21. https://doi.org/10.1140/epjds/s13688-019-0197-5
Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N.,
Küttler, H., Lewis, M., Yih, W., Rocktäschel, T., Riedel, S., &
Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive
NLP tasks. Advances in Neural Information Processing
Systems 33 (NeurIPS), 9459–9474.
Leyshon, A., & Thrift, N. (1999). Lists come alive: Electronic
systems of knowledge and the rise of credit-scoring in retail banking.
Economy and Society, 28(3), 434–466. https://doi.org/10.1080/03085149900000013
Li, C., Wang, H., Jiang, S., & Gu, B. (2024). The effect of
AI-enabled credit scoring on financial inclusion: Evidence
from an underserved population of over one million. MIS
Quarterly, 48(4), 1803–1834. https://doi.org/10.25300/MISQ/2024/18340
Li, D. X. (2000). On default correlation: A copula function approach.
Journal of Fixed Income, 9(4), 43–54. https://doi.org/10.3905/jfi.2000.319253
Li, F. (2008). Annual report readability, current earnings, and earnings
persistence. Journal of Accounting and Economics,
45(2–3), 221–247. https://doi.org/10.1016/j.jacceco.2008.02.003
Li, F. (2010). The information content of forward-looking statements in
corporate filings: A naïve Bayesian machine
learning approach. Journal of Accounting Research,
48(5), 1049–1102. https://doi.org/10.1111/j.1475-679X.2010.00382.x
Li, O., Liu, H., Chen, C., & Rudin, C. (2018). Deep learning for
case-based reasoning through prototypes: A neural network that explains
its predictions. Proceedings of the 32nd AAAI Conference on
Artificial Intelligence, 3530–3537.
Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2020). Federated
learning: Challenges, methods, and future directions. IEEE Signal
Processing Magazine, 37(3), 50–60. https://doi.org/10.1109/MSP.2020.2975749
Liang, D., Lu, C.-C., Tsai, C.-F., & Shih, G.-A. (2016). Financial
ratios and corporate governance indicators in bankruptcy prediction: A
comprehensive study. European Journal of Operational Research,
252(2), 561–572.
Liberti, J. M., & Petersen, M. A. (2019). Information: Hard and
soft. Review of Corporate Finance Studies, 8(1), 1–41.
https://doi.org/10.1093/rcfs/cfy009
Lim, B., Alaa, A. M., & Schaar, M. van der. (2018). Forecasting
treatment responses over time using recurrent marginal structural
networks. Advances in Neural Information Processing Systems
(NeurIPS), 31.
Lim, B., Arık, S. Ö., Loeff, N., & Pfister, T. (2021). Temporal
fusion transformers for interpretable multi-horizon time series
forecasting. International Journal of Forecasting,
37(4), 1748–1764. https://doi.org/10.1016/j.ijforecast.2021.03.012
Lin, H.-T., Lin, C.-J., & Weng, R. C. (2007). A note on
Platt’s probabilistic outputs for support vector machines.
Machine Learning, 68(3), 267–276. https://doi.org/10.1007/s10994-007-5018-6
Lin, M., Prabhala, N. R., & Viswanathan, S. (2013). Judging
borrowers by the company they keep: Friendship networks and information
asymmetry in online peer-to-peer lending. Management Science,
59(1), 17–35. https://doi.org/10.1287/mnsc.1120.1560
Lipton, Z. C. (2018). The mythos of model interpretability.
Communications of the ACM, 61(10), 36–43. https://doi.org/10.1145/3233231
Lipton, Z. C., Wang, Y.-X., & Smola, A. (2018). Detecting and
correcting for label shift with black box predictors. International
Conference on Machine Learning (ICML), 3122–3130.
Little, R. J. A. (1993). Pattern-mixture models for multivariate
incomplete data. Journal of the American Statistical
Association, 88(421), 125–134. https://doi.org/10.1080/01621459.1993.10594302
Little, R. J. A., & Rubin, D. B. (2019). Statistical analysis
with missing data.
Liu, Y., Hu, T., Zhang, H., Wu, H., Wang, S., Ma, L., & Long, M.
(2024). iTransformer: Inverted transformers
are effective for time series forecasting. Proceedings of the
International Conference on Learning Representations (ICLR). https://openreview.net/forum?id=JePfAI8fah
Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., Levy, O.,
Lewis, M., Zettlemoyer, L., & Stoyanov, V. (2020).
RoBERTa: A robustly optimized BERT pretraining
approach. Proceedings of ICLR (Workshop Track).
Löffler, G. (2004). An anatomy of rating through the cycle. Journal
of Banking & Finance, 28(3), 695–720. https://doi.org/10.1016/S0378-4266(03)00041-4
Löffler, G. (2013). Can rating agencies look through the cycle?
Review of Quantitative Finance and Accounting, 40(4),
623–646. https://doi.org/10.1007/s11156-012-0289-9
Löffler, G., & Posch, P. N. (2011). Credit risk modeling using
Excel and VBA (2nd ed.). Wiley Finance.
Loh, W.-Y. (2014). Fifty years of classification and regression trees.
International Statistical Review, 82(3), 329–348. https://doi.org/10.1111/insr.12016
Longstaff, F. A., Pan, J., Pedersen, L. H., & Singleton, K. J.
(2011). How sovereign is sovereign credit risk? American Economic
Journal: Macroeconomics, 3(2), 75–103. https://doi.org/10.1257/mac.3.2.75
Longstaff, F. A., & Schwartz, E. S. (1995). A simple approach to
valuing risky fixed and floating rate debt. The Journal of
Finance, 50(3), 789–819. https://doi.org/10.2307/2329288
López de Prado, M. (2018). Advances in financial machine
learning.
Loshchilov, I., & Hutter, F. (2019). Decoupled weight decay
regularization. International Conference on Learning Representations
(ICLR).
Lou, Y., Caruana, R., Gehrke, J., & Hooker, G. (2013). Accurate
intelligible models with pairwise interactions. Proceedings of the
19th ACM SIGKDD International Conference on Knowledge Discovery and Data
Mining (KDD), 623–631. https://doi.org/10.1145/2487575.2487579
Loughran, T., & McDonald, B. (2011). When is a liability not a
liability? Textual analysis, dictionaries, and 10-Ks.
The Journal of Finance, 66(1), 35–65. https://doi.org/10.1111/j.1540-6261.2010.01625.x
Loughran, T., & McDonald, B. (2016). Textual analysis in accounting
and finance: A survey. Journal of Accounting Research,
54(4), 1187–1230. https://doi.org/10.1111/1475-679X.12123
Loukas, L., Stogiannidis, I., Diamantopoulos, O., Malakasiotis, P.,
& Vassos, S. (2023). Making LLMs worth every penny:
Resource-limited text classification in banking. Proceedings of the
Fourth ACM International Conference on AI in Finance
(ICAIF), 392–400. https://doi.org/10.1145/3604237.3626891
Loutskina, E. (2011). The role of securitization in bank liquidity and
funding management. Journal of Financial Economics,
100(3), 663–684. https://doi.org/10.1016/j.jfineco.2011.02.005
Lu, J., Liu, A., Dong, F., Gu, F., Gama, J., & Zhang, G. (2019).
Learning under concept drift: A review. IEEE Transactions on
Knowledge and Data Engineering, 31(12), 2346–2363. https://doi.org/10.1109/TKDE.2018.2876857
Lu, T., Zhang, Y., & Li, B. (2023). Profit vs. Equality? The case of
financial risk assessment and a new perspective on alternative data.
MIS Quarterly, 47(4), 1517–1556. https://doi.org/10.25300/MISQ/2023/17330
Lu, Y., Bartolo, M., Moore, A., Riedel, S., & Stenetorp, P. (2022).
Fantastically ordered prompts and where to find them: Overcoming
few-shot prompt order sensitivity. Proceedings of the 60th Annual
Meeting of the Association for Computational Linguistics
(ACL), 8086–8098. https://doi.org/10.18653/v1/2022.acl-long.556
Lundberg, S. M., Erion, G. G., & Lee, S.-I. (2018). Consistent
individualized feature attribution for tree ensembles. ICML Workshop
on Human Interpretability in Machine Learning.
Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair,
B., Katz, R., Himmelfarb, J., Bansal, N., & Lee, S.-I. (2020). From
local explanations to global understanding with explainable AI for
trees. Nature Machine Intelligence, 2(1), 56–67. https://doi.org/10.1038/s42256-019-0138-9
Lundberg, S. M., & Lee, S.-I. (2017). A unified approach to
interpreting model predictions. Advances in Neural Information
Processing Systems 30.
Luo, D., Cheng, W., Xu, D., Yu, W., Zong, B., Chen, H., & Zhang, X.
(2020). Parameterized explainer for graph neural network. Advances
in Neural Information Processing Systems 33 (NeurIPS 2020).
MacKay, D. J. C. (1992). A practical Bayesian framework for
backpropagation networks. Neural Computation, 4(3),
448–472. https://doi.org/10.1162/neco.1992.4.3.448
MacKinlay, A. C. (1997). Event studies in economics and finance.
Journal of Economic Literature, 35(1), 13–39.
Madras, D., Creager, E., Pitassi, T., & Zemel, R. (2018). Learning
adversarially fair and transferable representations. Proceedings of
the 35th International Conference on Machine Learning (ICML),
3384–3393.
Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A.
(2018). Towards deep learning models resistant to adversarial attacks.
International Conference on Learning Representations (ICLR).
Mahalanobis, P. C. (1936). On the generalised distance in statistics.
Proceedings of the National Institute of Sciences of India,
2(1), 49–55.
Mahoney, N. (2015). Bankruptcy as implicit health insurance.
American Economic Review, 105(2), 710–746. https://doi.org/10.1257/aer.20131408
Malesky, E., & Taussig, M. (2009). Out of the gray: The impact of
provincial institutions on business formalization in vietnam.
Journal of East Asian Studies, 9(2), 249–290.
Malgieri, G., & Comandé, G. (2017). Why a right to legibility of
automated decision-making exists in the general data protection
regulation. International Data Privacy Law, 7(4),
243–265. https://doi.org/10.1093/idpl/ipx019
Malik, M., & Thomas, L. C. (2010). Modelling credit risk of
portfolio of consumer loans. Journal of the Operational Research
Society, 61(3), 411–420. https://doi.org/10.1057/jors.2009.123
Mancisidor, R. A., Kampffmeyer, M., Aas, K., & Jenssen, R. (2020).
Deep generative models for reject inference in credit scoring.
Knowledge-Based Systems, 196, 105758. https://doi.org/10.1016/j.knosys.2020.105758
Manela, A., & Moreira, A. (2017). News implied volatility and
disaster concerns. Journal of Financial Economics,
123(1), 137–162. https://doi.org/10.1016/j.jfineco.2016.01.032
Mani, I., & Zhang, I. (2003). kNN
approach to unbalanced data distributions: A case study involving
information extraction. Proceedings of the ICML Workshop on Learning
from Imbalanced Datasets.
Mann, H. B., & Whitney, D. R. (1947). On a test of whether one of
two random variables is stochastically larger than the other. The
Annals of Mathematical Statistics, 18(1), 50–60. https://doi.org/10.1214/aoms/1177730491
Manski, C. F. (1989). Anatomy of the selection problem. Journal of
Human Resources, 24(3), 343–360. https://doi.org/10.2307/145818
Manski, C. F. (1990). Nonparametric bounds on treatment effects.
American Economic Review, 80(2), 319–323.
Manski, C. F. (1993). Identification of endogenous social effects: The
reflection problem. The Review of Economic Studies,
60(3), 531–542. https://doi.org/10.2307/2298123
Marchenko, Y. V., & Genton, M. G. (2012). A heckman selection-t model. Journal of the American
Statistical Association, 107(497), 304–317. https://doi.org/10.1080/01621459.2012.656011
Marqués, A. I., García, V., & Sánchez, J. S. (2013). On the
suitability of resampling techniques for the class imbalance problem in
credit scoring. Journal of the Operational Research Society,
64(7), 1060–1070. https://doi.org/10.1057/jors.2012.120
Marra, G., & Radice, R. (2013). A penalized likelihood estimation
approach to semiparametric sample selection binary response modeling.
Electronic Journal of Statistics, 7, 1432–1455. https://doi.org/10.1214/13-EJS814
Marra, G., & Radice, R. (2017). Bivariate copula additive models for
location, scale and shape. Computational Statistics and Data
Analysis, 112, 99–113. https://doi.org/10.1016/j.csda.2017.03.004
Martin, K. D., Borah, A., & Palmatier, R. W. (2017). Data privacy:
Effects on customer and firm performance. Journal of Marketing,
81(1), 36–58. https://doi.org/10.1509/jm.15.0497
Martins, A., & Astudillo, R. (2016). From softmax to sparsemax: A
sparse model of attention and multi-label classification.
International Conference on Machine Learning, 1614–1623.
Mason, K. O., Mason, W. M., Winsborough, H. H., & Poole, W. K.
(1973). Some methodological issues in cohort analysis of archival data.
American Sociological Review, 38(2), 242–258. https://doi.org/10.2307/2094398
Mason, L., Baxter, J., Bartlett, P., & Frean, M. (1999).
Boosting algorithms as gradient descent.
Mattei, P.-A., & Frellsen, J. (2019). MIWAE: Deep
generative modelling and imputation of incomplete data sets.
Proceedings of the 36th International Conference on Machine Learning
(ICML).
Matz, S. C., Kosinski, M., Nave, G., & Stillwell, D. J. (2017).
Psychological targeting as an effective approach to digital mass
persuasion. Proceedings of the National Academy of Sciences,
114(48), 12714–12719. https://doi.org/10.1073/pnas.1710966114
Mayew, W. J., & Venkatachalam, M. (2012). The power of voice:
Managerial affective states and future firm performance. The Journal
of Finance, 67(1), 1–43. https://doi.org/10.1111/j.1540-6261.2011.01705.x
Mazumder, R., Hastie, T., & Tibshirani, R. (2010). Spectral
regularization algorithms for learning large incomplete matrices.
Journal of Machine Learning Research, 11, 2287–2322.
Mbiti, I., & Weil, D. N. (2011). Mobile banking: The impact of
m-pesa in kenya. NBER Working Paper, (17129). https://doi.org/10.3386/w17129
McClish, D. K. (1989). Analyzing a portion of the ROC
curve. Medical Decision Making, 9(3), 190–195. https://doi.org/10.1177/0272989X8900900307
McCrary, J. (2008). Manipulation of the running variable in the
regression discontinuity design: A density test. Journal of
Econometrics, 142(2), 698–714. https://doi.org/10.1016/j.jeconom.2007.05.005
McCullagh, P., & Nelder, J. A. (1989a). Generalized linear
models.
McCullagh, P., & Nelder, J. A. (1989b). Generalized linear
models (2nd ed.). Chapman; Hall/CRC. https://doi.org/10.1201/9780203753736
McFadden, D. (1974). Conditional logit analysis of qualitative
choice behavior. 105–142.
McKenzie, D., & Paffhausen, A. L. (2019). Small firm death in
developing countries. Review of Economics and Statistics,
101(4), 645–657. https://doi.org/10.1162/rest_a_00798
McMahan, B., Moore, E., Ramage, D., Hampson, S., & Agüera y Arcas,
B. (2017). Communication-efficient learning of deep networks from
decentralized data. Proceedings of the 20th International Conference
on Artificial Intelligence and Statistics (AISTATS), 1273–1282.
McNeil, A. J., Frey, R., & Embrechts, P. (2015). Quantitative
risk management: Concepts, techniques and tools.
Mease, D., & Wyner, A. (2008). Evidence contrary to the statistical
view of boosting. Journal of Machine Learning Research,
9, 131–156.
Medina, P. C. (2021). Side effects of nudging: Evidence from a
randomized intervention in the credit card market. Review of
Financial Studies, 34(5), 2580–2607. https://doi.org/10.1093/rfs/hhaa108
Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A.
(2021). A survey on bias and fairness in machine learning. ACM
Computing Surveys, 54(6), 1–35. https://doi.org/10.1145/3457607
Meinshausen, N., & Bühlmann, P. (2010). Stability selection.
Journal of the Royal Statistical Society. Series B (Statistical
Methodology), 72(4), 417–473. https://doi.org/10.1111/j.1467-9868.2010.00740.x
Melnychuk, V., Frauen, D., & Feuerriegel, S. (2022). Causal
transformer for estimating counterfactual outcomes. International
Conference on Machine Learning (ICML).
Mercer, J. (1909). Functions of positive and negative type, and their
connection with the theory of integral equations. Philosophical
Transactions of the Royal Society of London. Series A,
209, 415–446. https://doi.org/10.1098/rsta.1909.0016
Merrick, L., & Taly, A. (2020). The explanation game: Explaining
machine learning models using Shapley values. 17–38.
https://doi.org/10.1007/978-3-030-57321-8\_2
Merton, R. C. (1974). On the pricing of corporate debt: The risk
structure of interest rates. The Journal of Finance,
29(2), 449–470. https://doi.org/10.2307/2978814
Mester, L. J. (1997). What’s the point of credit scoring? Federal
Reserve Bank of Philadelphia Business Review, 3–16.
Mian, A., & Sufi, A. (2009). The consequences of mortgage credit
expansion: Evidence from the U.S. Mortgage default crisis.
The Quarterly Journal of Economics, 124(4), 1449–1496.
https://doi.org/10.1162/qjec.2009.124.4.1449
Mian, A., Sufi, A., & Verner, E. (2017). Household debt and business
cycles worldwide. Quarterly Journal of Economics,
132(4), 1755–1817. https://doi.org/10.1093/qje/qjx017
Miao, W., Liu, L., Tchetgen Tchetgen, E. J., & Geng, Z. (2024).
Identification, doubly robust estimation, and semiparametric efficiency
theory of nonignorable missing data with a shadow variable. Annals
of Statistics, 52(4), 1448–1473. https://doi.org/10.1214/24-AOS2391
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient
estimation of word representations in vector space. Proceedings of
the International Conference on Learning Representations (ICLR).
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J.
(2013). Distributed representations of words and phrases and their
compositionality. Advances in Neural Information Processing Systems
(NeurIPS).
Miller, A. R., & Tucker, C. E. (2018). Privacy protection,
personalized medicine, and genetic testing. Management Science,
64(10), 4648–4668. https://doi.org/10.1287/mnsc.2017.2858
Miller, T. (2019). Explanation in artificial intelligence: Insights from
the social sciences. Artificial Intelligence, 267,
1–38. https://doi.org/10.1016/j.artint.2018.07.007
Ministry of Finance of Vietnam. (2014). Vietnamese accounting
standards framework and circular
200/2014/TT-BTC on the corporate accounting
regime. Hanoi. https://mof.gov.vn/
Ministry of Finance of Vietnam. (2020). Decision no.
345/QD-BTC approving the scheme on application of financial
reporting standards in Vietnam. Ministry of Finance of
Vietnam. https://mof.gov.vn/
Mironov, I. (2017). Rényi differential
privacy. Proceedings of the IEEE 30th Computer Security Foundations
Symposium (CSF), 263–275. https://doi.org/10.1109/CSF.2017.11
Mitchell, M., Wu, S., Zaldivar, A., Barnes, P., Vasserman, L.,
Hutchinson, B., Spitzer, E., Raji, I. D., & Gebru, T. (2019). Model
cards for model reporting. Proceedings of the Conference on
Fairness, Accountability, and Transparency, 220–229. https://doi.org/10.1145/3287560.3287596
Miu, P., & Ozdemir, B. (2006). Basel requirements of downturn loss
given default: Modeling and estimating probability of default and loss
given default correlations. Journal of Credit Risk,
2(2), 43–68. https://doi.org/10.21314/JCR.2006.038
Molnar, C. (2022). Interpretable machine learning.
Montiel Olea, J. L., & Pflueger, C. (2013). A robust test for weak
instruments. Journal of Business and Economic Statistics,
31(3), 358–369. https://doi.org/10.1080/00401706.2013.806694
Moreno-Torres, J. G., Raeder, T., Alaiz-Rodrı́guez, R., Chawla, N. V.,
& Herrera, F. (2012). A unifying view on dataset shift in
classification. Pattern Recognition, 45(1), 521–530.
Moritz, P., Nishihara, R., Wang, S., Tumanov, A., Liaw, R., Liang, E.,
Elibol, M., Yang, Z., Paul, W., Jordan, M. I., & Stoica, I. (2018).
Ray: A distributed framework for emerging AI applications.
USENIX Symposium on Operating Systems Design and Implementation
(OSDI), 561–577.
Morse, A. (2015). Peer-to-peer crowdfunding: Information and the
potential for disruption in consumer lending. Annual Review of
Financial Economics, 7, 463–482. https://doi.org/10.1146/annurev-financial-111914-041939
Moscatelli, M., Parlapiano, F., Narizzano, S., & Viggiano, G.
(2020). Corporate default forecasting with machine learning. Expert
Systems with Applications, 161, 113567. https://doi.org/10.1016/j.eswa.2020.113567
Moscato, V., Picariello, A., & Sperlì, G. (2021). A benchmark of
machine learning approaches for credit score prediction. Expert
Systems with Applications, 165, 113986. https://doi.org/10.1016/j.eswa.2020.113986
Mothilal, R. K., Sharma, A., & Tan, C. (2020). Explaining machine
learning classifiers through diverse counterfactual explanations.
Proceedings of the 2020 Conference on Fairness, Accountability, and
Transparency, 607–617. https://doi.org/10.1145/3351095.3372850
M_Service Joint Stock Company. (2022). MoMo alternative
credit scoring pilot with TPBank and consumer finance
partners. Company press release, Ho Chi Minh City. https://momo.vn/
Munnell, A. H., Tootell, G. M. B., Browne, L. E., & McEneaney, J.
(1996). Mortgage lending in Boston: Interpreting
HMDA data. American Economic Review,
86(1), 25–53.
Murfin, J., & Spiegel, M. (2020). Is the risk of sea level rise
capitalized in residential real estate? Review of Financial
Studies, 33(3), 1217–1255. https://doi.org/10.1093/rfs/hhz134
Murphy, A. H. (1973). A new vector partition of the probability score.
Journal of Applied Meteorology, 12(4), 595–600. https://doi.org/10.1175/1520-0450(1973)012<0595:ANVPOT>2.0.CO;2
Murphy, K. M., & Topel, R. H. (1985). Estimation and inference in
two-step econometric models. Journal of Business and Economic
Statistics, 3(4), 370–379. https://doi.org/10.1080/07350015.1985.10509471
Myers, J. H., & Forgy, E. W. (1963). The development of numerical
credit evaluation systems. Journal of the American Statistical
Association, 58(303), 799–806. https://doi.org/10.2307/2282727
Nakkiran, P., Kaplun, G., Bansal, Y., Yang, T., Barak, B., &
Sutskever, I. (2020). Deep double descent: Where bigger models and more
data hurt. International Conference on Learning Representations
(ICLR). https://openreview.net/forum?id=B1g5sA4twr
Nakkiran, P., Venkat, P., Kakade, S., & Ma, T. (2021). Optimal
regularization can mitigate double descent. International Conference
on Learning Representations (ICLR). https://openreview.net/forum?id=7R7fAoUygoa
Narain, B. (1992). Survival analysis and the credit granting decision.
Credit Scoring and Credit Control, Oxford University Press,
109–121.
National Assembly of Vietnam. (2006). Law on gender equality, no.
73/2006/QH11. Hanoi. https://vanbanphapluat.co/
National Assembly of Vietnam. (2010). Law on persons with
disabilities, no. 51/2010/QH12. Hanoi. https://vanbanphapluat.co/
National Assembly of Vietnam. (2018). Law on cybersecurity, no.
24/2018/QH14. Hanoi. https://vanbanphapluat.co/
National Credit Information Centre of Vietnam. (2023). Annual report
of the Credit Information Centre
(CIC). State Bank of Vietnam. https://cic.gov.vn/
National Institute of Standards and Technology. (2023). Artificial
intelligence risk management framework (AI RMF 1.0)
[NIST AI 100-1]. U.S. Department of Commerce. https://doi.org/10.6028/NIST.AI.100-1
National Payment Corporation of Vietnam. (2023). NAPAS
annual report on interbank electronic payment switching. Hanoi. https://napas.com.vn/
Neal, R. M. (1996). Bayesian learning for neural networks. Lecture
Notes in Statistics, 118.
Nelder, J. A., & Wedderburn, R. W. M. (1972). Generalized linear
models. Journal of the Royal Statistical Society. Series A
(General), 135(3), 370–384. https://doi.org/10.2307/2344614
Nelsen, R. B. (2006). An introduction to copulas (2nd ed.).
Springer. https://doi.org/10.1007/0-387-28678-0
Nelson, S. (2024). Private information and price regulation in the
US credit card market. Working Paper, Chicago
Booth.
Nemenyi, P. B. (1963). Distribution-free multiple comparisons.
Princeton University Press.
Network for Greening the Financial System. (2022). NGFS climate
scenarios for central banks and supervisors. NGFS.
Netzer, O., Lemaire, A., & Herzenstein, M. (2019). When words sweat:
Identifying signals for loan default in the text of loan applications.
Journal of Marketing Research, 56(6), 960–980. https://doi.org/10.1177/0022243719852959
Newman, M. E. J. (2003). The structure and function of complex networks.
SIAM Review, 45(2), 167–256. https://doi.org/10.1137/S003614450342480
Newman, M. E. J. (2005). A measure of betweenness centrality based on
random walks. Social Networks, 27(1), 39–54. https://doi.org/10.1016/j.socnet.2004.11.009
Neyman, J. (1959). Optimal asymptotic tests of composite statistical
hypotheses. Probability and Statistics, 213–234.
Neyman, J., & Scott, E. L. (1948). Consistent estimates based on
partially consistent observations. Econometrica,
16(1), 1–32. https://doi.org/10.2307/1914288
Ngai, E. W. T., Hu, Y., Wong, Y. H., Chen, Y., & Sun, X. (2011). The
application of data mining techniques in financial fraud detection: A
classification framework and an academic review of literature.
Decision Support Systems, 50(3), 559–569. https://doi.org/10.1016/j.dss.2010.08.006
Nguyen, D. Q., & Nguyen, A.-T. (2020). PhoBERT: Pre-trained language
models for vietnamese. Findings of the Association for Computational
Linguistics: EMNLP 2020, 1037–1042.
Nickell, P., Perraudin, W., & Varotto, S. (2000). Stability of
rating transitions. Journal of Banking and Finance,
24(1-2), 203–227. https://doi.org/10.1016/S0378-4266(99)00057-6
Niculescu-Mizil, A., & Caruana, R. (2005). Predicting good
probabilities with supervised learning. Proceedings of the 22nd
International Conference on Machine Learning (ICML), 625–632. https://doi.org/10.1145/1102351.1102430
Nie, Y., Nguyen, N. H., Sinthong, P., & Kalagnanam, J. (2023). A
time series is worth 64 words: Long-term forecasting with transformers.
Proceedings of the International Conference on Learning
Representations (ICLR). https://openreview.net/forum?id=Jbdc0vTOcol
Nocedal, J., & Wright, S. J. (2006). Numerical optimization
(2nd ed.). Springer. https://doi.org/10.1007/978-0-387-40065-5
Nogueira, R., & Cho, K. (2019). Passage re-ranking with
BERT. arXiv:1901.04085.
Nori, H., Jenkins, S., Koch, P., & Caruana, R. (2019).
InterpretML: A unified framework for machine learning
interpretability. https://arxiv.org/abs/1909.09223
Oaxaca, R. (1973). Male-female wage differentials in urban labor
markets. International Economic Review, 14(3),
693–709. https://doi.org/10.2307/2525981
Office of the Comptroller of the Currency. (2011a). Supervisory
guidance on model risk management (OCC bulletin
2011-12). https://www.occ.treas.gov/news-issuances/bulletins/2011/bulletin-2011-12.html
Office of the Comptroller of the Currency. (2011b). Supervisory
guidance on model risk management (OCC bulletin 2011-12). U.S.
Department of the Treasury. https://www.occ.treas.gov/news-issuances/bulletins/2011/bulletin-2011-12.html
Office of the Comptroller of the Currency. (2013). OCC bulletin
2013-29: Third-party relationships. OCC Risk Management Guidance.
https://www.occ.gov/news-issuances/bulletins/2013/bulletin-2013-29.html
Office of the Comptroller of the Currency. (2015). Comptroller’s
handbook: Credit card lending. Office of the Comptroller of the
Currency. https://www.occ.gov/publications-and-resources/publications/comptrollers-handbook/files/credit-card-lending/index-credit-card-lending.html
Office of the Comptroller of the Currency. (2021). Model risk
management: Comptroller’s handbook. OCC. https://www.occ.gov/publications-and-resources/publications/comptrollers-handbook/files/model-risk-management/index-model-risk-management.html
Ohlson, J. A. (1980). Financial ratios and the probabilistic prediction
of bankruptcy. Journal of Accounting Research, 18(1),
109–131. https://doi.org/10.2307/2490395
Ólafsson, A., & Pagel, M. (2018). The liquid hand-to-mouth: Evidence
from personal finance management software. The Review of Financial
Studies, 31(11), 4398–4446. https://doi.org/10.1093/rfs/hhy055
Olegario, R. (2006). A culture of credit: Embedding trust and
transparency in american business.
Olson, D. L., Delen, D., & Meng, Y. (2012). Comparative analysis of
data mining methods for bankruptcy prediction. Decision Support
Systems, 52(2), 464–473. https://doi.org/10.1016/j.dss.2011.10.007
Onnela, J.-P., Saramaki, J., Hyvonen, J., Szabo, G., Lazer, D., Kaski,
K., Kertesz, J., & Barabasi, A.-L. (2007). Structure and tie
strengths in mobile communication networks. Proceedings of the
National Academy of Sciences, 104(18), 7332–7336. https://doi.org/10.1073/pnas.0610245104
Oreshkin, B. N., Carpov, D., Chapados, N., & Bengio, Y. (2020).
N-BEATS: Neural basis expansion analysis for interpretable
time series forecasting. Proceedings of the International Conference
on Learning Representations (ICLR). https://openreview.net/forum?id=r1ecqn4YwB
Orgler, Y. E. (1970). A credit scoring model for commercial loans.
Journal of Money, Credit and Banking, 2(4), 435–445.
https://doi.org/10.2307/1991095
Orús, R., Mugel, S., & Lizaso, E. (2019). Quantum computing for
finance: Overview and prospects. Reviews in Physics,
4, 100028. https://doi.org/10.1016/j.revip.2019.100028
Otoritas Jasa Keuangan. (2016). POJK
11/POJK.03/2016 on minimum capital adequacy requirement for
commercial banks (KPMM). Indonesian Financial Services
Authority. https://www.ojk.go.id/
Otoritas Jasa Keuangan. (2022). Regulation number
10/POJK.05/2022 on information technology-based lending
services. Indonesian Financial Services Authority. https://www.ojk.go.id/
Otoritas Jasa Keuangan. (2023). POJK 22/2023 on
consumer and community protection in the financial services sector.
Indonesian Financial Services Authority. https://www.ojk.go.id/
Owen, A. B. (2014). Sobol’ indices and Shapley value.
SIAM/ASA Journal on Uncertainty Quantification, 2(1),
245–251. https://doi.org/10.1137/130936233
Pagan, A., & Vella, F. (1989). Diagnostic tests for models based on
individual data: A survey. Journal of Applied Econometrics,
4(S1), S29–S59. https://doi.org/10.1002/jae.3950040504
Page, E. S. (1954). Continuous inspection schemes. Biometrika,
41(1/2), 100–115. https://doi.org/10.2307/2333009
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999a). The
PageRank citation ranking: Bringing order to the web.
Stanford InfoLab Technical Report.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999b). The
PageRank citation ranking: Bringing order to the
Web. Stanford InfoLab Technical Report.
Paleyes, A., Urma, R.-G., & Lawrence, N. D. (2022). Challenges in
deploying machine learning: A survey of case studies. ACM Computing
Surveys, 55(6), 114:1–114:29. https://doi.org/10.1145/3533378
Papadopoulos, H., Proedrou, K., Vovk, V., & Gammerman, A. (2002).
Inductive confidence machines for regression. European Conference on
Machine Learning (ECML), 345–356. https://doi.org/10.1007/3-540-36755-1\_29
Paravisini, D., & Schoar, A. (2015). The incentive effect of
scores: Randomized evidence from credit committees (NBER Working
Paper 19303). National Bureau of Economic Research. https://doi.org/10.3386/w19303
Parikh, N., & Boyd, S. (2014). Proximal algorithms. Foundations
and Trends in Optimization, 1(3), 127–239. https://doi.org/10.1561/2400000003
Park, M. Y., & Hastie, T. (2007). L1-regularization path algorithm
for generalized linear models. Journal of the Royal Statistical
Society. Series B (Statistical Methodology), 69(4),
659–677. https://doi.org/10.1111/j.1467-9868.2007.00607.x
Parlour, C. A., Rajan, U., & Walden, J. (2022). Payment system
externalities. The Journal of Finance, 77(2),
1019–1053. https://doi.org/10.1111/jofi.13110
Parlour, C. A., Rajan, U., & Zhu, H. (2022). When
FinTech competes for payment flows. Review of Financial
Studies, 35(11), 4985–5024. https://doi.org/10.1093/rfs/hhac022
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G.,
Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., et al. (2019).
PyTorch: An imperative style, high-performance deep
learning library. Advances in Neural Information Processing
Systems, 32.
Pattabhiramaiah, A., Sriram, S., & Sridhar, S. (2018). Rising prices
under declining preferences: The case of the U.S. Print
newspaper industry. Marketing Science, 37(1), 97–122.
https://doi.org/10.1287/mksc.2017.1051
Pearl, J. (1995). Causal diagrams for empirical research.
Biometrika, 82(4), 669–688. https://doi.org/10.1093/biomet/82.4.669
Pearl, J. (2009). Causality: Models, reasoning, and inference
(2nd ed.). Cambridge University Press. https://doi.org/10.1017/CBO9780511803161
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B.,
Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V.,
Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M.,
& Duchesnay, É. (2011). Scikit-learn: Machine learning in
Python. Journal of Machine Learning Research,
12, 2825–2830.
Pennington, J., Socher, R., & Manning, C. D. (2014).
GloVe: Global vectors for word representation.
Proceedings of the Conference on Empirical Methods in Natural
Language Processing (EMNLP). https://doi.org/10.3115/v1/D14-1162
Perdomo, J. C., Zrnic, T., Mendler-Dünner, C., & Hardt, M. (2020).
Performative prediction. Proceedings of the 37th International
Conference on Machine Learning (ICML).
Perozzi, B., Al-Rfou, R., & Skiena, S. (2014).
DeepWalk: Online learning of social representations.
Proceedings of the 20th ACM SIGKDD International Conference on
Knowledge Discovery and Data Mining, 701–710. https://doi.org/10.1145/2623330.2623732
Peters, M. E., Neumann, M., Iyyer, M., Gardner, M., Clark, C., Lee, K.,
& Zettlemoyer, L. (2018). Deep contextualized word representations.
Proceedings of the Conference of the North American Chapter of the
Association for Computational Linguistics (NAACL). https://doi.org/10.18653/v1/N18-1202
Petersen, M. A., & Rajan, R. G. (1994). The benefits of lending
relationships: Evidence from small business data. Journal of
Finance, 49(1), 3–37. https://doi.org/10.1111/j.1540-6261.1994.tb04418.x
Petersen, M. A., & Rajan, R. G. (2002). Does distance still matter?
The information revolution in small business lending.
Journal of Finance, 57(6), 2533–2570. https://doi.org/10.1111/1540-6261.00505
Petsiuk, V., Das, A., & Saenko, K. (2018). RISE:
Randomized input sampling for explanation of black-box models.
British Machine Vision Conference (BMVC).
Phan, L., Tran, H., Nguyen, H., & Trinh, T. H. (2022). ViT5:
Pretrained text-to-text transformer for vietnamese language generation.
Proceedings of the 2022 Conference of the North American Chapter of
the Association for Computational Linguistics: Human Language
Technologies: Student Research Workshop, 136–142.
Philippon, T. (2016). The FinTech opportunity
(NBER Working Paper 22476). National Bureau of Economic Research. https://doi.org/10.3386/w22476
Philippon, T. (2020). On fintech and financial inclusion. NBER
Working Paper, (26330).
Pineau, J., Vincent-Lamarre, P., Sinha, K., Larivière, V., Beygelzimer,
A., d’Alché-Buc, F., Fox, E., & Larochelle, H. (2021). Improving
reproducibility in machine learning research: A report from the
NeurIPS 2019 reproducibility program. Journal of
Machine Learning Research, 22, 1–20. https://jmlr.org/papers/v22/20-303.html
Piskorski, T., Seru, A., & Witkin, J. (2015). Asset quality
misrepresentation by financial intermediaries: Evidence from the
RMBS market. The Journal of Finance,
70(6), 2635–2678. https://doi.org/10.1111/jofi.12271
Platt, J. C. (1998). Sequential minimal optimization: A fast
algorithm for training support vector machines (Technical Report
MSR-TR-98-14). Microsoft Research.
Platt, J. C. (1999). Probabilistic outputs for support vector
machines and comparisons to regularized likelihood methods. 61–74.
Pleiss, G., Raghavan, M., Wu, F., Kleinberg, J., & Weinberger, K. Q.
(2017). On fairness and calibration. Advances in Neural Information
Processing Systems 30 (NIPS 2017).
Plosser, M. C., & Santos, J. A. C. (2018). Banks’ incentives and
inconsistent risk models. The Review of Financial Studies,
31(6), 2080–2112. https://doi.org/10.1093/rfs/hhy028
Pluto, K., & Tasche, D. (2005a). Thinking positively. Risk,
18(8), 72–78.
Pluto, K., & Tasche, D. (2005b). Thinking positively. Risk
Magazine.
Polyzotis, N., Roy, S., Whang, S. E., & Zinkevich, M. (2018). Data
lifecycle challenges in production machine learning: A survey. ACM
SIGMOD Record, 47, 17–28. https://doi.org/10.1145/3299887.3299891
Pope, D. G., & Sydnor, J. R. (2011). What’s in a picture? Evidence
of discrimination from Prosper.com.
Journal of Human Resources, 46(1), 53–92. https://doi.org/10.3368/jhr.46.1.53
Popov, S., Morozov, S., & Babenko, A. (2020). Neural oblivious
decision ensembles for deep learning on tabular data. International
Conference on Learning Representations (ICLR).
Potharst, R., & Feelders, A. J. (2002). Classification trees for
problems with monotonicity constraints. ACM SIGKDD Explorations
Newsletter, 4(1), 1–10. https://doi.org/10.1145/568574.568577
Poyiadzi, R., Sokol, K., Santos-Rodriguez, R., De Bie, T., & Flach,
P. (2020). FACE: Feasible and actionable counterfactual
explanations. Proceedings of the AAAI/ACM Conference on AI, Ethics,
and Society, 344–350. https://doi.org/10.1145/3375627.3375850
Prechelt, L. (1998). Early stopping—but when? Neural Networks:
Tricks of the Trade, Lecture Notes in Computer Science,
1524, 55–69. https://doi.org/10.1007/3-540-49430-8_3
Pregibon, D. (1980). Goodness of link tests for generalized linear
models. Journal of the Royal Statistical Society Series C: Applied
Statistics, 29(1), 15–24.
Prentice, R. L., & Gloeckler, L. A. (1978). Regression analysis of
grouped survival data with application to breast cancer data.
Biometrics, 34(1), 57–67. https://doi.org/10.2307/2529588
Prentice, R. L., Kalbfleisch, J. D., Peterson, A. V., Flournoy, N.,
Farewell, V. T., & Breslow, N. E. (1978). The analysis of failure
times in the presence of competing risks. Biometrics,
34(4), 541–554. https://doi.org/10.2307/2530374
Preskill, J. (2018). Quantum computing in the NISQ era and
beyond. Quantum, 2, 79. https://doi.org/10.22331/q-2018-08-06-79
Press, S. J., & Wilson, S. (1978). Choosing between logistic
regression and discriminant analysis. Journal of the American
Statistical Association, 73(364), 699–705. https://doi.org/10.2307/2286261
Prieger, J. E. (2003). A flexible parametric selection model for
non-normal data with application to health care usage. Journal of
Applied Econometrics, 18(3), 367–392. https://doi.org/10.1002/jae.696
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin,
A. (2018). CatBoost: Unbiased boosting with categorical
features. Advances in Neural Information Processing Systems 31
(NeurIPS 2018).
Provost, F., & Fawcett, T. (2001). Robust classification for
imprecise environments. Machine Learning, 42(3),
203–231. https://doi.org/10.1023/A:1007601015854
Prudential Regulation Authority. (2018). Model risk management
principles for stress testing (SS3/18). Bank of England. https://www.bankofengland.co.uk/prudential-regulation/publication/2018/model-risk-management-principles-for-stress-testing
Prudential Regulation Authority. (2023). Supervisory statement
SS1/23: Model risk management principles for banks. Bank of
England. https://www.bankofengland.co.uk/prudential-regulation/publication/2023/may/model-risk-management-principles-for-banks-ss
Puhani, P. A. (2000). The heckman correction for sample selection and
its critique. Journal of Economic Surveys, 14(1),
53–68. https://doi.org/10.1111/1467-6419.00104
Purda, L., & Skillicorn, D. (2015). Accounting variables, deception,
and a bag of words: Assessing the tools of fraud detection.
Contemporary Accounting Research, 32(3), 1193–1223. https://doi.org/10.1111/1911-3846.12089
Qi, M., & Zhao, X. (2011). Comparison of modeling methods for loss
given default. Journal of Banking and Finance, 35(11),
2842–2855. https://doi.org/10.1016/j.jbankfin.2011.03.011
Quinlan, J. R. (1986). Induction of decision trees. Machine
Learning, 1(1), 81–106. https://doi.org/10.1007/BF00116251
Quinlan, J. R. (1993). C4.5: Programs for machine learning.
Morgan Kaufmann.
Quiñonero-Candela, J., Sugiyama, M., Schwaighofer, A., & Lawrence,
N. D. (2009). Dataset shift in machine learning. MIT Press.
Rabanser, S., Günnemann, S., & Lipton, Z. C. (2019). Failing loudly:
An empirical study of methods for detecting dataset shift. Advances
in Neural Information Processing Systems (NeurIPS), 32,
1394–1406.
Rabiner, L. R. (1989). A tutorial on hidden Markov models
and selected applications in speech recognition. Proceedings of the
IEEE, 77(2), 257–286. https://doi.org/10.1109/5.18626
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S.,
Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., &
Sutskever, I. (2021). Learning transferable visual models from natural
language supervision. Proceedings of the 38th International
Conference on Machine Learning (ICML).
Rafieian, O., & Yoganarasimhan, H. (2021). Targeting and privacy in
mobile advertising. Marketing Science, 40(2), 193–218.
https://doi.org/10.1287/mksc.2020.1235
Raftery, A. E. (1995). Bayesian model selection in social
research. Sociological Methodology, 25, 111–163. https://doi.org/10.2307/271063
Rahimi, A., & Recht, B. (2007). Random features for large-scale
kernel machines. Advances in Neural Information Processing Systems
20 (NIPS 2007).
Rajan, R. G. (1992). Insiders and outsiders: The choice between informed
and arm’s-length debt. Journal of Finance, 47(4),
1367–1400. https://doi.org/10.1111/j.1540-6261.1992.tb04662.x
Rajan, U., Seru, A., & Vig, V. (2010). Statistical default models
and incentives. American Economic Review Papers and
Proceedings, 100(2), 506–510. https://doi.org/10.1257/aer.100.2.506
Rajan, U., Seru, A., & Vig, V. (2015). The failure of models that
predict failure: Distance, incentives, and defaults. Journal of
Financial Economics, 115(2), 237–260. https://doi.org/10.1016/j.jfineco.2014.09.012
Rambachan, A., Kleinberg, J., Ludwig, J., & Mullainathan, S. (2020).
An economic perspective on algorithmic fairness. AEA Papers and
Proceedings, 110, 91–95. https://doi.org/10.1257/pandp.20201036
Rambachan, A., & Roth, J. (2023). A more credible approach to
parallel trends. Review of Economic Studies, 90(5),
2555–2591. https://doi.org/10.1093/restud/rdad018
Rao, C. R. (1948). The utilization of multiple measurements in problems
of biological classification. Journal of the Royal Statistical
Society. Series B (Methodological), 10(2), 159–203. https://doi.org/10.1111/j.2517-6161.1948.tb00008.x
Rasul, K., Ashok, A., Williams, A. R., Ghonia, H., Bhagwatkar, R.,
Khorasani, A., Bayazi, M. J. D., Adamopoulos, G., Riachi, R., Hassen,
N., Biloš, M., Garg, S., Schneider, A., Chapados, N., Drouin, A.,
Zantedeschi, V., Nevmyvaka, Y., & Rish, I. (2024).
Lag-Llama: Towards foundation models for probabilistic
time series forecasting. arXiv:2310.08278. https://arxiv.org/abs/2310.08278
Reimers, N., & Gurevych, I. (2019). Sentence-BERT:
Sentence embeddings using siamese BERT-networks.
Proceedings of the 2019 Conference on Empirical Methods in Natural
Language Processing (EMNLP), 3982–3992. https://doi.org/10.18653/v1/D19-1410
Republic of Indonesia. (2022). Law no. 27/2022 on personal data
protection (UU PDP). State Gazette of the Republic of
Indonesia. https://www.bphn.go.id/
Republic of Kenya. (2019). Data protection act, 2019. Kenya
Gazette Supplement No. 181, Act No. 24 of 2019. https://www.odpc.go.ke/
Republic of South Africa. (2013). Protection of personal information
act (POPIA). Government Gazette. https://popia.co.za/
Reserve Bank of India. (2016). Master direction – non-banking
financial company – account aggregator (Reserve Bank)
directions. Reserve Bank of India. https://www.rbi.org.in/
Reserve Bank of India. (2022). Guidelines on digital lending.
Reserve Bank of India. https://www.rbi.org.in/
Reserve Bank of India. (2023a). Guidelines on default loss guarantee
(FLDG) in digital lending. Reserve Bank of India. https://www.rbi.org.in/
Reserve Bank of India. (2023b). Master circular: Basel
III capital regulations. Reserve Bank of India. https://www.rbi.org.in/Scripts/BS_ViewMasCirculardetails.aspx
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016). “Why
Should I Trust You?”: Explaining the predictions of any
classifier. Proceedings of the 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, 1135–1144. https://doi.org/10.1145/2939672.2939778
Ribeiro, M. T., Singh, S., & Guestrin, C. (2018). Anchors:
High-precision model-agnostic explanations. Proceedings of the AAAI
Conference on Artificial Intelligence, 32. https://doi.org/10.1609/aaai.v32i1.11491
Robbins, H., & Monro, S. (1951). A stochastic approximation method.
The Annals of Mathematical Statistics, 22(3), 400–407.
https://doi.org/10.1214/aoms/1177729586
Robertson, S., & Zaragoza, H. (2009). The probabilistic relevance
framework: BM25 and beyond. Foundations and Trends in
Information Retrieval, 3(4), 333–389. https://doi.org/10.1561/1500000019
Robins, J. M., & Rotnitzky, A. (1992). Recovery of information and
adjustment for dependent censoring using surrogate markers. In N. P.
Jewell, K. Dietz, & V. T. Farewell (Eds.), AIDS epidemiology:
Methodological issues (pp. 297–331). Birkhäuser. https://doi.org/10.1007/978-1-4757-1229-2_14
Robins, J. M., Rotnitzky, A., & Scharfstein, D. O. (2000).
Sensitivity analysis for selection bias and unmeasured confounding in
missing data and causal inference models. In Statistical models in
epidemiology, the environment, and clinical trials (Vol. 116, pp.
1–94). Springer. https://doi.org/10.1007/978-1-4612-1284-3_1
Robins, J. M., Rotnitzky, A., & Zhao, L. P. (1994). Estimation of
regression coefficients when some regressors are not always observed.
Journal of the American Statistical Association,
89(427), 846–866. https://doi.org/10.1080/01621459.1994.10476818
Robinson, P. M. (1988). Root-N-consistent semiparametric
regression. Econometrica, 56(4), 931–954. https://doi.org/10.2307/1912705
Rogers, A., Kovaleva, O., & Rumshisky, A. (2020). A primer in
BERTology: What we know about how BERT works.
Transactions of the Association for Computational Linguistics,
8, 842–866. https://doi.org/10.1162/tacl_a_00349
Romano, Y., Patterson, E., & Candès, E. J. (2019). Conformalized
quantile regression. Advances in Neural Information Processing
Systems 32 (NeurIPS 2019).
Romano, Y., Sesia, M., & Candès, E. J. (2020). Classification with
valid and adaptive coverage. Advances in Neural Information
Processing Systems, 33.
Rona-Tas, A. (2020). Predicting the future: Art and algorithms.
Socio-Economic Review, 18(3), 893–911. https://doi.org/10.1093/ser/mwaa040
Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the
propensity score in observational studies for causal effects.
Biometrika, 70(1), 41–55. https://doi.org/10.1093/biomet/70.1.41
Ross, S. L., Turner, M. A., Godfrey, E., & Smith, R. R. (2008).
Mortgage lending in chicago and los angeles: A paired testing study of
the pre-application process. Journal of Urban Economics,
63(3), 902–919. https://doi.org/10.1016/j.jue.2007.07.008
Roth, J., Sant’Anna, P. H. C., Bilinski, A., & Poe, J. (2023).
What’s trending in difference-in-differences? A synthesis of the recent
econometrics literature. Journal of Econometrics,
235(2), 2218–2244. https://doi.org/10.1016/j.jeconom.2023.03.008
Rothschild, M., & Stiglitz, J. E. (1976). Equilibrium in competitive
insurance markets: An essay on the economics of imperfect information.
The Quarterly Journal of Economics, 90(4), 629–649. https://doi.org/10.2307/1885326
Roure, C. de, Pelizzon, L., & Thakor, A. V. (2022). P2P lenders
versus banks: Cream skimming or bottom fishing? The Review of
Corporate Finance Studies, 11(2), 213–262. https://doi.org/10.1093/rcfs/cfab026
Rubin, D. B. (1974). Estimating causal effects of treatments in
randomized and nonrandomized studies. Journal of Educational
Psychology, 66(5), 688–701. https://doi.org/10.1037/h0037350
Rubin, D. B. (1976). Inference and missing data. Biometrika,
63(3), 581–592. https://doi.org/10.1093/biomet/63.3.581
Rudin, C. (2019). Stop explaining black box machine learning models for
high stakes decisions and use interpretable models instead. Nature
Machine Intelligence, 1(5), 206–215. https://doi.org/10.1038/s42256-019-0048-x
Rudin, C., Chen, C., Chen, Z., Huang, H., Semenova, L., & Zhong, C.
(2022). Interpretable machine learning: Fundamental principles and 10
grand challenges. Statistics Surveys, 16, 1–85. https://doi.org/10.1214/21-SS133
Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning
representations by back-propagating errors. Nature,
323(6088), 533–536. https://doi.org/10.1038/323533a0
Sadhwani, A., Giesecke, K., & Sirignano, J. (2021). Deep learning
for mortgage risk. Journal of Financial Econometrics,
19(2), 313–368. https://doi.org/10.1093/jjfinec/nbaa025
Saito, T., & Rehmsmeier, M. (2015). The precision-recall plot is
more informative than the ROC plot when evaluating binary
classifiers on imbalanced datasets. PLOS ONE, 10,
e0118432. https://doi.org/10.1371/journal.pone.0118432
Sakurada, M., & Yairi, T. (2014). Anomaly detection using
autoencoders with nonlinear dimensionality reduction. Proceedings of
the MLSDA 2014 2nd Workshop on Machine Learning for Sensory Data
Analysis, 4–11. https://doi.org/10.1145/2689746.2689747
Salinas, D., Flunkert, V., Gasthaus, J., & Januschowski, T. (2020).
DeepAR: Probabilistic forecasting with autoregressive
recurrent networks. International Journal of Forecasting,
36(3), 1181–1191. https://doi.org/10.1016/j.ijforecast.2019.07.001
Sanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019).
DistilBERT, a distilled version of BERT:
Smaller, faster, cheaper and lighter. NeurIPS EMC2 Workshop.
Santurkar, S., Tsipras, D., Ilyas, A., & Madry, A. (2018). How does
batch normalization help optimization? Advances in Neural
Information Processing Systems (NeurIPS).
Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M., &
Monfardini, G. (2009). The graph neural network model. IEEE
Transactions on Neural Networks, 20(1), 61–80. https://doi.org/10.1109/TNN.2008.2005605
Schapire, R. E. (1990). The strength of weak learnability. Machine
Learning, 5(2), 197–227. https://doi.org/10.1007/BF00116037
Scharfstein, D. O., Rotnitzky, A., & Robins, J. M. (1999). Adjusting
for nonignorable drop-out using semiparametric nonresponse models.
Journal of the American Statistical Association,
94(448), 1096–1120. https://doi.org/10.1080/01621459.1999.10473862
Schelter, S., Biessmann, F., Januschowski, T., Salinas, D., Seufert, S.,
& Szarvas, G. (2018). On challenges in machine learning model
management. IEEE Data Engineering Bulletin, 41(4),
5–15.
Schmittlein, D. C., Morrison, D. G., & Colombo, R. (1987). Counting
your customers: Who are they and what will they do next? Management
Science, 33(1), 1–24. https://doi.org/10.1287/mnsc.33.1.1
Schölkopf, B., Platt, J. C., Shawe-Taylor, J., Smola, A. J., &
Williamson, R. C. (2001). Estimating the support of a high-dimensional
distribution. Neural Computation, 13(7), 1443–1471. https://doi.org/10.1162/089976601750264965
Schölkopf, B., Smola, A., & Müller, K.-R. (1998). Nonlinear
component analysis as a kernel eigenvalue problem. Neural
Computation, 10(5), 1299–1319. https://doi.org/10.1162/089976698300017467
Schuermann, T., & Jafry, Y. (2004). Measurement, estimation, and
comparison of credit migration matrices. Journal of Banking &
Finance, 28(11), 2603–2639. https://doi.org/10.1016/j.jbankfin.2004.06.004
Schularick, M., & Taylor, A. M. (2012). Credit booms gone bust:
Monetary policy, leverage cycles, and financial crises, 1870–2008.
American Economic Review, 102(2), 1029–1061. https://doi.org/10.1257/aer.102.2.1029
Schwartz, E. S., & Torous, W. N. (1989). Prepayment and the
valuation of mortgage-backed securities. The Journal of
Finance, 44(2), 375–392. https://doi.org/10.1111/j.1540-6261.1989.tb05062.x
Schweidel, D. A., Fader, P. S., & Bradlow, E. T. (2008).
Understanding service retention within and across cohorts using limited
information. Journal of Marketing, 72(1), 82–94. https://doi.org/10.1509/jmkg.72.1.082
Scornet, E., Biau, G., & Vert, J.-P. (2015). Consistency of random
forests. The Annals of Statistics, 43(4), 1716–1741.
https://doi.org/10.1214/15-AOS1321
Seetharaman, P. B. (2004). Modeling multiple sources of state dependence
in random utility models: A distributed lag approach. Marketing
Science, 23(2), 263–271. https://doi.org/10.1287/mksc.1030.0024
Seetharaman, P. B., & Chintagunta, P. K. (2003). The proportional
hazard model for purchase timing: A comparison of alternative
specifications. Journal of Business and Economic Statistics,
21(3), 368–382. https://doi.org/10.1198/073500103288619025
Seiffert, C., Khoshgoftaar, T. M., Van Hulse, J., & Napolitano, A.
(2010). RUSBoost: A hybrid approach to alleviating class
imbalance. IEEE Transactions on Systems, Man, and Cybernetics, Part
A, 40(1), 185–197. https://doi.org/10.1109/TSMCA.2009.2029559
Selbst, A. D., & Powles, J. (2017). Meaningful information and the
right to explanation. International Data Privacy Law,
7(4), 233–242. https://doi.org/10.1093/idpl/ipx022
Self, S. G., & Liang, K.-Y. (1987). Asymptotic properties of maximum
likelihood estimators and likelihood ratio tests under nonstandard
conditions. Journal of the American Statistical Association,
82(398), 605–610. https://doi.org/10.1080/01621459.1987.10478472
Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., &
Batra, D. (2017). Grad-CAM: Visual explanations from deep
networks via gradient-based localization. Proceedings of the IEEE
International Conference on Computer Vision (ICCV), 618–626. https://doi.org/10.1109/ICCV.2017.74
Sezer, O. B., Gudelek, M. U., & Ozbayoglu, A. M. (2020). Financial
time series forecasting with deep learning: A systematic literature
review: 2005–2019. Applied Soft Computing, 90, 106181.
https://doi.org/10.1016/j.asoc.2020.106181
Shafer, G., & Vovk, V. (2008). A tutorial on conformal prediction.
Journal of Machine Learning Research, 9, 371–421.
Shannon, C. E. (1948). A mathematical theory of communication. The
Bell System Technical Journal, 27(3), 379–423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
Shapley, L. S. (1953). A value for n-person games. Contributions to
the Theory of Games, 2(28), 307–317.
Shi, F., Chen, X., Misra, K., Scales, N., Dohan, D., Chi, E. H.,
Schärli, N., & Zhou, D. (2023). Large language models can be easily
distracted by irrelevant context. Proceedings of the 40th
International Conference on Machine Learning (ICML),
31210–31227.
Shokri, R., Stronati, M., Song, C., & Shmatikov, V. (2017).
Membership inference attacks against machine learning models.
Proceedings of the 2017 IEEE Symposium on Security and Privacy
(SP), 3–18. https://doi.org/10.1109/SP.2017.41
Shrikumar, A., Greenside, P., & Kundaje, A. (2017). Learning
important features through propagating activation differences.
Proceedings of the 34th International Conference on Machine
Learning, 3145–3153.
Shumway, T. (2001). Forecasting bankruptcy more accurately: A simple
hazard model. The Journal of Business, 74(1), 101–124.
https://doi.org/10.1086/209665
Shwartz-Ziv, R., & Armon, A. (2022a). Tabular data: Deep learning is
not all you need. Information Fusion, 81, 84–90. https://doi.org/10.1016/j.inffus.2021.11.011
Shwartz-Ziv, R., & Armon, A. (2022b). Tabular data: Deep learning is
not all you need. Information Fusion, 81, 84–90. https://doi.org/10.1016/j.inffus.2021.11.011
Siddiqi, N. (2017a). Intelligent credit scoring: Building and
implementing better credit risk scorecards.
Siddiqi, N. (2017b). Intelligent credit scoring: Building and
implementing better credit risk scorecards. John Wiley and Sons, 2nd
Edition.
Sill, J. (1998). Monotonic networks. Advances in Neural Information
Processing Systems (NeurIPS), 10.
Simester, D., Timoshenko, A., & Zoumpoulis, S. I. (2020). Targeting
prospective customers: Robustness of machine-learning methods to typical
data challenges. Management Science, 66(6), 2495–2522.
https://doi.org/10.1287/mnsc.2019.3308
Sinha, R. K., & Chandrashekaran, M. (1992). A split hazard model for
analyzing the diffusion of innovations. Journal of Marketing
Research, 29(1), 116–127. https://doi.org/10.1177/002224379202900110
Skiba, P. M., & Tobacman, J. (2019). Do payday loans cause
bankruptcy? Journal of Law and Economics, 62(3),
485–519. https://doi.org/10.1086/706201
Sklar, A. (1959). Fonctions de répartition à n
dimensions et leurs marges. Publications de l’Institut de
Statistique de l’Université de Paris, 8,
229–231.
Skoglund, J., & Chen, W. (2015). Financial risk management:
Applications in market, credit, asset and liability management, and
firmwide risk. Wiley.
Slack, D., Hilgard, S., Jia, E., Singh, S., & Lakkaraju, H. (2020).
Fooling LIME and SHAP: Adversarial attacks on
post hoc explanation methods. Proceedings of the AAAI/ACM Conference
on AI, Ethics, and Society, 180–186. https://doi.org/10.1145/3375627.3375830
Smilkov, D., Thorat, N., Kim, B., Viegas, F., & Wattenberg, M.
(2017). SmoothGrad: Removing noise by adding noise.
arXiv Preprint arXiv:1706.03825.
Smirnov, N. (1948). Table for estimating the goodness of fit of
empirical distributions. The Annals of Mathematical Statistics,
19(2), 279–281. https://doi.org/10.1214/aoms/1177730256
Smith, M. D. (2003). Modelling sample selection using
Archimedean copulas. Econometrics Journal,
6(1), 99–123. https://doi.org/10.1111/1368-423X.00101
Smith, R. J. (1989). On the use of distributional mis-specification
checks in limited dependent variable models. The Economic
Journal, 99(395), 178–192. https://doi.org/10.2307/2234212
Smola, A. J., & Schölkopf, B. (2004). A tutorial on support vector
regression. Statistics and Computing, 14(3), 199–222.
https://doi.org/10.1023/B:STCO.0000035301.49549.88
So, M. C., & Thomas, L. C. (2011). Modelling the profitability of
credit cards by Markov decision processes. European
Journal of Operational Research, 212(1), 123–130. https://doi.org/10.1016/j.ejor.2011.01.023
Sonnenburg, S., Braun, M. L., Ong, C. S., Bengio, S., Bottou, L.,
Holmes, G., LeCun, Y., Müller, K.-R., Pereira, F., Rasmussen, C. E.,
Rätsch, G., Schölkopf, B., Smola, A., Vincent, P., Weston, J., &
Williamson, R. (2007). The need for open source software in machine
learning. Journal of Machine Learning Research, 8,
2443–2466. https://www.jmlr.org/papers/v8/sonnenburg07a.html
Spärck Jones, K. (1972). A statistical interpretation of term
specificity and its application in retrieval. Journal of
Documentation, 28(1), 11–21. https://doi.org/10.1108/eb026526
Spence, M. (1973). Job market signaling. The Quarterly Journal of
Economics, 87(3), 355–374. https://doi.org/10.2307/1882010
Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., &
Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural
networks from overfitting. Journal of Machine Learning
Research, 15(1), 1929–1958.
Stadler, T., Oprisanu, B., & Troncoso, C. (2022). Synthetic data -
anonymisation groundhog day. Proceedings of the 31st USENIX Security
Symposium (USENIX Security), 1451–1468.
Staiger, D., & Stock, J. H. (1997). Instrumental variables
regression with weak instruments. Econometrica, 65(3),
557–586. https://doi.org/10.2307/2171753
Stango, V., & Zinman, J. (2014). Limited and varying consumer
attention: Evidence from shocks to the salience of bank overdraft fees.
Review of Financial Studies, 27(4), 990–1030. https://doi.org/10.1093/rfs/hhu008
Stanton, R. (1995). Rational prepayment and the valuation of
mortgage-backed securities. The Review of Financial Studies,
8(3), 677–708. https://doi.org/10.1093/rfs/8.3.677
State Bank of Vietnam. (2016a). Circular
39/2016/TT-NHNN on lending activities of
credit institutions and foreign bank branches to customers. Hanoi.
https://www.sbv.gov.vn/
State Bank of Vietnam. (2016b). Circular
41/2016/TT-NHNN on capital adequacy ratios for
banks and foreign bank branches. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2018). Circular no.
13/2018/TT-NHNN on the system of internal control of
commercial banks and foreign bank branches. State Bank of Vietnam.
https://www.sbv.gov.vn/
State Bank of Vietnam. (2020a). Circular
16/2020/TT-NHNN on electronic
know-your-customer for payment account opening. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2020b). Circular no.
16/2020/TT-NHNN amending circular 23/2014 on opening and
use of payment accounts, including electronic know-your-customer (eKYC). State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2021a). Circular no.
11/2021/TT-NHNN on classification of assets, levels and
method of setting up of risk provisions, and use of provisions against
credit risks. State Bank of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2021b). Circular no.
11/2021/TT-NHNN on loan classification and provisioning for
credit institutions. State Bank of Vietnam. https://english.luatvietnam.vn/circular-no-11-2021-tt-nhnn-dated-july-30-2021-of-the-state-bank-of-vietnam-providing-the-classification-of-assets-risk-provisioning-levels-and-met-206806-doc1.html
State Bank of Vietnam. (2021c). Decision no.
810/QD-NHNN approving the plan for digital transformation
of the banking sector to 2025, orientation to 2030. State Bank of
Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2022). Annual report 2022. State Bank
of Vietnam. https://www.sbv.gov.vn/
State Bank of Vietnam. (2023a). Circular
22/2023/TT-NHNN amending circular
41/2016/TT-NHNN on capital adequacy ratios for
banks and foreign bank branches. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2023b). Decision
2345/QD-NHNN on solutions for safety and
security in online payments and bank card transactions. Hanoi. https://www.sbv.gov.vn/
State Bank of Vietnam. (2024). Regulatory sandbox for fintech
activities in the banking sector: Decree 94/2025/ND-CP. State Bank
of Vietnam. https://www.sbv.gov.vn/
State of California. (2018). California consumer privacy act of
2018. Cal. Civ. Code §§1798.100–1798.199.
Stein, J. C. (2002). Information production and capital allocation:
Decentralized versus hierarchical firms. Journal of Finance,
57(5), 1891–1921. https://doi.org/10.1111/0022-1082.00483
Steinwart, I., & Christmann, A. (2008). Support vector
machines. Information Science and Statistics. https://doi.org/10.1007/978-0-387-77242-4
Stekhoven, D. J., & Bühlmann, P. (2012).
MissForest—non-parametric missing value imputation for
mixed-type data. Bioinformatics, 28(1), 112–118. https://doi.org/10.1093/bioinformatics/btr597
Stepanova, M., & Thomas, L. C. (2001). PHAB scores:
Proportional hazards analysis behavioural scores. Journal of the
Operational Research Society, 52(9), 1007–1016. https://doi.org/10.1057/palgrave.jors.2601189
Stepanova, M., & Thomas, L. C. (2002). Survival analysis methods for
personal loan data. Operations Research, 50(2),
277–289. https://doi.org/10.1287/opre.50.2.277.426
Stevenson, M., Mues, C., & Bravo, C. (2021). The value of text for
small business default prediction: A deep learning approach.
European Journal of Operational Research, 295(2),
758–771. https://doi.org/10.1016/j.ejor.2021.03.008
Stiglitz, J. E., & Weiss, A. (1981). Credit rationing in markets
with imperfect information. The American Economic Review,
71(3), 393–410.
Stock, J. H., Wright, J. H., & Yogo, M. (2002). A survey of weak
instruments and weak identification in generalized method of moments.
Journal of Business and Economic Statistics, 20(4),
518–529. https://doi.org/10.1198/073500102288618658
Stock, J. H., & Yogo, M. (2005). Testing for weak instruments in
linear IV regression. Identification and Inference for
Econometric Models: Essays in Honor of Thomas Rothenberg, 80–108.
Stodden, V., McNutt, M., Bailey, D. H., Deelman, E., Gil, Y., Hanson,
B., Heroux, M. A., Ioannidis, J. P. A., & Taufer, M. (2016).
Enhancing reproducibility for computational methods. Science,
354(6317), 1240–1241. https://doi.org/10.1126/science.aah6168
Stone, M. (1974). Cross-validatory choice and assessment of statistical
predictions. Journal of the Royal Statistical Society. Series B
(Methodological), 36(2), 111–147.
Strahan, P. E. (1999). Borrower risk and the price and nonprice terms of
bank loans. Federal Reserve Bank of New York Staff Report,
(90). https://www.newyorkfed.org/research/staff_reports/sr90.html
Štrumbelj, E., & Kononenko, I. (2014). Explaining prediction models
and individual predictions with feature contributions. Knowledge and
Information Systems, 41(3), 647–665. https://doi.org/10.1007/s10115-013-0679-x
Stulz, R. M. (2019). FinTech, BigTech, and the
future of banks. Journal of Applied Corporate Finance,
31(4), 86–97. https://doi.org/10.1111/jacf.12378
Sugiyama, M., Krauledat, M., & Müller, K.-R. (2007). Covariate shift
adaptation by importance weighted cross validation. Journal of
Machine Learning Research, 8, 985–1005.
Sugiyama, M., Suzuki, T., Nakajima, S., Kashima, H., Bünau, P. von,
& Kawanabe, M. (2008). Direct importance estimation for covariate
shift adaptation. Annals of the Institute of Statistical
Mathematics, 60(4), 699–746. https://doi.org/10.1007/s10463-008-0197-x
Sun, B., Liu, L., Miao, W., Wirth, K., Robins, J., & Tchetgen
Tchetgen, E. J. (2018). Semiparametric estimation with data missing not
at random using an instrumental variable. Statistica Sinica,
28(4), 1965–1983. https://doi.org/10.5705/ss.202016.0324
Sun, L., & Abraham, S. (2021). Estimating dynamic treatment effects
in event studies with heterogeneous treatment effects. Journal of
Econometrics, 225(2), 175–199. https://doi.org/10.1016/j.jeconom.2020.09.006
Sun, X., & Xu, W. (2014). Fast implementation of
DeLong’s algorithm for comparing the areas under correlated
receiver operating characteristic curves. IEEE Signal Processing
Letters, 21(11), 1389–1393. https://doi.org/10.1109/LSP.2014.2337313
Sundararajan, M., & Najmi, A. (2020). The many Shapley
values for model explanation. Proceedings of the 37th International
Conference on Machine Learning, 9269–9278.
Sundararajan, M., Taly, A., & Yan, Q. (2017). Axiomatic attribution
for deep networks. Proceedings of the 34th International Conference
on Machine Learning, 3319–3328.
Sundaresan, S. (2013). A review of Merton’s model of the
firm’s capital structure with its wide applications. Annual Review
of Financial Economics, 5, 21–41. https://doi.org/10.1146/annurev-financial-110112-120923
Suri, T. (2017). Mobile money. Annual Review of Economics,
9, 497–520. https://doi.org/10.1146/annurev-economics-063016-103638
Suri, T., & Jack, W. (2016). The long-run poverty and gender impacts
of mobile money. Science, 354(6317), 1288–1292. https://doi.org/10.1126/science.aah5309
Suykens, J. A. K., & Vandewalle, J. (1999). Least squares support
vector machine classifiers. Neural Processing Letters,
9(3), 293–300. https://doi.org/10.1023/A:1018628609742
Swaminathan, A., & Joachims, T. (2015). Counterfactual risk
minimization: Learning from logged bandit feedback. International
Conference on Machine Learning (ICML).
Sy, J. P., & Taylor, J. M. G. (2000). Estimation in a
Cox proportional hazards cure model. Biometrics,
56(1), 227–236. https://doi.org/10.1111/j.0006-341X.2000.00227.x
Tang, H. (2019). Peer-to-peer lenders versus banks: Substitutes or
complements? Review of Financial Studies, 32(5),
1900–1938. https://doi.org/10.1093/rfs/hhy137
Tax, D. M. J., & Duin, R. P. W. (2004). Support vector data
description. Machine Learning, 54(1), 45–66. https://doi.org/10.1023/B:MACH.0000008084.60811.49
Tenney, I., Das, D., & Pavlick, E. (2019). BERT
rediscovers the classical NLP pipeline. Proceedings of
the 57th Annual Meeting of the Association for Computational Linguistics
(ACL), 4593–4601. https://doi.org/10.18653/v1/P19-1452
Tetlock, P. C. (2007). Giving content to investor sentiment: The role of
media in the stock market. The Journal of Finance,
62(3), 1139–1168. https://doi.org/10.1111/j.1540-6261.2007.01232.x
Tetlock, P. C., Saar-Tsechansky, M., & Macskassy, S. (2008). More
than words: Quantifying language to measure firms’ fundamentals. The
Journal of Finance, 63(3), 1437–1467. https://doi.org/10.1111/j.1540-6261.2008.01362.x
Thistlethwaite, D. L., & Campbell, D. T. (1960).
Regression-discontinuity analysis: An alternative to the ex post facto
experiment. Journal of Educational Psychology, 51(6),
309–317. https://doi.org/10.1037/h0044319
Thomas, L. C. (2000a). A survey of credit and behavioural scoring:
Forecasting financial risk of lending to consumers. International
Journal of Forecasting, 16(2), 149–172. https://doi.org/10.1016/S0169-2070(00)00034-0
Thomas, L. C. (2000b). A survey of credit and behavioural scoring:
Forecasting financial risk of lending to consumers. International
Journal of Forecasting, 16(2), 149–172. https://doi.org/10.1016/S0169-2070(00)00034-0
Thomas, L. C., Crook, J., & Edelman, D. (2017). Credit scoring
and its applications (2nd ed.). Society for Industrial; Applied
Mathematics (SIAM). https://doi.org/10.1137/1.9781611974560
Thomas, L. C., Ho, J., & Scherer, W. T. (2001). Time will tell:
Behavioural scoring and the dynamics of consumer credit assessment.
IMA Journal of Management Mathematics, 12(1), 89–103.
https://doi.org/10.1093/imaman/12.1.89
Tian, S., Yu, Y., & Guo, H. (2015). Variable selection and corporate
bankruptcy forecasts. Journal of Banking & Finance,
52, 89–100. https://doi.org/10.1016/j.jbankfin.2014.12.003
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso.
Journal of the Royal Statistical Society: Series B
(Methodological), 58(1), 267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
Tomek, I. (1976). Two modifications of CNN. IEEE
Transactions on Systems, Man and Cybernetics, SMC-6(11),
769–772. https://doi.org/10.1109/TSMC.1976.4309452
Touvron, H., Martin, L., Stone, K., Albert, P., Almahairi, A., Babaei,
Y., Bashlykov, N., Batra, S., Bhargava, P., Bhosale, S., Bikel, D.,
Blecher, L., Ferrer, C. C., Chen, M., Cucurull, G., Esiobu, D.,
Fernandes, J., Fu, J., Fu, W., … Scialom, T. (2023). Llama
2: Open foundation and fine-tuned chat models.
arXiv:2307.09288.
Townsend, R. M. (1979). Optimal contracts and competitive markets with
costly state verification. Journal of Economic Theory,
21(2), 265–293. https://doi.org/10.1016/0022-0531(79)90031-0
Tran, K. Q., Duong, B. V., Tran, L. Q., Tran, A. L.-H., Nguyen, A. T.,
& Nguyen, K. V. (2021). Machine learning-based empirical
investigation for credit scoring in vietnam’s banking. International
Conference on Industrial, Engineering and Other Applications of Applied
Intelligent Systems, 564–574.
Treacy, W. F., & Carey, M. (2000). Credit risk rating systems at
large US banks. Journal of Banking & Finance,
24(1–2), 167–201. https://doi.org/10.1016/S0378-4266(99)00056-4
Trefethen, L. N., & Bau, D. (1997). Numerical linear
algebra. SIAM. https://doi.org/10.1137/1.9780898719574
Trench, M. S., Pederson, S. P., Lau, E. T., Ma, L., Wang, H., &
Nair, S. K. (2003). Managing credit lines and prices for Bank
One credit cards. Interfaces, 33(5), 4–21. https://doi.org/10.1287/inte.33.5.4.19245
Truong, C., Oudre, L., & Vayatis, N. (2020). Selective review of
offline change point detection methods. Signal Processing,
167, 107299. https://doi.org/10.1016/j.sigpro.2019.107299
Tsai, Y.-H. H., Bai, S., Yamada, M., Morency, L.-P., &
Salakhutdinov, R. (2019). Transformer dissection: An unified
understanding for transformer’s attention via the lens of kernel.
Proceedings of EMNLP, 4335–4344. https://doi.org/10.18653/v1/D19-1443
Tsiatis, A. (1975). A nonidentifiability aspect of the problem of
competing risks. Proceedings of the National Academy of
Sciences, 72(1), 20–22. https://doi.org/10.1073/pnas.72.1.20
Tsiatis, A. A. (1981). A large sample study of cox’s regression model.
The Annals of Statistics, 9(1), 93–108. https://doi.org/10.1214/aos/1176345335
Tsybakov, A. B. (2008). Introduction to nonparametric estimation.
Springer Series in Statistics.
Turjeman, D., & Feinberg, F. M. (2024). When the data are out:
Measuring behavioral changes following a data breach. Marketing
Science, 43(2), 440–461. https://doi.org/10.1287/mksc.2019.0208
United Mexican States. (2002). Ley para regular las sociedades de
información crediticia. Federal Official Gazette, 15 January 2002.
https://www.diputados.gob.mx/
United Mexican States. (2010). Ley federal de protección de datos
personales en posesión de los particulares (LFPDPPP).
Federal Official Gazette, 5 July 2010. https://www.diputados.gob.mx/
United Mexican States. (2018). Ley para regular las instituciones de
tecnología financiera (Fintech Law). Federal Official
Gazette, 9 March 2018. https://www.diputados.gob.mx/
United States Congress. (1970). Fair credit reporting act, 15 u.s.c.
§§ 1681 et seq. Public Law 91-508. https://www.consumer.ftc.gov/articles/pdf-0111-fair-credit-reporting-act.pdf
United States Congress. (1975). Home mortgage disclosure act of
1975. Public Law 94-200; 12 U.S.C. 2801 et seq.
Uno, H., Cai, T., Pencina, M. J., D’Agostino, R. B., & Wei, L. J.
(2011). On the C-statistics for evaluating overall adequacy
of risk prediction procedures with censored survival data.
Statistics in Medicine, 30(10), 1105–1117. https://doi.org/10.1002/sim.4154
Upper, C. (2011). Simulation methods to assess the danger of contagion
in interbank markets. Journal of Financial Stability,
7(3), 111–125. https://doi.org/10.1016/j.jfs.2010.12.001
U.S. Congress. (1974). Equal credit opportunity act, 15 u.s.c.
§1691. United States Code.
U.S. Department of Housing and Urban Development. (2013).
Implementation of the fair housing act’s discriminatory effects
standard (24 CFR § 100.500). HUD. https://www.federalregister.gov/documents/2013/02/15/2013-03375/implementation-of-the-fair-housing-acts-discriminatory-effects-standard
U.S. Department of the Treasury. (2024). Managing
AI-specific cybersecurity risks in the financial services
sector. U.S. Department of the Treasury. https://home.treasury.gov/system/files/136/Managing-Artificial-Intelligence-Specific-Cybersecurity-Risks-In-The-Financial-Services-Sector.pdf
Ustun, B., Spangher, A., & Liu, Y. (2019). Actionable recourse in
linear classification. Proceedings of the 2019 Conference on
Fairness, Accountability, and Transparency, 10–19. https://doi.org/10.1145/3287560.3287566
Vaart, A. W. van der. (1998). Asymptotic statistics. Cambridge
University Press. https://doi.org/10.1017/CBO9780511802256
Vallée, B., & Zeng, Y. (2019). Marketplace lending: A new banking
paradigm? The Review of Financial Studies, 32(5),
1939–1982. https://doi.org/10.1093/rfs/hhz005
Vansteelandt, S., Rotnitzky, A., & Robins, J. M. (2007). Estimation
of regression models for the mean of repeated outcomes under
nonignorable nonmonotone nonresponse. Biometrika,
94(4), 841–860. https://doi.org/10.1093/biomet/asm070
Vapnik, V. N. (1999). An overview of statistical learning theory.
IEEE Transactions on Neural Networks, 10(5), 988–999.
https://doi.org/10.1109/72.788640
Vapnik, V. N., & Chervonenkis, A. Y. (1971). On the uniform
convergence of relative frequencies of events to their probabilities.
Theory of Probability and Its Applications, 16(2),
264–280. https://doi.org/10.1137/1116025
Vasicek, O. A. (2002a). The distribution of loan portfolio value.
Risk, 15(12), 160–162.
Vasicek, O. A. (2002b). The distribution of loan portfolio value.
Risk Magazine, 15(12), 160–162.
Vassalou, M., & Xing, Y. (2004). Default risk in equity returns.
The Journal of Finance, 59(2), 831–868. https://doi.org/10.1111/j.1540-6261.2004.00650.x
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez,
A. N., Kaiser, L., & Polosukhin, I. (2017). Attention is all you
need. Advances in Neural Information Processing Systems 30
(NeurIPS), 5998–6008.
Vaupel, J. W., Manton, K. G., & Stallard, E. (1979). The impact of
heterogeneity in individual frailty on the dynamics of mortality.
Demography, 16(3), 439–454. https://doi.org/10.2307/2061224
Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., &
Bengio, Y. (2018). Graph attention networks. International
Conference on Learning Representations (ICLR).
Vella, F. (1998). Estimating models with sample selection bias: A
survey. Journal of Human Resources, 33(1), 127–169. https://doi.org/10.2307/146317
Verbraken, T., Bravo, C., Weber, R., & Baesens, B. (2014).
Development and application of consumer credit scoring models using
profit-based classification measures. European Journal of
Operational Research, 238(2), 505–513. https://doi.org/10.1016/j.ejor.2014.04.001
Verbraken, T., Verbeke, W., & Baesens, B. (2013). A novel profit
maximizing metric for measuring classification performance of customer
churn prediction models. IEEE Transactions on Knowledge and Data
Engineering, 25(5), 961–973. https://doi.org/10.1109/TKDE.2012.50
Vilcassim, N. J., & Jain, D. C. (1991). Modeling purchase-timing and
brand-switching behavior incorporating explanatory variables and
unobserved heterogeneity. Journal of Marketing Research,
28(1), 29–41. https://doi.org/10.1177/002224379102800103
Villani, C. (2009). Optimal transport: Old and new. Grundlehren Der
Mathematischen Wissenschaften, 338. https://doi.org/10.1007/978-3-540-71050-9
Voigt, P., & Bussche, A. von dem. (2017). The EU
general data protection regulation (GDPR): A practical
guide. https://doi.org/10.1007/978-3-319-57959-7
Vovk, V., Gammerman, A., & Shafer, G. (2005). Algorithmic
learning in a random world. Springer. https://doi.org/10.1007/b106715
VPBank SMBC Finance Company Limited (FE Credit). (2023). Annual
report 2023. Ho Chi Minh City. https://fecredit.com.vn/
Vu, T., Nguyen, D. Q., Dras, M., Johnson, M., et al. (2018). VnCoreNLP:
A vietnamese natural language processing toolkit. Proceedings of the
2018 Conference of the North American Chapter of the Association for
Computational Linguistics: Demonstrations, 56–60.
Wachter, S., Mittelstadt, B., & Floridi, L. (2017a). Why a right to
explanation of automated decision-making does not exist in the general
data protection regulation. International Data Privacy Law,
7(2), 76–99. https://doi.org/10.1093/idpl/ipx005
Wachter, S., Mittelstadt, B., & Floridi, L. (2017b). Why a right to
explanation of automated decision-making does not exist in the general
data protection regulation. International Data Privacy Law,
7(2), 76–99. https://doi.org/10.1093/idpl/ipx005
Wachter, S., Mittelstadt, B., & Russell, C. (2018). Counterfactual
explanations without opening the black box: Automated decisions and the
GDPR. Harvard Journal of Law and Technology,
31(2), 841–887.
Wager, S., & Athey, S. (2018). Estimation and inference of
heterogeneous treatment effects using random forests. Journal of the
American Statistical Association, 113(523), 1228–1242. https://doi.org/10.1080/01621459.2017.1319839
Wager, S., Wang, S., & Liang, P. (2013). Dropout training as
adaptive regularization. Advances in Neural Information Processing
Systems, 26.
Wang, S., Shao, J., & Kim, J. K. (2014). An instrumental variable
approach for identification and estimation with nonignorable
nonresponse. Statistica Sinica, 24(3), 1097–1116. https://doi.org/10.5705/ss.2012.074
Wang, X., Wei, J., Schuurmans, D., Le, Q. V., Chi, E. H., Narang, S.,
Chowdhery, A., & Zhou, D. (2023). Self-consistency improves chain of
thought reasoning in language models. International Conference on
Learning Representations (ICLR).
Wedel, M., Kamakura, W. A., DeSarbo, W. S., & Ter Hofstede, F.
(1995). Implications for asymmetry, nonproportionality, and
heterogeneity in brand switching from piece-wise exponential mixture
hazard models. Journal of Marketing Research, 32(4),
457–462. https://doi.org/10.1177/002224379503200408
Wei, J., Wang, X., Schuurmans, D., Bosma, M., Ichter, B., Xia, F., Chi,
E. H., Le, Q. V., & Zhou, D. (2022). Chain-of-thought prompting
elicits reasoning in large language models. Advances in Neural
Information Processing Systems 35 (NeurIPS),
24824–24837.
Wei, Z., & Lin, M. (2017). Market mechanisms in online peer-to-peer
lending. Management Science, 63(12), 4236–4257. https://doi.org/10.1287/mnsc.2016.2531
Wen, R., Torkkola, K., Narayanaswamy, B., & Madeka, D. (2017). A
multi-horizon quantile recurrent forecaster. NeurIPS Time Series
Workshop. https://arxiv.org/abs/1711.11053
West, D. (2000). Neural network credit scoring models. Computers
& Operations Research, 27(11–12), 1131–1152. https://doi.org/10.1016/S0305-0548(99)00149-5
White, I. R., Royston, P., & Wood, A. M. (2011). Multiple imputation
using chained equations: Issues and guidance for practice.
Statistics in Medicine, 30(4), 377–399. https://doi.org/10.1002/sim.4067
Wiegreffe, S., & Pinter, Y. (2019). Attention is not not
explanation. Proceedings of EMNLP, 11–20. https://doi.org/10.18653/v1/D19-1002
Wilcoxon, F. (1945). Individual comparisons by ranking methods.
Biometrics Bulletin, 1(6), 80–83. https://doi.org/10.2307/3001968
Williams, C. K. I., & Seeger, M. (2000). Using the
Nyström method to speed up kernel machines.
Advances in Neural Information Processing Systems 13 (NIPS
2000).
Wilson, T. C. (1997a). Portfolio credit risk (i). Risk
Magazine, 10(9), 111–117.
Wilson, T. C. (1997b). Portfolio credit risk (II). Risk
Magazine, 10(10), 56–61.
Wolpert, D. H. (1992). Stacked generalization. Neural Networks,
5(2), 241–259. https://doi.org/10.1016/S0893-6080(05)80023-1
Woo, G., Liu, C., Kumar, A., Xiong, C., Savarese, S., & Sahoo, D.
(2024). Unified training of universal time series forecasting
transformers. Proceedings of the 41st International Conference on
Machine Learning (ICML), PMLR 235. https://proceedings.mlr.press/v235/woo24a.html
World Bank. (2022a). The global findex database 2021. World
Bank Group. https://www.worldbank.org/en/publication/globalfindex/Data
World Bank. (2022b). The global findex database 2021: Financial
inclusion, digital payments, and resilience in the age of
COVID-19. Washington, DC. https://www.worldbank.org/en/publication/globalfindex
World Bank. (2022c). Vietnam: Financial sector assessment.
World Bank Group. https://www.worldbank.org/en/country/vietnam
World Bank. (2023). Vietnam: Digital economy policy
note. World Bank. https://www.worldbank.org/en/country/vietnam
Wu, H., Hu, T., Liu, Y., Zhou, H., Wang, J., & Long, M. (2023).
TimesNet: Temporal 2D-variation modeling for general time
series analysis. Proceedings of the International Conference on
Learning Representations (ICLR). https://openreview.net/forum?id=ju_Uqw384Oq
Wu, H., Xu, J., Wang, J., & Long, M. (2021).
Autoformer: Decomposition transformers with
auto-correlation for long-term series forecasting. Advances in
Neural Information Processing Systems 34 (NeurIPS). https://arxiv.org/abs/2106.13008
Wu, S., Irsoy, O., Lu, S., Dabravolski, V., Dredze, M., Gehrmann, S.,
Kambadur, P., Rosenberg, D., & Mann, G. (2023).
BloombergGPT: A large language model for finance.
arXiv:2303.17564.
Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., & Yu, P. S. (2021).
A comprehensive survey on graph neural networks. IEEE Transactions
on Neural Networks and Learning Systems, 32(1), 4–24. https://doi.org/10.1109/TNNLS.2020.2978386
Wyner, A. J., Olson, M., Bleich, J., & Mease, D. (2017). Explaining
the success of AdaBoost and random forests as interpolating
classifiers. Journal of Machine Learning Research, 18,
1–33.
Xia, Y., Liu, C., Li, Y.-Y., & Liu, N. (2017). A boosted decision
tree approach using Bayesian hyper-parameter optimization
for credit scoring. Expert Systems with Applications,
78, 225–241. https://doi.org/10.1016/j.eswa.2017.02.017
Xu, K., Hu, W., Leskovec, J., & Jegelka, S. (2019). How powerful are
graph neural networks? International Conference on Learning
Representations (ICLR).
Xu, L., Skoularidou, M., Cuesta-Infante, A., & Veeramachaneni, K.
(2019). Modeling tabular data using conditional GAN.
Advances in Neural Information Processing Systems 32 (NeurIPS).
Yale Law Journal. (1979). Credit scoring and the ECOA:
Applying the effects test. The Yale Law Journal,
88(7), 1450–1486. https://doi.org/10.2307/795759
Yang, H., Liu, X.-Y., & Wang, C. D. (2023). FinGPT:
Open-source financial large language models. FinLLM Symposium at
IJCAI.
Yang, Q., Liu, Y., Chen, T., & Tong, Y. (2019). Federated machine
learning: Concept and applications. ACM Transactions on Intelligent
Systems and Technology, 10(2), 1–19. https://doi.org/10.1145/3298981
Yang, Y., & Land, K. C. (2008). Age-period-cohort analysis of
repeated cross-section surveys: Fixed or random effects?
Sociological Methods & Research, 36(3), 297–326.
https://doi.org/10.1177/0049124106292360
Yang, Y., Uy, M. C. S., & Huang, A. (2020). FinBERT: A
pretrained language model for financial communications. arXiv
Preprint.
Yeh, C.-K., Hsieh, C.-Y., Suggala, A. S., Inouye, D. I., &
Ravikumar, P. (2019). On the (in)fidelity and sensitivity of
explanations. Advances in Neural Information Processing Systems 32
(NeurIPS 2019).
Yeh, I.-C. (2016). Default of credit card clients. UCI Machine
Learning Repository. https://doi.org/10.24432/C55S3H
Yeh, I.-C., & Lien, C.-H. (2009). The comparisons of data mining
techniques for the predictive accuracy of probability of default of
credit card clients. Expert Systems with Applications,
36(2), 2473–2480. https://doi.org/10.1016/j.eswa.2007.12.020
Yin, W., Hay, J., & Roth, D. (2019). Benchmarking zero-shot text
classification: Datasets, evaluation and entailment approach.
Proceedings of the 2019 Conference on Empirical Methods in Natural
Language Processing (EMNLP), 3914–3923. https://doi.org/10.18653/v1/D19-1404
Ying, R., Bourgeois, D., You, J., Zitnik, M., & Leskovec, J. (2019).
GNNExplainer: Generating explanations for graph neural
networks. Advances in Neural Information Processing Systems 32
(NeurIPS).
Yoon, J., Jordon, J., & Schaar, M. van der. (2018).
GAIN: Missing data imputation using generative adversarial
nets. Proceedings of the 35th International Conference on Machine
Learning (ICML).
Young, H. P. (1985). Monotonic solutions of cooperative games.
International Journal of Game Theory, 14(2), 65–72. https://doi.org/10.1007/BF01769885
Yurdakul, B. (2018). Statistical properties of population stability
index [Master’s thesis]. Western Michigan University.
Zadrozny, B., & Elkan, C. (2002). Transforming classifier scores
into accurate multiclass probability estimates. 694–699. https://doi.org/10.1145/775047.775151
Zaharia, M., Chen, A., Davidson, A., Ghodsi, A., Hong, S. A., Konwinski,
A., Murching, S., Nykodym, T., Ogilvie, P., Parkhe, M., Xie, F., &
Zumar, C. (2018). Accelerating the machine learning lifecycle with
MLflow. IEEE Data Engineering Bulletin,
41, 39–45.
Zaharia, M., Das, T., Li, H., Hunter, T., Shenker, S., & Stoica, I.
(2013). Discretized streams: Fault-tolerant streaming computation at
scale. Proceedings of the 24th ACM Symposium on Operating Systems
Principles (SOSP), 423–438. https://doi.org/10.1145/2517349.2522737
Zaharia, M., Xin, R. S., Wendell, P., Das, T., Armbrust, M., Dave, A.,
Meng, X., Rosen, J., Venkataraman, S., Franklin, M. J., et al. (2016).
Apache Spark: A unified engine for big data processing.
Communications of the ACM, 59(11), 56–65. https://doi.org/10.1145/2934664
Zaharia, M., Xin, R. S., Wendell, P., Das, T., Armbrust, M., Dave, A.,
Meng, X., Rosen, J., Venkataraman, S., Franklin, M. J., Ghodsi, A.,
Gonzalez, J., Shenker, S., & Stoica, I. (2016). Apache
Spark: A unified engine for big data processing.
Communications of the ACM, 59(11), 56–65. https://doi.org/10.1145/2934664
Zeiler, M. D., & Fergus, R. (2014). Visualizing and understanding
convolutional networks. European Conference on Computer Vision
(ECCV), 818–833. https://doi.org/10.1007/978-3-319-10590-1\_53
Zemel, R., Wu, Y., Swersky, K., Pitassi, T., & Dwork, C. (2013).
Learning fair representations. Proceedings of the 30th International
Conference on Machine Learning (ICML 2013), 325–333.
Zeng, H., Zhou, H., Srivastava, A., Kannan, R., & Prasanna, V.
(2020). GraphSAINT: Graph sampling based inductive learning
method. International Conference on Learning Representations
(ICLR).
Zhang, B. H., Lemoine, B., & Mitchell, M. (2018). Mitigating
unwanted biases with adversarial learning. Proceedings of the 2018
AAAI/ACM Conference on AI, Ethics, and Society (AIES), 335–340. https://doi.org/10.1145/3278721.3278779
Zhang, T., & Yu, B. (2005). Boosting with early stopping:
Convergence and consistency. The Annals of Statistics,
33(4), 1538–1579. https://doi.org/10.1214/009053605000000255
Zheng, M., & Klein, J. P. (1995). Estimates of marginal survival for
dependent competing risks based on an assumed copula.
Biometrika, 82(1), 127–138. https://doi.org/10.1093/biomet/82.1.127
Zhou, H., Zhang, S., Peng, J., Zhang, S., Li, J., Xiong, H., &
Zhang, W. (2021). Informer: Beyond efficient transformer
for long sequence time-series forecasting. Proceedings of the AAAI
Conference on Artificial Intelligence, 35(12),
11106–11115. https://doi.org/10.1609/aaai.v35i12.17325
Zhu, X., & Goldberg, A. B. (2009). Introduction to semi-supervised
learning. Synthesis Lectures on Artificial Intelligence and Machine
Learning, 3(1), 1–130. https://doi.org/10.2200/S00196ED1V01Y200906AIM006
Zmijewski, M. E. (1984). Methodological issues related to the estimation
of financial distress prediction models. Journal of Accounting
Research, 22, 59–82. https://doi.org/10.2307/2490859
Zou, H., & Hastie, T. (2005). Regularization and variable selection
via the elastic net. Journal of the Royal Statistical Society.
Series B (Statistical Methodology), 67(2), 301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x