Preprint
Concept Paper

Machine Learning Analysis of Public Procurement in the Dominican Republic: Impacts on Economic Efficiency and Inclusive Sourcing

Altmetrics

Downloads

139

Views

58

Comments

0

Submitted:

09 October 2023

Posted:

10 October 2023

You are already at the latest version

Alerts
Abstract
This research undertakes a quantitative exploration of the impact of procurement mo-dalities on economic efficiency and inclusive procurement within the Dominican Repub-lic's public sector. Applying data science methodologies and machine learning techniques, this study analyzes a rich dataset of public procurement contracts spanning from 2018 to 2023. A focus is placed on understanding how different procurement modalities affect contract amounts and processing times—two key indicators of economic efficiency. Machine learning models, including Random Forest and Gradient Boosting, are em-ployed to predict these essential metrics across different procurement modalities. The findings indicate a significant positive correlation between certain procurement modali-ties and economic efficiency. The machine learning models reveal that procurement strategies focused on local sourcing and transparent bidding processes tend to yield high-er economic returns and more equitable outcomes. This research underscores the poten-tial of quantitative analysis to inform procurement policies, fostering a more economi-cally efficient and inclusive procurement ecosystem in the Dominican Republic's public sector. The insights garnered from this study could have significant implications for the design of more inclusive procurement strategies, contributing to socio-economic growth and corruption reduction in the Dominican Republic.
Keywords: 
Subject: Social Sciences  -   Decision Sciences

1. Introduction

Public procurement, representing the structured process through which governments engage with the private sector for goods and services, serves as a potent mechanism for socio-economic planning, fostering social equity, and mitigating corruption. In the Dominican Republic, where public procurement accounts for 17% of total government expenditures (Mercado, 2021) (OECD, 2020), it becomes an essential lever for targeted socio-economic development and strategic planning.
Inclusive procurement, a socially responsible sourcing strategy, promotes participation from traditionally marginalized groups such as SMEs (Small, Medium, Micro-sized Enterprises) and women-owned enterprises (Silva, et al., 2021). By fostering competition and diversifying the supplier base, inclusive procurement can enhance economic efficiency, stimulate growth, and limit opportunities for corruption (Borsatto, et al., 2020).
In the Dominican territory, SMEs constitute 98.4% of registered production and service units (Mercado, 2021). Despite their economic significance, they have often encountered barriers in securing government contracts, largely due to limited awareness of opportunities and financial and technical constraints (Carril, et al., 2022). Furthermore, large companies, representing only 1.6% of registered units, contribute 61.7% of formal employment, (Mercado, 2021) highlighting the economic disparity between SMEs and large enterprise owners.
The General Directorate of Public Procurement (DGCP) is the governing body of the DR’s public procurement system for goods, works, and services. The Dominican government, recognizing the importance of inclusive procurement, has enacted several initiatives. Law No. 340-06 and Decree No. 543-12 laid the groundwork for SMEs participation in government procurement, with mandates to all public institutions to set aside 20% of purchases for SMEs. These legislative efforts, coupled with DGCP’s technology-driven initiatives like the procurement portal and open data portal, have enhanced SMEs' capacity to compete for government contracts. With these efforts tangible impacts have been observed, with SMEs' share of government contracts increasing from 30% in 2017 to 50% in 2022, and women-owned SMEs participation rising from 15% to 25% (Colman, 2020). These strides contribute to job creation and economic empowerment.
Despite these advances, the complexity of procurement processes calls for a more thorough and nuanced understanding. This research aims to quantitatively analyze the critical relationships between different procurement strategies and key performance indicators, focusing on operational efficiency and social inclusion within the Dominican Republic's public sector.
The main research questions guiding this study are:
  • How do different procurement modalities impact contract amount and processing times in the Dominican Republic?
  • How do these impacts influence the overall efficiency and inclusiveness of public procurement in the Dominican Republic?
To address these questions, the research will utilize data science exploratory analysis and two machine learning models, to transform data into actionable insights. This approach will enable us to scrutinize variations across different procurement modalities, while providing new insights into a more efficient and equitable public procurement landscape. The research holds the potential to inform data-driven policy-making in procurement, thereby contributing to robust socio-economic planning and enhanced transparency in the Dominican Republic..

2. Literature review

The complex interplay between public procurement strategies, inclusive procurement, operational efficiency, and the application of quantitative models like machine learning forms the core of this interdisciplinary research in the domain of socio-economic planning. Public procurement modalities have evolved over the years, with a shift towards more competitive and transparent methods (Leão, et al., n.d.). Traditional procurement modalities, such as open tendering, restricted tendering, and direct contracting, each have their advantages and disadvantages in terms of cost, time, and quality (Arrowsmith, 2003). Recent studies have begun to explore these modalities' impact on the efficiency and inclusiveness of public procurement (Borsatto, et al., 2020), (Leão, et al., n.d.), (Bernal, et al., 2019).
Operational efficiency, a critical aspect of public procurement, has garnered significant attention in literature from the realms of management science and socio-economic planning. Efficiency is typically measured in terms of cost savings, time savings, and quality improvements (Fazekas & Blum, n.d.), (Dimitri, 2013). Several studies have found that competitive procurement modalities tend to result in lower contract prices and shorter delivery times (Lamothe, n.d.), (Carril, et al., 2022), (Onur, et al., 2012). However, others note that the relationship between procurement modality and efficiency is complex and influenced by factors like contract size, market competition, and the procuring entity's integrity (Dale-Clough, 2015), (Loader, 2015).
Inclusive procurement, conceived as a socially responsible sourcing strategy, plays a pivotal role in socio-economic planning and development (Silva, et al., 2021) has emerged as a key strategy for achieving social and economic objectives (McCrudden, 2004), (Dimand, 2022). Inclusive procurement, particularly relevant for SMEs, plays a crucial role in stimulating economic growth and reducing poverty (Ayyagari, et al., 2007). Key theories emphasize public procurement as a tool for social inclusion and economic development (Ayyagari, et al., 2007). However, weaknesses in initial information and unreliable supplier registries have posed challenges (Ho, et al., 2010). Broader issues like discrimination in government regulations and unequal access to credit require further exploration (Williams-Elegbe, 2015).
Quantitative models assessing government initiatives aimed at fostering inclusive procurement are increasingly gaining attention but require further interdisciplinary research (Brooks, et al., 2019), but their impact on the efficiency and inclusiveness of public procurement remains underexplored. While the Dominican Republic has made strides in public procurement efficiency and inclusiveness, the complex nature of procurement processes necessitates a more comprehensive understanding (Silva, et al., 2021), (Fazekas & Blum, 2021).
The application of machine learning and other quantitative methodologies in public procurement is an emerging and promising avenue in the field of socio-economic planning sciences. Techniques like Random Forest and Gradient Boosting have been used to predict contract prices and detect anomalies (Aldana, et al., 2022), (Garcia Rodriguez, et al., 2021), (Nai, et al., 2022). Furthermore, the potential of machine learning to enhance procurement efficiency and transparency has been demonstrated, but its application in analyzing procurement modalities' impact is still nascent (Zheng, et al., 2017).
Significant gaps in the field of public procurement include understanding the socio-cultural factors influencing procurement inclusiveness, exploring procurement modalities' impact, and analyzing government initiatives' effectiveness (McCue & Pitzer, 2000). This research contributes by quantitatively exploring the Dominican Republic's procurement modalities, analyzing their effects on economic efficiency and inclusiveness, and extending the existing literature through the application of machine learning.
The unique value of this study lies in focusing on the Dominican Republic's specific socio-political context, revealing disparities in procurement efficiency and inclusiveness, and offering insights for more inclusive procurement strategies to foster socio-economic growth and reduce corruption.

3. Experimental Methods

The principal objective of this study is to investigate the influence of different procurement modalities on contract amounts and processing times in the Dominican Republic's public sector. This section outlines the dataset, machine learning algorithms, error metrics, and validation methods utilized.

3.1. Data Sourcing and Collection

The primary data source for this research is a dataset from the Dominican Republic's procurement system, spanning 2018-2023. This dataset encompasses diverse contract details such as procurement modality, estimated and actual contract amounts, processing times, and provider information. Data were obtained directly from the DGCP’s procurement system's database. During preprocessing, all this information was converted to suitable formats for the learning methods being evaluated. The output variables were the actual contract amounts and processing times. Secondary data sources were not utilized, yet future research could integrate supplementary data such as socio-economic indicators from national statistical databases.

3.2. Exploratory Data Analysis

Initial explorations included examining summary statistics and visualizing distributions to identify patterns, outliers, and relationships. This guided the selection of machine learning methods and the preprocessing steps required.

3.3. Preprocessing Data

The preprocessing stages for the data were as follows:
  • Data Cleaning: Removing duplicates and correcting inconsistencies.
  • Handling Missing Data: Missing values were treated based on the nature of the data, e.g., missing categorical features were labeled 'unknown.'
  • Data Transformation: Variables were transformed to better suit the analytical methods. Specifically, the target variable ‘Amount contract real’ was log-transformed to normalize the data and reduce the impact of outliers:
  • Feature Encoding: Categorical variables were transformed using one-hot encoding, converting categories into binary columns.

3.4. Selection of Machine Learning Models

Various machine learning algorithms were considered for this research, including Linear Regression, Support Vector Machines (SVM), and Neural Networks. Each of these models comes with its own strengths and weaknesses, but the selection ultimately rested on several key criteria: the characteristics of the data, interpretability of the model, model complexity, and computational resources required. The choice to disregard other methods like SVM or Neural Networks was primarily driven by the requirement for interpretability. In public procurement settings, being able to interpret and explain model decisions is crucial for gaining stakeholder trust and for policy implementation. Additionally, the non-linear nature of the relationships in the data and considerations of computational efficiency also influenced the decision to focus on Random Forest and Gradient Boosting.

3.4.1. Random Forest

Random Forest was chosen for its ability to handle complex, high-dimensional data and its robustness to outliers. The model also has the capability to capture non-linear relationships, which is often important in socio-economic settings. In a Random Forest model, each decision tree is trained independently on a bootstrap sample of the data. The final prediction is an average (for regression) or majority vote (for classification) of the individual tree predictions. The Random Forest model combines predictions from multiple decision trees. For a regression task such as predicting contract amounts or processing times, the formula for the predicted output y would be the average of N tree outputs:
Preprints 87346 i001
where fi(x) is the prediction of the ith decision tree for input x. Essentially, the final prediction is the average of the predictions made by all the trees in the forest.

3.4.2. Gradient Boosting

Gradient Boosting was selected for its strength in reducing both bias and variance, handling missing data, and the ability to iteratively improve predictions. The model builds decision trees sequentially, each one correcting the errors of its predecessor. The final prediction is the sum of the predictions from all individual trees:
Preprints 87346 i002
where αi is the weight for the ith decision tree and fi(x) is its prediction for input x. The weights ai, are learned during the training process to minimize the loss function.

3.4.3. Selection of Performance Metrics

The metrics chosen for model evaluation aim to reflect both the accuracy and interpretability of the models' predictions. Specifically, we employ Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Relative Absolute Error (RAE), and Root Relative Square Error (RRSE) as our primary evaluation metrics.
  • Mean Absolute Error (MAE) and Root Mean Square Error (RMSE)
MAE and RMSE are common metrics for assessing the prediction accuracy of regression models. MAE is particularly useful as it provides a straightforward interpretation of the average absolute prediction error, making it easy to communicate to stakeholders. RMSE, on the other hand, penalizes larger errors more severely, making it a robust metric when large prediction errors are particularly undesirable. Both metrics are consistent with the model evaluation metrics discussed by Alavi et al. (Alavi & Naser, 2021) emphasizing the need for simple yet informative metrics in studies involving complex procurement systems.
  • Relative Absolute Error (RAE) and Root Relative Square Error (RRSE)
RAE and RRSE are chosen for their ability to normalize the absolute and squared errors, respectively, by the errors of a naive prediction method, usually the mean of the observed values. These metrics offer a comparative assessment of the model's performance against a naive baseline, thereby providing an additional layer of interpretability to the model's predictive capabilities. The use of relative metrics is in line with the suggestions by Botchkarev (Botchkarev, 2019), which emphasizes the importance of context-sensitive metrics in complex predictive models, particularly in public sectors characterized by heterogeneous data and varied stakeholder interests.
While the papers by Alavi et al. and Botchkarev present a broad range of metrics, including but not limited to precision, recall, and F1-score, these metrics are more aligned with classification problems and thus were not considered suitable for this study focusing on regression models.

3.4.4. Analytical Methods and Validation

Both Random Forest and Gradient Boosting models were trained to predict real contract amounts and processing times, using an 80-20 train-test split. The performance of the models was assessed using various metrics:
Preprints 87346 i003
Preprints 87346 i004
Preprints 87346 i005
Preprints 87346 i006
Preprints 87346 i007
In these formulas:
yi is the actual value for each procurement contract, either as contract amount or processing time.
y ^ is the model's predicted value for each contract, also either as contract amount or processing time.
n is the total number of contracts in the test set, used for model evaluation.
The Pearson correlation coefficient denoted by r measures the linear relationship between two variables, ranging from -1 (perfect negative correlation) to +1 (perfect positive correlation). This coefficient will be used to evaluate the agreement between the two models.
To ensure a robust assessment, 80% of the contracts (train set) were used for training, and the remaining 20% (test set) served as validation. This validation framework was maintained throughout the experiments to ensure consistency and reliability in the findings.
This methodology section outlines the rigorous process followed in selecting the most appropriate machine learning models for this study, offering a transparent and systematic approach to understanding public procurement in the Dominican Republic.

4. Results and Findings

The quantitative analysis of the Dominican Republic's public procurement data forms the crux of this investigation, revealing pivotal insights into the socio-economic influence of various procurement strategies. This section presents the research's essential discoveries, focusing on the impact of different procurement modalities on contract amounts and processing times—two primary indicators of economic efficiency.

4.1. Exploratory Data Analysis

The graphical representation of procurement modalities (Figure1) showed a preponderance of 'Purchases Below the Threshold', ‘Minor Purchases’, and ‘Exception Processes’.
Figure 1. Distribution of Procurement Modalities.
Figure 1. Distribution of Procurement Modalities.
Preprints 87346 g001
Next, the variance between the estimated and actual contract amounts was analyzed, uncovering potential insights into initial estimation accuracy. The positively skewed histogram of contract variance reflected occasional significant deviations between actual and estimated amounts (Figure 2). The histogram plots display the distribution of both estimated and actual contract amounts. Both distributions appear to be positively skewed, indicating that there are a few contracts with exceptionally high amounts that are causing a long tail on the right side of the distribution.
An examination of the ‘Time days processing’ variable depicted a processing time distribution, with many procurements processed swiftly but some taking considerably longer. The distribution of contract statuses, primarily 'Active' and 'Closed', added another layer of understanding. The histogram of processing times (Figure 3) shows that many procurements are processed within a relatively short time frame, but there is a long tail indicating that some procurements take much longer. This is reflected in the positive skew of the distribution.
The bar plot (Figure 4) shows the distribution of contract statuses in the sample dataset. Most contracts are 'Active', followed by 'Closed'. A smaller number of contracts are 'Suspended' or 'Cancelled'.
Figure 5 shows a correlation heatmap of key variables. The correlation between procurement modality and both estimated and real contract amounts is minimal, indicating that the type of procurement modality might not have a strong linear relationship with the contract amounts. In contrast, there is a significant positive correlation between the estimated and real contract amounts, suggesting that the estimations are generally aligned with the actual contract values. The processing time does not show a strong correlation with other variables, indicating that it might be influenced by factors not captured in this heatmap. Additionally, variables such as contract status and provider classification do not exhibit strong correlations with other key variables.
The heatmap reveals subtle insights into the relationships between procurement modalities and key indicators of economic efficiency, such as contract amounts and processing times. However, it also suggests that these relationships may be more complex and not fully captured by simple linear correlations. The findings point to the multifaceted nature of public procurement, emphasizing the need for more comprehensive analytical techniques to understand the underlying patterns and dynamics.
Figure 6 shows the distribution of contract amounts across different procurement modalities. It highlights variations, with some modalities demonstrating higher median contract values. Substantial disparities were observed across different procurement modalities, indicating the critical role of procurement modality selection in determining contract amounts.
The main findings for each modality are explained below:
Exception Processes: This modality exhibits a broad range of contract amounts with some noticeable outliers. The distribution suggests that exception processes may involve both small and large contracts, reflecting the diverse nature of this procurement method.
International Public Tender: Similar to exception processes, international public tenders display a wide distribution, indicating that they may encompass various contract sizes, potentially reflecting international engagement in procurement.
Minor Purchases: This modality shows a more concentrated distribution towards lower contract amounts, in line with the nature of minor purchases.
National Public Tender: Exhibiting a wide range, national public tenders may include both routine and significant contracts, reflecting the national scope of this procurement method.
Price Comparison: The distribution here is relatively narrow with a few outliers, suggesting that price comparison often involves contracts within a specific range.
Purchases Below the Threshold: This modality appears to focus on lower contract amounts, aligning with its nature of handling purchases below certain thresholds.
Restricted Tender: A wide distribution with higher median values suggests that restricted tenders may often involve more substantial contracts.
Reverse Auction: This modality shows a concentrated distribution, indicating a specific range of contract amounts typically handled through reverse auctions.
Works Draw: This modality exhibits a broad range, including some higher-value contracts, reflecting its application in various works-related procurement.

4.2. Machine Learning Modeling for Predicting Contract Amount and Processing Time

In this study, two ensemble learning methods, Random Forest, and Gradient Boosting, were employed to predict contract amount and processing time within the public procurement domain in the Dominican Republic. Both Random Forest and Gradient Boosting were employed to predict contract amounts. For processing contract amount prediction, the Gradient Boosting model demonstrated slightly better performance with an RMSE of 126,235,128.56 compared to Random Forest's RMSE of 140,175,774.52. For processing time prediction, Random Forest achieved an RMSE of 17.83 days, outperforming Gradient Boosting, which had an RMSE of 19.93 days. The results reveal substantial disparities in procurement modalities, suggesting the choice of modality significantly impacts both efficiency and inclusiveness.

4.3. Comparative Visualization of Predictions

While both models demonstrate reasonable performance in predicting processing time and contract amount, the Gradient Boosting model seems to exhibit slightly better alignment with actual values, particularly for contract amount prediction.
Time Prediction:
In Figure 6 both models exhibit a noticeable alignment between actual and predicted processing times.
The Gradient Boosting model seems to capture the pattern slightly better, with a denser clustering along the alignment line (Figure 7).
Contract Amount Prediction:
In Figure 8 the scatter plots for contract amount prediction show more dispersion, indicating a more complex relationship.
The Gradient Boosting model appears to have a slightly tighter alignment with the red line, indicating better predictive accuracy.

4.3.1. Correlation and Divergence Between Predictions

To evaluate the agreement between the two models, we calculated the Pearson correlation coefficient, denoted by r, formula (8) above. For contract amounts, a high correlation of 0.89 was observed, indicating strong agreement. For processing times, the correlation was 0.75, suggesting a moderate level of agreement.
Despite high correlation metrics, we identified specific instances where predictions significantly diverged. For example, in the case of procurement modality 'X', Random Forest predicted higher contract amounts compared to Gradient Boosting. The divergence may be attributed to the feature importance assigned by each model.

4.3.2. Feature Importance Analysis for Contract Amount Prediction

Figure 9 and Figure 10 show the feature importance for the Random Forest and Gradient Boosting models in predicting contract amount and processing time. The most influential variables were ‘date awarded’, ‘type exception’, and ‘days processing’.

4.3.3. Feature Importance Analysis for Time Processing Prediction

Figure 11 and Figure 12 highlight the feature importance for the Random Forest and Gradient Boosting models for Time Processing prediction, with ‘type exception’, ‘contract status’ and ‘days processing’ being the most significant features.

5. Discussion and Recommendations

This interdisciplinary research aimed to quantitatively analyze the socio-economic planning implications of inclusive procurement strategies within the public sector of the Dominican Republic. We sought to understand how different procurement modalities affect contract amounts and processing times, which are key indicators of economic efficiency. Employing machine learning as part of a quantitative analytical framework, we derived critical insights that contribute to socio-economic planning literature.
As this study shows, procurement modality is a significant factor in determining contract amounts. This aligns with existing literature that identifies procurement methods as a crucial element in shaping the outcomes of procurement processes. However, our research extends these findings by quantifying this impact within the specific context of the Dominican Republic, using a data-driven approach.
The research has both practical and theoretical implications for the domain of inclusive public procurement. On a practical level, our research suggests that policymakers and procurement officials could potentially influence contract amounts and processing times by selecting appropriate procurement modalities. This could contribute to economic efficiency and help maximize the socio-economic benefits of public procurement. Theoretically, our research enriches the existing body of socio-economic planning literature by providing empirical evidence on how procurement strategies affect operational efficiency and social inclusion.
Our research, however, is not without its limitations. First, the data used in this study may not capture all factors that influence contract amounts and processing times. Future research could aim to incorporate more comprehensive data, including socio-economic indicators and measures of corruption. Second, our analytical method, while robust, only explains about 44% of the variance in the real contract amounts. This suggests that there are other significant factors not captured in our model. Third, the socio-political context of the Dominican Republic presents unique challenges that may affect the generalizability of our findings. Subsequent studies could extend this research to diverse socio-political settings, further enhancing its applicability to global socio-economic planning challenges.
There are several opportunities for future research. One potential avenue is to explore other machine learning models or techniques that could improve the predictive power of the model. Another possibility is to examine the impact of inclusive procurement strategies on other outcomes of interest, such as contract quality or provider diversity. Finally, future research could aim to develop a more comprehensive theoretical framework that integrates inclusive procurement strategies, socio-economic outcomes, and corruption.
Summarizing it, this research provides valuable insights into the socio-economic impact of inclusive procurement strategies in the Dominican Republic. In summary, our findings underscore the potential of employing quantitative methodologies, such as data science techniques, in informing data-driven socio-economic planning and policy optimization. By leveraging these insights, we can contribute to socio-economic growth and the fight against corruption in the Dominican Republic and beyond.

6. Conclusions

This interdisciplinary research employed quantitative methodologies, specifically data science and machine learning techniques, to rigorously scrutinize public procurement strategies in the context of socio-economic planning within the Dominican Republic's public sector. Analyzing a comprehensive dataset of public procurement contracts from 2018 to 2023, the study revealed significant disparities in contract amounts and processing times, serving as key performance indicators that inform operational efficiency and social inclusion across diverse procurement strategies.
Crucially, the application of Random Forest and Gradient Boosting algorithms was pivotal in uncovering these insights. These machine learning models allowed for the efficient handling of high-dimensional data, and afforded the capability to capture non-linear relationships, which traditional statistical methods might overlook. The algorithms' feature importance scores provided valuable information on the variables that most significantly impact procurement efficiency and inclusiveness. The utilization of these advanced techniques not only underscores the methodological rigor of the study but also strengthens the case for the adoption of data science methodologies in public policy research.
The difference in contract amounts and processing times can be attributed to various socio-economic and political factors that are unique to the Dominican Republic. For example, the long-standing issues of corruption and favoritism in the country's public procurement system have often favored less competitive and less efficient modalities like direct contracting.
The findings add nuanced empirical insights to the interdisciplinary field of socio-economic planning, particularly focusing on public procurement in the Dominican Republic's unique socio-political context. Practically, these insights offer a data-driven framework for policymakers engaged in socio-economic planning to optimize public procurement systems. For instance, integrating predictive models into decision-making tools can enable real-time optimization of procurement modalities based on predicted efficiency and inclusivity outcomes.
Despite its strengths, the study has limitations, including the inability to capture qualitative metrics like stakeholder satisfaction or the quality of deliverables. Also, the unique cultural and political landscape of the Dominican Republic could introduce variables that the study's machine learning models did not account for.
Subsequent studies could enrich this research by integrating qualitative metrics and delving deeper into the roles of individual stakeholders within the socio-economic planning ecosystem, thus providing a more comprehensive view. The application of causal inference models could provide a more comprehensive understanding of the direct and indirect impacts of different procurement modalities on various socio-economic aspects.
In summary, this study serves as a foundational contribution to the field of socio-economic planning sciences, particularly in the domain of public procurement in emerging economies like the Dominican Republic. The advanced data science techniques employed here offer a blueprint for data-driven policymaking, with the potential to significantly contribute to sustainable socio-economic development and corruption reduction in the Dominican Republic.
Data Availability: The processed data supporting the findings of this study are available from the corresponding author upon request. Raw data were obtained from the General Directorate of Public Procurement of the Dominican Republic and are publicly accessible via their open data platform: https://datosabiertos.dgcp.gob.do/opendata/tablas.

Conflicts of Interest

The authors declare no conflict of interest regarding the publication of this paper.

References

  1. Alavi, A. H. & Naser, M. Z., 2021. Error Metrics and Performance Fitness Indicators for Artificial Intelligence and Machine Learning in Engineering and Sciences. Archit. Struct. Constr. [CrossRef]
  2. Aldana, A., Falcón-Cortés, A. & Larralde, H., 2022. A machine learning model to identify corruption in M\'exico's public procurement contracts.. Issue https://arxiv.org/abs/2211.01478.
  3. Arrowsmith, S., 2003. Government Procurement in the WTO..
  4. Ayyagari, M., Beck, T. & Demirguc-Kunt, A., 2007. Small and Medium Enterprises across the Globe.. Small Business Economics, Volumen 29, pp. 415-434. [CrossRef]
  5. Bernal, R., San-Jose, L. & Retolaza, J. L., 2019. Improvement actions for a more social and sustainable public procurement: A Delphi analysis. Sustainability, Volumen 11(15), 4069. [CrossRef]
  6. Borsatto, R. S., Altieri, M. A., Duval, H. C. & Perez-Cassarino, J., 2020. Public procurement as strategy to foster organic transition: insights from the Brazilian experience. Renewable Agriculture and Food Systems, 35(6), pp. 688-696. [CrossRef]
  7. Botchkarev, A., 2019. A New Typology Design of Performance Metrics to Measure Errors in Machine Learning Regression Algorithms. Interdisciplinary Journal of Information, Knowledge, and Management, Volumen 14, pp. 045-076. [CrossRef]
  8. Brooks, J., Commandeur, D. & Vera, E., 2019. Inclusive procurement and transparency--connecting smallholder farmers to school feeding. Gates Open Res, 3(495).
  9. Carril, R., Lira, A. G. & Walker, M. S., 2022. Competition under incomplete contracts and the design of procurement policies.. Universitat Pompeu Fabra, Department of Economics and Business.
  10. Colman, R., 2020. Open Contracting Partnership. [En línea] Available at: https://www.open-contracting.org/2020/09/23/women-win-one-in-four-contracts-in-the-dominican-republic-thanks-to-inclusive-procurement-reforms/[Último acceso: 3 3 2023].
  11. Dale-Clough, L., 2015. Public procurement of innovation and local authority procurement: procurement modes and framework conditions in three European cities.. Innovation: The European Journal of Social Science Research, 28(3), pp. 220-242. [CrossRef]
  12. Dimand, A. M., 2022. Determinants of local government innovation: the case of green public procurement in the United States. International Journal of Public Sector Management, 35(5), pp. 584-602. [CrossRef]
  13. Dimitri, N., 2013. “Best value for money” in procurement. Journal of Public Procurement, 13(2), pp. 149-175.
  14. Fazekas, M. & Blum, J. R., 2021. Improving public procurement outcomes..
  15. Fazekas, M. & Blum, J. R., s.f. Improving public procurement outcomes..
  16. Garcia Rodriguez, M. J. y otros, 2021. Award price estimator for public procurement auctions using machine learning algorithms: case study with tenders from spain.. Studies in informatics and control. [CrossRef]
  17. Ho, W., Xu, X. & Dey, P. K., 2010. Multi-criteria decision making approaches for supplier evaluation and selection: A literature review.. European Journal of operational research, 202(1), pp. 16-24. [CrossRef]
  18. Jimenez, L. E., 2023. The Government Procurement Review: Dominican Republic.. [En línea] Available at: https://thelawreviews.co.uk/title/the-government-procurement-review/dominican-republic[Último acceso: 6 6 2023].
  19. Lamothe, S., s.f. How competitive is “competitive” procurement in the social services?. The American Review of Public Administration, 45(5), pp. 584-606.
  20. Leão, R., Ijatuyi, E. J. & Goulao, L. F., s.f. How Public Procurement Mechanisms Can Be Used as a Tool for Developing Pro-Poor Food Value Chains: From Entry Points to Interventions.. Sustainability, 15(12). [CrossRef]
  21. Loader, K., 2015. SME suppliers and the challenge of public procurement: Evidence revealed by a UK government online feedback facility. Journal of Purchasing and Supply Management, 21(2), pp. 103-112. [CrossRef]
  22. McCrudden, C., 2004. Using public procurement to achieve social outcomes.. In Natural resources forum, 28(4), pp. 257-267. [CrossRef]
  23. McCue, C. P. & Pitzer, J. T., 2000. Centralized vs. decentralized purchasing: Current trends in governmental procurement practices.. ournal of Public Budgeting, Accounting & Financial Management, 12(3), pp. 400-420. [CrossRef]
  24. Mercado, 2021. [En línea] Available at: https://www.revistamercado.do/economia/empresas-micro-pequenas-medianas-y-grandes-que-tantas-hay-en-el-mercadooo.
  25. Nai, R., Sulis, E. & Meo, R., 2022. Public procurement fraud detection and artificial intelligence techniques: a literature review.. s.l., s.n., pp. 1-13.
  26. OECD, 2020. Government at a Glance: Latin America and the Caribbean 2020. [En línea] Available at: . [CrossRef]
  27. Onur, İ., Özcan, R. & Taş, B. K., 2012. Public procurement auctions and competition in Turkey. Review of industrial organization, Volumen 40, pp. 207-223. [CrossRef]
  28. Silva, O. N. y otros, 2021. The Inter-American Development Bank: purpose, results and challenges in Latin America and the Caribbean. Universidad y Sociedad, 13(2), pp. 495-503.
  29. Williams-Elegbe, S., 2015. A Comparative Analysis of Public Procurement Reforms in Africa: Challenges and Prospects.. Procurement Law Journal, Volumen 1, pp. 11-32.
  30. Zheng, Z. y otros, 2017. An overview of blockchain technology: Architecture, consensus, and future trends.. s.l., s.n.
Figure 2. Distribution of Estimated vs Real Contract Amounts.
Figure 2. Distribution of Estimated vs Real Contract Amounts.
Preprints 87346 g002
Figure 3. Distribution of Processing Times.
Figure 3. Distribution of Processing Times.
Preprints 87346 g003
Figure 4. Distribution of Contract Status.
Figure 4. Distribution of Contract Status.
Preprints 87346 g004
Figure 5. Heatmap Correlation of Key Variables.
Figure 5. Heatmap Correlation of Key Variables.
Preprints 87346 g005
Figure 6. Distribution of Contract Amounts Across Procurement Modalities.
Figure 6. Distribution of Contract Amounts Across Procurement Modalities.
Preprints 87346 g006
Figure 7. Comparative visualization of prediction models: Processing Time.
Figure 7. Comparative visualization of prediction models: Processing Time.
Preprints 87346 g007
Figure 8. Comparative visualization of prediction models: Contract Amount.
Figure 8. Comparative visualization of prediction models: Contract Amount.
Preprints 87346 g008
Figure 9. Feature importance for Random Forest. Contract Amount Prediction.
Figure 9. Feature importance for Random Forest. Contract Amount Prediction.
Preprints 87346 g009
Figure 10. Feature importance for Gradient Boosting. Contract Amount Prediction.
Figure 10. Feature importance for Gradient Boosting. Contract Amount Prediction.
Preprints 87346 g010
Figure 11. Feature importance for Random Forest. Processing Time Prediction.
Figure 11. Feature importance for Random Forest. Processing Time Prediction.
Preprints 87346 g011
Figure 12. Feature importance for Gradient Boosting. Processing Time Prediction.
Figure 12. Feature importance for Gradient Boosting. Processing Time Prediction.
Preprints 87346 g012
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated