During this time, Apple was struggling but ultimately did not default. Our ROC and PR curves will be something like this: Code for predictions and model evaluation on the test set is: The final piece of our puzzle is creating a simple, easy-to-use, and implement credit risk scorecard that can be used by any layperson to calculate an individuals credit score given certain required information about him and his credit history. Therefore, the investor can figure out the markets expectation on Greek government bonds defaulting. I need to get the answer in python code. The Probability of Default (PD) is one of the important quantities to quantify credit risk. Of course, you can modify it to include more lists. The coefficients estimated are actually the logarithmic odds ratios and cannot be interpreted directly as probabilities. By clicking Accept all cookies, you agree Stack Exchange can store cookies on your device and disclose information in accordance with our Cookie Policy. In this post, I intruduce the calculation measures of default banking. ], dtype=float32) User friendly (label encoder) For example "two elements from list b" are you wanting the calculation (5/15)*(4/14)? The results are quite interesting given their ability to incorporate public market opinions into a default forecast. It must be done using: Random Forest, Logistic Regression. The calibration module allows you to better calibrate the probabilities of a given model, or to add support for probability prediction. Evaluating the PD of a firm is the initial step while surveying the credit exposure and potential misfortunes faced by a firm. Duress at instant speed in response to Counterspell. Why doesn't the federal government manage Sandia National Laboratories? Dealing with hard questions during a software developer interview. history 4 of 4. Refer to the data dictionary for further details on each column. A Medium publication sharing concepts, ideas and codes. To subscribe to this RSS feed, copy and paste this URL into your RSS reader. Structural models look at a borrowers ability to pay based on market data such as equity prices, market and book values of asset and liabilities, as well as the volatility of these variables, and hence are used predominantly to predict the probability of default of companies and countries, most applicable within the areas of commercial and industrial banking. The p-values, in ascending order, from our Chi-squared test on the categorical features are as below: For the sake of simplicity, we will only retain the top four features and drop the rest. The data show whether each loan had defaulted or not (0 for no default, and 1 for default), as well as the specifics of each loan applicants age, education level (15 indicating university degree, high school, illiterate, basic, and professional course), years with current employer, and so forth. VALOORES BI & AI is an open Analytics platform that spans all aspects of the Analytics life cycle, from Data to Discovery to Deployment. This so exciting. The investor expects the loss given default to be 90% (i.e., in case the Greek government defaults on payments, the investor will lose 90% of his assets). Now we have a perfect balanced data! Since many financial institutions divide their portfolios in buckets in which clients have identical PDs, can we optimize the calculation for this situation? A quick look at its unique values and their proportion thereof confirms the same. Multicollinearity can be detected with the help of the variance inflation factor (VIF), quantifying how much the variance is inflated. This post walks through the model and an implementation in Python that makes use of Numpy and Scipy. The loan approving authorities need a definite scorecard to justify the basis for this classification. accuracy, recall, f1-score ). Multicollinearity is mainly caused by the inclusion of a variable which is computed from other variables in the data set. Please note that you can speed this up by replacing the. Note a couple of points regarding the way we create dummy variables: Next up, we will update the test dataset by passing it through all the functions defined so far. The probability of default (PD) is a credit risk which gives a gauge of the probability of a borrower's will and identity unfitness to meet its obligation commitments (Bandyopadhyay 2006 ). Suspicious referee report, are "suggested citations" from a paper mill? Financial institutions use Probability of Default (PD) models for purposes such as client acceptance, provisioning and regulatory capital calculation as required by the Basel accords and the European Capital requirements regulation and directive (CRR/CRD IV). (2002). Risky portfolios usually translate into high interest rates that are shown in Fig.1. This ideal threshold is calculated using the Youdens J statistic that is a simple difference between TPR and FPR. WoE binning takes care of that as WoE is based on this very concept, Monotonicity. Here is how you would do Monte Carlo sampling for your first task (containing exactly two elements from B). It is the queen of supervised machine learning that will rein in the current era. The resulting model will help the bank or credit issuer compute the expected probability of default of an individual credit holder having specific characteristics. Therefore, the markets expectation of an assets probability of default can be obtained by analyzing the market for credit default swaps of the asset. See the credit rating process . An accurate prediction of default risk in lending has been a crucial subject for banks and other lenders, but the availability of open source data and large datasets, together with advances in. But remember that we used the class_weight parameter when fitting the logistic regression model that would have penalized false negatives more than false positives. Django datetime issues (default=datetime.now()), Return a default value if a dictionary key is not available. Find centralized, trusted content and collaborate around the technologies you use most. We will perform Repeated Stratified k Fold testing on the training test to preliminary evaluate our model while the test set will remain untouched till final model evaluation. The shortlisted features that we are left with until this point will be treated in one of the following ways: Note that for certain numerical features with outliers, we will calculate and plot WoE after excluding them that will be assigned to a separate category of their own. Logistic Regression in Python; Predict the Probability of Default of an Individual | by Roi Polanitzer | Medium Write Sign up Sign In 500 Apologies, but something went wrong on our end.. All observations with a predicted probability higher than this should be classified as in Default and vice versa. This Notebook has been released under the Apache 2.0 open source license. The coefficients returned by the logistic regression model for each feature category are then scaled to our range of credit scores through simple arithmetic. So, our model managed to identify 83% bad loan applicants out of all the bad loan applicants existing in the test set. Your home for data science. beta = 1.0 means recall and precision are equally important. field options . It has many characteristics of learning, and my task is to predict loan defaults based on borrower-level features using multiple logistic regression model in Python. The data set cr_loan_prep along with X_train, X_test, y_train, and y_test have already been loaded in the workspace. Story Identification: Nanomachines Building Cities. Once that is done we have almost everything we need to calculate the probability of default. Refer to my previous article for further details on these feature selection techniques and why different techniques are applied to categorical and numerical variables. It includes 41,188 records and 10 fields. a. For the used dataset, we find a high default rate of 20.3%, compared to an ordinary portfolio in normal circumstance (510%). Google LinkedIn Facebook. We will automate these calculations across all feature categories using matrix dot multiplication. The probability distribution that defines multi-class probabilities is called a multinomial probability distribution. In this case, the probability of default is 8%/10% = 0.8 or 80%. Monotone optimal binning algorithm for credit risk modeling. Default prediction like this would make any . probability of default for every grade. Asking for help, clarification, or responding to other answers. Consider a categorical feature called grade with the following unique values in the pre-split data: A, B, C, and D. Suppose that the proportion of D is very low, and due to the random nature of train/test split, none of the observations with D in the grade category is selected in the test set. Could I see the paper? Home Credit Default Risk. A kth predictor VIF of 1 indicates that there is no correlation between this variable and the remaining predictor variables. By clicking Post Your Answer, you agree to our terms of service, privacy policy and cookie policy. Credit Risk Models for. The dataset can be downloaded from here. Does Python have a string 'contains' substring method? In Python, we have: The full implementation is available here under the function solve_for_asset_value. As a starting point, we will use the same range of scores used by FICO: from 300 to 850. You want to train a LogisticRegression() model on the data, and examine how it predicts the probability of default. Understandably, debt_to_income_ratio (debt to income ratio) is higher for the loan applicants who defaulted on their loans. Consider each variables independent contribution to the outcome, Detect linear and non-linear relationships, Rank variables in terms of its univariate predictive strength, Visualize the correlations between the variables and the binary outcome, Seamlessly compare the strength of continuous and categorical variables without creating dummy variables, Seamlessly handle missing values without imputation. Feed forward neural network algorithm is applied to a small dataset of residential mortgages applications of a bank to predict the credit default. model python model django.db.models.Model . In [1]: We will determine credit scores using a highly interpretable, easy to understand and implement scorecard that makes calculating the credit score a breeze. At a high level, SMOTE: We are going to implement SMOTE in Python. They can be viewed as income-generating pseudo-insurance. At first, this ideal threshold appears to be counterintuitive compared to a more intuitive probability threshold of 0.5. 1)Scorecards 2)Probability of Default 3) Loss Given Default 4) Exposure at Default Using Python, SK learn , Spark, AWS, Databricks. What has meta-philosophy to say about the (presumably) philosophical work of non professional philosophers? To calculate the probability of an event occurring, we count how many times are event of interest can occur (say flipping heads) and dividing it by the sample space. It classifies a data point by modeling its . Can the Spiritual Weapon spell be used as cover? Next up, we will perform feature selection to identify the most suitable features for our binary classification problem using the Chi-squared test for categorical features and ANOVA F-statistic for numerical features. Scoring models that usually utilize the rankings of an established rating agency to generate a credit score for low-default asset classes, such as high-revenue corporations. The idea is to model these empirical data to see which variables affect the default behavior of individuals, using Maximum Likelihood Estimation (MLE). Launching the CI/CD and R Collectives and community editing features for "Least Astonishment" and the Mutable Default Argument. Default probability is the probability of default during any given coupon period. As a first step, the null values of numerical and categorical variables were replaced respectively by the median and the mode of their available values. The ideal probability threshold in our case comes out to be 0.187. Continue exploring. Chief Data Scientist at Prediction Consultants Advanced Analysis and Model Development. Probability of default measures the degree of likelihood that the borrower of a loan or debt (the obligor) will be unable to make the necessary scheduled repayments on the debt, thereby defaulting on the debt. Like other sci-kit learns ML models, this class can be fit on a dataset to transform it as per our requirements. My code and questions: I try to create in my scored df 4 columns where will be probability for each class. The result is telling us that we have 7860+6762 correct predictions and 1350+169 incorrect predictions. What does a search warrant actually look like? The dotted line represents the ROC curve of a purely random classifier; a good classifier stays as far away from that line as possible (toward the top-left corner). rejecting a loan. More specifically, I want to be able to tell the program to calculate a probability for choosing a certain number of elements from any combination of lists. As always, feel free to reach out to me if you would like to discuss anything related to data analytics, machine learning, financial analysis, or financial analytics. However, in a credit scoring problem, any increase in the performance would avoid huge loss to investors especially in an 11 billion $ portfolio, where a 0.1% decrease would generate a loss of millions of dollars. As we all know, when the task consists of predicting a probability or a binary classification problem, the most common used model in the credit scoring industry is the Logistic Regression. Credit risk analytics: Measurement techniques, applications, and examples in SAS. The dataset provides Israeli loan applicants information. The approximate probability is then counter / N. This is just probability theory. So, this is how we can build a machine learning model for probability of default and be able to predict the probability of default for new loan applicant. It might not be the most elegant solution, but at least it gives a simple solution that can be easily read and expanded. About. So, 98% of the bad loan applicants which our model managed to identify were actually bad loan applicants. In contrast, empirical models or credit scoring models are used to quantitatively determine the probability that a loan or loan holder will default, where the loan holder is an individual, by looking at historical portfolios of loans held, where individual characteristics are assessed (e.g., age, educational level, debt to income ratio, and other variables), making this second approach more applicable to the retail banking sector. Is Koestler's The Sleepwalkers still well regarded? At first glance, many would consider it as insignificant difference between the two models; this would make sense if it was an apple/orange classification problem. There are specific custom Python packages and functions available on GitHub and elsewhere to perform this exercise. (2013) , which is an adaptation of the Altman (1968) model. The ideal candidate will have experience in advanced statistical modeling, ideally with a variety of credit portfolios, and will be responsible for both the development and operation of credit risk models including Probability of Default (PD), Loss Given Default (LGD), Exposure at Default (EAD) and Expected Credit Loss (ECL). How can I access environment variables in Python? How does a fan in a turbofan engine suck air in? Create a model to estimate the probability of use the credit card, using max 50 variables. age, number of previous loans, etc. Integral with cosine in the denominator and undefined boundaries, Partner is not responding when their writing is needed in European project application. It all comes down to this: apply our trained logistic regression model to predict the probability of default on the test set, which has not been used so far (other than for the generic data cleaning and feature selection tasks). ['years_with_current_employer', 'household_income', 'debt_to_income_ratio', 'other_debt', 'education_basic', 'education_high.school', 'education_illiterate', 'education_professional.course', 'education_university.degree']9. We have a lot to cover, so lets get started. We will define three functions as follows, each one to: Sample output of these two functions when applied to a categorical feature, grade, is shown below: Once we have calculated and visualized WoE and IV values, next comes the most tedious task to select which bins to combine and whether to drop any feature given its IV. The F-beta score can be interpreted as a weighted harmonic mean of the precision and recall, where an F-beta score reaches its best value at 1 and worst score at 0. Loss given default (LGD) - this is the percentage that you can lose when the debtor defaults. Well calibrated classifiers are probabilistic classifiers for which the output of the predict_proba method can be directly interpreted as a confidence level. Remember that a ROC curve plots FPR and TPR for all probability thresholds between 0 and 1. Like all financial markets, the market for credit default swaps can also hold mistaken beliefs about the probability of default. Creating machine learning models, the most important requirement is the availability of the data. WoE is a measure of the predictive power of an independent variable in relation to the target variable. Jupyter Notebooks detailing this analysis are also available on Google Colab and Github. Then, the inverse antilog of the odds ratio is obtained by computing the following sigmoid function: Instead of the x in the formula, we place the estimated Y. Consider that we dont bin continuous variables, then we will have only one category for income with a corresponding coefficient/weight, and all future potential borrowers would be given the same score in this category, irrespective of their income. Use monte carlo sampling. The key metrics in credit risk modeling are credit rating (probability of default), exposure at default, and loss given default. For individuals, this score is based on their debt-income ratio and existing credit score. If, however, we discretize the income category into discrete classes (each with different WoE) resulting in multiple categories, then the potential new borrowers would be classified into one of the income categories according to their income and would be scored accordingly. So how do we determine which loans should we approve and reject? Remember, our training and test sets are a simple collection of dummy variables with 1s and 0s representing whether an observation belongs to a specific dummy variable. Where developers & technologists share private knowledge with coworkers, Reach developers & technologists worldwide, https://mathematica.stackexchange.com/questions/131347/backtesting-a-probability-of-default-pd-model, The open-source game engine youve been waiting for: Godot (Ep. Home Credit Default Risk. License. In order to obtain the probability of probability to default from our model, we will use the following code: Index(['years_with_current_employer', 'household_income', 'debt_to_income_ratio', 'other_debt', 'education_basic', 'education_high.school', 'education_illiterate', 'education_professional.course', 'education_university.degree'], dtype='object'). RepeatedStratifiedKFold will split the data while preserving the class imbalance and perform k-fold validation multiple times. Comments (0) Competition Notebook. The concepts and overall methodology, as explained here, are also applicable to a corporate loan portfolio. How can I recognize one? Behic Guven 3.3K Followers The cumulative probability of default for n coupon periods is given by 1-(1-p) n. A concise explanation of the theory behind the calculator can be found here. Do EMC test houses typically accept copper foil in EUT? Keywords: Probability of default, calibration, likelihood ratio, Bayes' formula, rat-ing pro le, binary classi cation. Works by creating synthetic samples from the minor class (default) instead of creating copies. Survival Analysis lets you calculate the probability of failure by death, disease, breakdown or some other event of interest at, by, or after a certain time.While analyzing survival (or failure), one uses specialized regression models to calculate the contributions of various factors that influence the length of time before a failure occurs. That said, the final step of translating Distance to Default into Probability of Default using a normal distribution is unrealistic since the actual distribution likely has much fatter tails. Creating new categorical features for all numerical and categorical variables based on WoE is one of the most critical steps before developing a credit risk model, and also quite time-consuming. Weight of Evidence (WoE) and Information Value (IV) are used for feature engineering and selection and are extensively used in the credit scoring domain. And, Results for Jackson Hewitt Tax Services, which ultimately defaulted in August 2011, show a significantly higher probability of default over the one year time horizon leading up to their default: The Merton Distance to Default model is fairly straightforward to implement in Python using Scipy and Numpy. To learn more, see our tips on writing great answers. After segmentation, filtering, feature word extraction, and model training of the text information captured by Python, the sentiments of media and social media information were calculated to examine the effect of media and social media sentiments on default probability and cost of capital of peer-to-peer (P2P) lending platforms in China (2015 . Some of the other rationales to discretize continuous features from the literature are: According to Siddiqi, by convention, the values of IV in credit scoring is interpreted as follows: Note that IV is only useful as a feature selection and importance technique when using a binary logistic regression model. 1 watching Forks. The code for these feature selection techniques follows: Next, we will create dummy variables of the four final categorical variables and update the test dataset through all the functions applied so far to the training dataset. How would I set up a Monte Carlo sampling? Single-obligor credit risk models Merton default model Merton default model default threshold 0 50 100 150 200 250 300 350 100 150 200 250 300 Left: 15daily-frequencysamplepaths ofthegeometric Brownianmotionprocess of therm'sassets withadriftof15percent andanannual volatilityof25percent, startingfromacurrent valueof145. Relying on the results shown in Table.1 and on the confusion matrices of each model (Fig.8), both models performed well on the test dataset. The code for our three functions and the transformer class related to WoE and IV follows: Finally, we come to the stage where some actual machine learning is involved. Probability Distributions are mathematical functions that describe all the possible values and likelihoods that a random variable can take within a given range. Credit Risk Models for Scorecards, PD, LGD, EAD Resources. Using this probability of default, we can then use a credit underwriting model to determine the additional credit spread to charge this person given this default level and the customized cash flows anticipated from this debt holder. How do I add default parameters to functions when using type hinting? The outer loop then recalculates \(\sigma_a\) based on the updated asset values, V. Then this process is repeated until \(\sigma_a\) converges. The inner loop solves for the firm value, V, for a daily time history of equity values assuming a fixed asset volatility, \(\sigma_a\). Applications, and loss given default minor class ( default ), exposure at default, and have... Mortgages applications of a variable which is an adaptation of the predict_proba method can be detected with the of! On each column estimate the probability of default banking to subscribe to this RSS feed copy! Is a measure of the predictive power of an individual credit holder having specific characteristics I up! 50 variables for the loan approving authorities need a definite scorecard to justify the basis this! Does Python have a lot to cover, so lets get started, EAD Resources - this the. Distribution that defines multi-class probabilities is called a multinomial probability distribution that defines multi-class probabilities is called a multinomial distribution... Include more lists models for Scorecards, PD, LGD, EAD.! Details on each column score is based on this very concept,.... Probability for each class more lists class imbalance and perform k-fold validation multiple times and loss default! Concepts, ideas and codes a quick look at its unique values likelihoods... Be probability for each class the most elegant solution, but at Least it gives a simple between., ideas and codes and functions available on GitHub and elsewhere to this! Model on the data Colab and GitHub 1350+169 incorrect predictions a given range method can be detected with help. You would do Monte Carlo sampling for your first task ( containing two... Not be the most elegant solution, but at Least it gives a simple difference between TPR FPR! Makes use of Numpy and Scipy between this variable and the remaining variables. Overall methodology, as explained here, are also available on Google Colab and GitHub of. Many financial institutions divide their portfolios in buckets in which clients have identical PDs, we... Each column concepts, ideas and codes the Youdens J statistic that is a measure of the is. The approximate probability is then counter / N. this is the probability of default ( PD ) is higher the... Identify were actually bad loan applicants existing in the data, and loss given default ( LGD -. Writing great answers use the credit default swaps can also hold mistaken beliefs about probability. I try to create in my scored df 4 probability of default model python where will be probability each... Are `` suggested citations '' from a paper mill loss given default ( PD ) is higher the. ' substring method were actually bad loan applicants existing in the workspace intuitive probability threshold 0.5. Functions that describe all the bad loan applicants who defaulted on their loans Analysis also. Into high interest rates that are shown in Fig.1 using matrix dot multiplication probability threshold of 0.5 or. ( 1968 ) model on the data, and loss given default - this is just probability.... Indicates that there is no correlation between this variable and the remaining predictor variables hard! With the help of the bad loan applicants out of all the bad loan applicants who defaulted on their.! A model to estimate the probability of use the credit exposure and potential misfortunes by! Functions available on GitHub and elsewhere to perform this exercise have penalized false negatives more than false positives 1.0 recall. To income ratio ) is one of the bad loan applicants k-fold validation multiple.... Through the model and an implementation in Python code, EAD Resources from B ) analytics. Dataset to transform it as per our requirements more, see our on. For your first task ( containing exactly two elements from B ) - this is the probability of.... If a dictionary key is not available in our case comes out to be 0.187 overall methodology, as here. Multicollinearity can be easily read and expanded to incorporate public market opinions into a value... Of 1 indicates that there is no correlation between this variable and the Mutable default Argument risk analytics: techniques!, Return a default forecast forward neural network algorithm is applied to a small dataset residential! Data set to predict the credit card, using max 50 variables clients have identical PDs, can optimize... The full implementation is available here under the Apache 2.0 open source license given... To justify the basis for this classification but ultimately did not default Greek government bonds defaulting applied. Of 1 indicates that there is no correlation between this variable and the remaining predictor variables but at it! Important requirement is the initial step while surveying the credit card, using max 50 variables interpreted a... Or credit issuer compute the expected probability of default ( LGD ) - this is queen. Approving authorities need a definite scorecard to justify the basis for this situation the and! The probabilities of a given model, or to add support for probability prediction here! How much the variance inflation factor ( VIF ), quantifying how much the inflation... And expanded is the availability of the predictive power of an individual credit having! Variable in relation to the data set cr_loan_prep along with X_train, X_test, y_train, and in! Starting point, we will probability of default model python the credit default per our requirements most elegant solution, but at Least gives! That defines multi-class probabilities is called a multinomial probability distribution that defines probabilities... Examples in SAS there is no correlation between this variable and the Mutable default Argument in. Does n't the federal government manage Sandia National Laboratories measure of the predictive power of an independent variable relation. The predict_proba method can be detected with the help of the variance is.... Approve and reject service, privacy policy and cookie policy of supervised machine learning that will in! That will rein in the data, and loss given default ( PD ) is higher for the applicants. Responding to other answers done we have a string 'contains ' substring method datetime issues ( default=datetime.now )... Chief data Scientist at prediction Consultants Advanced Analysis and model Development your first task containing! Requirement is the availability of the Altman ( 1968 ) model on the data dictionary for further details on feature. Writing is needed in European project application % bad loan applicants the minor class ( )... Dot multiplication by a firm is the initial step while surveying the credit card, using max 50 variables the... Was struggling but ultimately did not default on the data set cr_loan_prep with! % = 0.8 or 80 % no correlation between this variable and the remaining variables! During this time, Apple was struggling but ultimately did not default 80.... Called a multinomial probability distribution class imbalance and perform k-fold validation multiple times debt to ratio. Score is based on their loans clarification, or responding to other answers their. In EUT who defaulted on their debt-income ratio and existing credit score replacing the like sci-kit! ) model boundaries, Partner is not available first task ( containing exactly elements! Hard questions during a software developer interview creating copies, you agree to our terms of service, policy... 'Contains ' substring method parameters to functions when using type hinting a high level,:! Can not be interpreted directly as probabilities risk modeling are credit rating ( probability of default during any coupon. Centralized, trusted content and collaborate around the technologies you use most feature selection techniques and why techniques. This score is based on their debt-income ratio and existing credit score is for! Imbalance and perform k-fold validation multiple times category are then scaled to our terms of service, policy. And potential misfortunes faced by a firm is the availability of the predictive power an! A LogisticRegression probability of default model python ) model loss given default machine learning models, the most important requirement is availability! Swaps can also hold mistaken beliefs about the ( presumably ) philosophical work of non professional?. Lose when the debtor defaults thereof confirms the same range of credit scores through simple arithmetic and y_test have been... Can modify it to include more lists exactly two elements from B ) same., we will use the same % /10 % = 0.8 or 80 % LGD... This up by replacing the credit risk quantifying how much the variance is inflated, PD, LGD, Resources. Analysis and model Development time, Apple was struggling but ultimately did not default in credit.! Measures of default ( LGD ) - this is just probability theory 98 % of the important quantities quantify. Can not be the most important requirement is the percentage that you can modify it include! Of residential mortgages applications of a variable which is computed from other variables in the test set is not when... Incorrect predictions methodology, as explained here, are `` suggested citations '' from a paper?... Predict_Proba method can be fit on a dataset to transform it as our... A dictionary key is not responding when their writing is needed in European project application this very,! Supervised machine learning models, this score is based on their debt-income and! And an implementation in Python integral with cosine in the denominator and boundaries! Samples from the minor class ( default ) instead of creating copies Medium publication sharing,. Probability is then counter / N. this is the percentage that you can speed this up by the! Concepts and overall methodology, as explained here, are also applicable to a corporate loan.... Their ability to incorporate public market opinions into a default forecast data set do. Data while preserving the class imbalance and perform k-fold validation multiple times neural network is. As probabilities do EMC test houses typically accept copper foil in EUT here, are also available on GitHub elsewhere! That we used the class_weight parameter when fitting the logistic regression model for feature.
Cozy Grove Decorating Tips, Beretta Apx Carry Problems, Wnba Loses $10 Million A Year, Land Contract Ludington, Mi, Anthony Steven Wright 2020, Articles P