Figure 1.
The example script I provided in Data Science 1 is also a good example of how to document your code; albeit that one was done in R.
- Download the template python script over here and fill it in according to the guidelines described in there. To download the script, open the raw file, right-click and ‘save as' into a location of choice. Please fill in you student number and name. In this class there are no students called: ‘FirstName', ‘LastName' or students with the following student number: ‘StudentNumber', I checked.
- Load in the youthcare dataset you created in Business Intelligence. Then save your file to your GitHub repository.
- Open your research design and use in-line comments to formulate a multiple regression analysis based on your research question (or when not answerable using multiple regression: related to your research question). Start by listing the variables which you think could predict the outcome variable you're interested in and motivate why you think they might predict your outcome variable.
- Create your fully fitted model (so the model containing all variables you wrote down in step 2) under the python code you just wrote.
- Test, re-fit and validate your model. Create a new model on a new line for every re-fit. Keep track of any predictor variables you exclude from the full model when re-fitting. Motivate why you are excluding; or including new variables using in-line comments.
- Continue till 16:00, or stop when you feel you can no longer improve the model.Then save your file to your GitHub repository.
4) Day-reflection
At 16:30, there's a meeting you're encouraged to take part in to ask questions and to discuss our progress and reflect on today activities.
Up Next!
Next week, we will cover classification algorithms!
Resources
2023,Applied Data Science and Artificial Intelligence @ Breda University of Applied Sciences