7PAM2000 - Applied Data Science 1 Assignment 3 - Clustering and Fitting

Assignment Task

For this assignment, your task is to create a poster focusing on clustering and fitting, suitable for presentation.

Once again, we will use public data from the World Bank, focusing on country-by-country indicators related to climate change. Other relevant indicators, such as GDP per capita, can be found in the complete list of indicators. Please note that some countries may lack entries for recent years.

Your Objectives:

  • Find interesting clusters in the data. Normalized values such as GDP per capita, CO2 production per head, or CO2 per $ of GDP can yield meaningful clusters. Use at least one clustering method discussed in lectures, preferably on normalized data. To visualize the results, add the cluster classifications as a new column to the dataframes and select the members of each cluster by logical slicing. Using pyplot, produce a plot showing cluster membership and cluster centers.

  • Fit simple model(s) to data sets with curve_fit. This could involve fitting time series data or modeling one attribute as a function of another. Keep the models simple, such as exponential growth, a logistic function, or low-order polynomials. Use the models to make predictions, for example values ten or twenty years ahead, and include confidence ranges.

  • Utilize the attached function err_ranges to estimate lower and upper limits of the confidence range and create a plot illustrating the best fitting function and the confidence range.
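
The clustering objective above can be sketched roughly as follows. This is only an illustration under stated assumptions: the data are synthetic placeholders rather than World Bank values, the column names gdp_per_capita and co2_per_head are hypothetical, and k-means (via scipy.cluster.vq.kmeans2) is assumed to be an acceptable method; use whichever method was covered in lectures.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script also runs headless
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from scipy.cluster.vq import kmeans2

# Synthetic placeholder data (NOT World Bank values): two loose groups.
rng = np.random.default_rng(0)
low = rng.normal(loc=[2_000.0, 1.0], scale=[500.0, 0.3], size=(30, 2))
high = rng.normal(loc=[40_000.0, 10.0], scale=[5_000.0, 2.0], size=(30, 2))
features = ["gdp_per_capita", "co2_per_head"]  # hypothetical column names
df = pd.DataFrame(np.vstack([low, high]), columns=features)

# Min-max normalise each column to [0, 1] so both carry equal weight.
col_min = df[features].min()
col_max = df[features].max()
df_norm = (df[features] - col_min) / (col_max - col_min)

# Cluster the normalised data; k = 2 is an assumption for this toy data.
n_clusters = 2
centres_norm, labels = kmeans2(df_norm.to_numpy(), n_clusters, minit="++", seed=0)

# Add the classification as a new column of the original dataframe.
df["cluster"] = labels

# Transform the cluster centres back to the original units for plotting.
centres = centres_norm * (col_max - col_min).to_numpy() + col_min.to_numpy()

fig, ax = plt.subplots()
for k in range(n_clusters):
    members = df[df["cluster"] == k]  # logical slicing on the new column
    ax.scatter(members["gdp_per_capita"], members["co2_per_head"],
               label=f"cluster {k}")
ax.scatter(centres[:, 0], centres[:, 1], marker="x", color="black",
           s=100, label="cluster centres")
ax.set_xlabel("GDP per capita")
ax.set_ylabel("CO2 per head")
ax.legend()
# fig.savefig("clusters.png")  # or plt.show() in an interactive session
```

Clustering on the normalised columns but plotting in the original units keeps the axes readable for a poster audience.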

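The fitting objective can be sketched in a similar way. The example below assumes an exponential growth model and uses synthetic data, not World Bank values. The attached err_ranges function is not reproduced here; as a stand-in, the confidence range is estimated by sampling parameter sets from the covariance matrix returned by curve_fit, which is a different technique and only an approximation of what the assignment asks for. The names exponential, years and values are illustrative.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the script also runs headless
import matplotlib.pyplot as plt
import numpy as np
from scipy.optimize import curve_fit


def exponential(t, n0, g):
    """Exponential growth with scale n0 and rate g; t is shifted to 1990."""
    return n0 * np.exp(g * (t - 1990.0))


# Synthetic placeholder time series (NOT World Bank values).
rng = np.random.default_rng(1)
years = np.arange(1990, 2021, dtype=float)
values = exponential(years, 100.0, 0.03) * (1.0 + rng.normal(0.0, 0.02, years.size))

# Fit the model; p0 gives curve_fit a sensible starting point.
popt, pcov = curve_fit(exponential, years, values, p0=(values[0], 0.01))

# Extend the time axis twenty years past the data for the prediction.
future = np.arange(1990, 2041, dtype=float)
best = exponential(future, *popt)

# Stand-in for err_ranges (assumption): sample parameters from the fitted
# covariance and take the 2.5th and 97.5th percentiles of the curves.
samples = rng.multivariate_normal(popt, pcov, size=2000)
curves = np.array([exponential(future, *p) for p in samples])
lower, upper = np.percentile(curves, [2.5, 97.5], axis=0)

fig, ax = plt.subplots()
ax.plot(years, values, "o", label="data (synthetic)")
ax.plot(future, best, label="best fit")
ax.fill_between(future, lower, upper, alpha=0.3, label="approx. 95% range")
ax.set_xlabel("year")
ax.set_ylabel("indicator value")
ax.legend()
# fig.savefig("fit.png")  # or plt.show() in an interactive session
```

Shifting the time axis to 1990 inside the model keeps the exponent small, which tends to make the fit converge more reliably than fitting against raw calendar years.
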
Content and Presentation:

  • Exercise your initiative to craft a narrative with the data, employing suitable visualizations and providing a concise text narrative to communicate and elucidate your findings.

  • Recognize the distinctions between a good report and a good poster. Keep text concise in the poster, considering itemization or similar approaches for clarity. Overview tables may be preferable for presenting information like comparisons between countries.

  • Avoid exhaustive explanations. Ensure the poster contains all necessary information for basic comprehension, with additional details available during a poster session. Focus on self-contained graphs and provide explicit conclusions summarizing the essential findings. Omit technical information, as the audience seeks results rather than methodology details.

Your poster should feature an introduction and clear conclusions summarizing the key findings. Minimize technical information, emphasizing instead the effective presentation of results for the audience's understanding.