We started by creating a Multiple Linear Regression model on the full data set (including 
            the height outliers). For this first model, we calculated an R2 value of 50.785%. 
            This indicates that our model accounts for about half of the variance in the data set. 
            
            We then created a second Multiple Linear Regression model on a data set that excludes the 
            four height outliers that were found in the Visualizing Parameters stage. This model produced 
            an R2 value of 51.986%. This is indicates that removing the four outliers improved the 
            model's performance. 
            
            Because the data are all positive, we next tried using the Min-Max Scaler from SciKit-Learn 
            instead of the Standard Scaler. This resulted in an R2 value of 51.952%. Since it 
            did not produce an appreciable improvement in the model's performance, we returned to using 
            the Standard Scaler for the rest of the non-neural network models. 
            
            Since we weren't obtaining strong R2 values with the linear regression models, we 
            decided to try other types of supervised regression models.