Data Science Questions

Pandas, NumPy, and Machine Learning concepts.

1. Pandas Merge vs Join

Question: What is the difference between merge and join in Pandas?

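join combines two DataFrames on their indexes by default and performs a left join; its on argument matches a column of the calling DataFrame against the index of the other. merge is more general: it can join on columns or on indexes, accepts different key names for each side through left_on and right_on, and performs an inner join by default.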
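
A minimal sketch of the difference, using two small made-up tables. The DataFrame names, column names, and values are assumptions chosen for illustration only.

```python
import pandas as pd

# Hypothetical example data: the key column has a different name in each table.
employees = pd.DataFrame({"emp_id": [1, 2, 3], "name": ["Ann", "Bob", "Cy"]})
salaries = pd.DataFrame({"employee": [1, 2, 4], "salary": [50, 60, 70]})

# merge: name the key column on each side explicitly.
# The default is an inner join, so only keys present in both tables survive.
merged = employees.merge(salaries, left_on="emp_id", right_on="employee")

# join: aligns on the index of the other DataFrame.
# The default is a left join, so every row of the caller is kept.
joined = employees.set_index("emp_id").join(salaries.set_index("employee"))

print(merged)
print(joined)
```

Here merged keeps only the two keys found in both tables, while joined keeps all three employees and leaves the salary missing for the one without a match.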

2. Overfitting vs Underfitting

Question: Explain overfitting and underfitting.

Overfitting: The model fits the training data too closely, including its noise, so it scores well on the training set but performs poorly on new data. This corresponds to high variance.

Underfitting: The model is too simple to capture the underlying pattern, so it performs poorly on both the training data and new data. This corresponds to high bias.
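
One way to see both effects is to fit polynomials of increasing degree to noisy data and compare training and test error. The sketch below assumes a synthetic sine curve with Gaussian noise; the degrees, sample sizes, seed, and helper names (make_data, mse) are illustrative choices, not part of the original question.

```python
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(0)

def make_data(n):
    """Draw n noisy samples from an assumed ground truth, sin(2*pi*x)."""
    x = np.sort(rng.uniform(0.0, 1.0, n))
    y = np.sin(2 * np.pi * x) + rng.normal(0.0, 0.2, n)
    return x, y

def mse(model, x, y):
    """Mean squared error of a fitted polynomial on (x, y)."""
    return float(np.mean((model(x) - y) ** 2))

x_train, y_train = make_data(15)
x_test, y_test = make_data(200)

results = {}
for degree in (1, 3, 12):
    # Least-squares polynomial fit of the given degree on the training set.
    model = Polynomial.fit(x_train, y_train, degree)
    results[degree] = (mse(model, x_train, y_train), mse(model, x_test, y_test))
    print(f"degree={degree:2d}  train MSE={results[degree][0]:.4f}  "
          f"test MSE={results[degree][1]:.4f}")
```

The degree-1 fit is the underfitting case: its training error stays high because a straight line cannot follow the curve. Training error can only fall as the degree grows, so the degree-12 fit has the lowest training error; a test error well above that training error is the signature of overfitting. Exact numbers depend on the seed and noise level.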