I recently started learning Machine Learning, and importing datasets is the first set to take. ScikitLearn offers many datasets, so this time, I am going to use it.
<pre><code class="lang-python">from sklearn.datasets import load_boston
import pandas as pd
import matplotlib.pyplot as plt

boston = load_boston()
df = pd.DataFrame(boston.data, columns=boston.feature_names)
print(df.head())
</code></pre>
we create a <code>pandas</code> DataFrame from the Boston Housing dataset using the <code>pd.DataFrame()</code> function. We pass in the <a target="_blank" href="http://boston.data"><code>boston.data</code></a> array as the data, and use <code>boston.feature_names</code> to set the column names.
boston.feature_names has name of each feature, in another word, each column.
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1677860076484/d77ec805-a130-4432-bd57-fc0a1499be7b.png" alt class="image--center mx-auto" />
boston.data has an array on arrays that has data of each feature. This is what I got by printing <a target="_blank" href="http://boston.data">boston.data</a>.
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1677859991576/37b76a92-e1c7-48b1-b5e9-7262aeea54a6.png" alt class="image--center mx-auto" />
<code>head()</code> method is used to display the first few rows of the DataFrame. You can modify the number of rows displayed by passing in a different argument to the <code>head()</code> method. If I set df.head(100), the first 100 records are displayed.
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1677860107456/1cff9b1b-3d41-4924-8e70-955fec2be178.png" alt class="image--center mx-auto" />

I recently started learning Machine Learning, and importing datasets is the first set to take. ScikitLearn offers many datasets, so this time, I am going to use it.

```python
from sklearn.datasets import load_boston
import pandas as pd
import matplotlib.pyplot as plt

boston = load_boston()
df = pd.DataFrame(boston.data, columns=boston.feature_names)
print(df.head())
```

we create a `pandas` DataFrame from the Boston Housing dataset using the `pd.DataFrame()` function. We pass in the [`boston.data`](http://boston.data) array as the data, and use `boston.feature_names` to set the column names.

boston.feature\_names has name of each feature, in another word, each column.

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1677860076484/d77ec805-a130-4432-bd57-fc0a1499be7b.png align="center")

boston.data has an array on arrays that has data of each feature. This is what I got by printing [boston.data](http://boston.data).

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1677859991576/37b76a92-e1c7-48b1-b5e9-7262aeea54a6.png align="center")

`head()` method is used to display the first few rows of the DataFrame. You can modify the number of rows displayed by passing in a different argument to the `head()` method. If I set df.head(100), the first 100 records are displayed.

![](https://cdn.hashnode.com/res/hashnode/image/upload/v1677860107456/1cff9b1b-3d41-4924-8e70-955fec2be178.png align="center")

How to see the imported sklearn datasets