Create a dataset from pandas dataframe

PHOTO EMBED

Fri Apr 08 2022 23:01:15 GMT+0000 (Coordinated Universal Time)

Saved by @wessim

from azureml.core import Workspace, Datastore, Dataset
import pandas as pd

pandas_df = pd.read_csv('<path to your csv file>')
ws = Workspace.from_config()
datastore = Datastore.get(ws, '<name of your datastore>')
dataset = Dataset.Tabular.register_pandas_dataframe(pandas_df, datastore, "dataset_from_pandas_df", show_progress=True)
content_copyCOPY

To create a TabularDataset from an in memory pandas dataframe use the register_pandas_dataframe() method. This method registers the TabularDataset to the workspace and uploads data to your underlying storage, which incurs storage costs.

https://docs.microsoft.com/en-us/azure/machine-learning/how-to-create-register-datasets