Data Processing
Process data from an ASE trajectory object.
- sparc.src.data_processing.get_data(ase_traj='AseMD.traj', dir_name='Dataset', skip_min=0, skip_max=None)
Process an ASE trajectory file and split the frames into training and validation datasets: 80% of the frames go to training and 20% to validation. The data is then saved in the specified directory (dir_name) in the .npy format.
Args:
- ase_traj (str):
ASE trajectory file name (default: AseMD.traj).
- dir_name (str):
Path to the directory for saving training and validation datasets.
- skip_min (int):
Skip the first n frames (default: 0).
- skip_max (int):
Skip the last n frames (default: None).
Example
from sparc.src.data_processing import get_data
get_data(ase_traj="AseMD.traj", dir_name="Dataset", skip_min=0, skip_max=None)
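To make the frame selection concrete, here is a minimal sketch of how skipping and an 80/20 split could be computed on frame indices. It is not the sparc implementation: the helper name split_frames and its train_fraction parameter are hypothetical, and it assumes a contiguous split (first 80% of the kept frames for training), which the documentation above does not confirm.

```python
def split_frames(n_frames, skip_min=0, skip_max=None, train_fraction=0.8):
    """Hypothetical helper: return (train_idx, val_idx) lists of frame indices."""
    # Drop the first skip_min frames and, if set, the last skip_max frames.
    stop = n_frames - skip_max if skip_max else n_frames
    kept = list(range(skip_min, stop))
    # Assumption: a contiguous split, with the first 80% used for training.
    n_train = int(round(train_fraction * len(kept)))
    return kept[:n_train], kept[n_train:]


# A 100-frame trajectory with 10 frames trimmed from each end leaves 80 frames.
train_idx, val_idx = split_frames(100, skip_min=10, skip_max=10)
print(len(train_idx), len(val_idx))
```

With these inputs, frames 10 to 89 are kept, so 64 indices go to training and 16 to validation.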
Reference
Check dpdata for more details.