Symptom
When creating a data flow and using a script operator in SAP Datasphere, the incoming table is not provided as a complete dataset. Instead, the data is split into multiple batches of rows (depending on the size of the table).
- The transform function is therefore invoked multiple times, once for each batch.
- In each call, the data parameter only contains the rows of the current batch.
As a result, operations that require access to the entire table within the script operator are not possible. For example, removing duplicates.
Read more...
Environment
SAP Datasphere
Product
Keywords
script, DataFrame, datasphere, script operator, NumPy, data flow, script operator, transform function, batches, duplicates, restriction, complete table , KBA , DS-DI-DF , Data Flows , Problem
About this page
This is a preview of a SAP Knowledge Base Article. Click more to access the full version on SAP for Me (Login required).Search for additional results
Visit SAP Support Portal's SAP Notes and KBA Search.
SAP Knowledge Base Article - Preview