💡 You can skip this section if the default even distribution works for your environment.
Verify splits large tables/files into partitions processed in parallel — similar to dividing work across multiple workers. The default method distributes records based on key values:
For CSV files:
To implement custom partition logic:
Create a new Broadway flow.
Define the following External flow inputs:
Define flow output bucketRows — an array of maps where each map uses customizedKey as the key and the combined key values as the value:
[
{ "CUSTOMER_ID|SSN": "3|948374937" },
{ "CUSTOMER_ID|SSN": "11|1706304573" }
]
verify_partition_method.Partitions Assignment Method for the relevant configuration to this flow.💡 You can skip this section if the default even distribution works for your environment.
Verify splits large tables/files into partitions processed in parallel — similar to dividing work across multiple workers. The default method distributes records based on key values:
For CSV files:
To implement custom partition logic:
Create a new Broadway flow.
Define the following External flow inputs:
Define flow output bucketRows — an array of maps where each map uses customizedKey as the key and the combined key values as the value:
[
{ "CUSTOMER_ID|SSN": "3|948374937" },
{ "CUSTOMER_ID|SSN": "11|1706304573" }
]
verify_partition_method.Partitions Assignment Method for the relevant configuration to this flow.