Exporting data

Edit on GitHub

To quickly populate an external system like ERP or OMS with data from your Spryker shop, you can export it as CSV files from the Spryker shop and then import them into the external system.

You can export only order data, which includes data about the following:

  • Orders
  • Order items
  • Order expenses
Export file format

Out of the box, Spryker supports only CSV as a format for exporting files.

To export the order data, follow these steps:

  1. Make sure you have the Sales Data Export feature installed for your project.
  2. Specify necessary configurations in the YML export configuration file residing in ./data/export/config/. For details about the YML export configuration file structure and configuration options, see .yml Export configuration file.
  3. Run console data:export --config file-name.yml, where file-name.yml is the name of the YML export configuration file. The command creates export CSV files in the ./data/export/ folder for each data_entity of the YML file. For each store specified in the YML file, a separate file is created. For an example of how the export works, see Structure of the YML export configuration file.
Multi-store support

Spryker Data Export supports the multi-store functionality, which means that you can export data for multiple stores.

YML export configuration file

The YML export configuration file lets you define what orders you want to export. The following content is exported:

  • order
  • order-item
  • order-expense

By default, the YML export configuration file resides in ./data/export/config/. You can adjust your YML export configuration file, but stick to its structure and take the possible data filtering options into account.

Note

The root of data export files is configured globally, and data export does not change it.

Structure of the YML export configuration file

Structure of the YML export configuration file is as follows:

defaults:
  filter_criteria: &default_filter_criteria
        order_created_at:  
            type: between
            from: '<order_created_at_date-time_from_value>'
            to: '<order_created_at_date-time_to_value>'

actions:  
   - data_entity: order
      destination: '{data_entity}s_<store_name_value_1>_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [<store_name_value_1>]
    - data_entity: order
      destination: '{data_entity}s_<store_name_value_2>_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [<store_name_value_2>]
    - data_entity: order
      destination: '{data_entity}s_<store_name_value_n>_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [<store_name_value_n>]

   - data_entity: order-item
      destination: '{data_entity}s_<store_name_value_1>_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [<store_name_value_1>]

   -  data_entity: order-expense             
      destination: '{data_entity}s_<store_name_value_1>_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [<store_name_value_1>]

The type of content to export is defined in section actions by data_entity and must be order, order-item, and order-expense. You can define what stores to run export for, and specify order dates to export data for. For information about exporting order data for specific stores and time periods, see Setting the filter criteria in a YML file.

For example, check out the default YML export configuration file order_export_config.yml. It’s configuration presupposes batch export of the three data entities: order, order-item, and order-expense.

When running the command for data export with this file, console data:export --config order_export_config.yml, exported CSV files are created in data/export. For each data entity and store, a separate file is generated:

  • order-expenses_AT.csv
  • order-expenses_DE.csv
  • order-items_AT.csv
  • order-items_DE.csv
  • orders_AT.csv
  • orders_DE.csv

For details about the content of each of the files, see Data Export Orders CSV Files Format.

Setting the filter criteria in a YML file

You can set the following filter criteria for the order data export in your YML export configuration file:

  • Store names: stores from which the data are exported.
  • Date and time range: interval from what date and time to what date and time the order was created, including the from and the to values. If you use the label order_updated_at, the range is relative to the date and time the order was updated.

Defining the stores for order data export

To define the stores you want to export the order data for, specify them in destination for the specific data entities.

Warning

You must create individual files for each data entity and each store if your filter criteria include store_name.

For example, to export the order-expenses data for the DE store and order-items data for DE and AT stores, the YML file looks like this:

defaults:
    filter_criteria: &default_filter_criteria
        order_created_at:
            type: between
            from: '2020-05-01 00:00:00'
            to: '2020-12-31 23:59:59'

actions:
    - data_entity: order-expense
      destination: '{data_entity}s_DE_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [DE]

    - data_entity: order-item
      destination: '{data_entity}s_DE_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [DE]
    - data_entity: order-item
      destination: '{data_entity}s_AT_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: [AT]

Defining the Ddte and time range for order data export

The default date and time range filter criteria—for example, the order creation dates filter applied to all data_entity items by default—is specified in the defaults section:

defaults:
  filter_criteria: &default_filter_criteria
        order_created_at:  
            type: between
            from: '2020-05-01 08:00:00'
            to: '2020-06-07 08:00:00'  
Info

To use the date and time range filter criteria of the defaults section and apply the filter criteria to data_entity items of the actions section, <<: *default_filter_criteria must be declared in the filter_criteria parameter of each data_entity item.

You can change the filter criteria for any data_entity items by replacing <<: *default_filter_criteria with the values you need.

For example, to export only orders created on May 15th, 2020, for the order-expenses data entity in the AT store, you need to replace the <<: *default_filter_criteria line under filter_criteria for order-expenses in AT store with the following:

order_created_at:  
  type: between
  from: '2020-05-15 00:00:00'
  to: '2020-05-15 23:59:59'

Example:

defaults:
filter_criteria: &default_filter_criteria
      order_created_at:  
          type: between
          from: '2020-05-01 00:00:00'
          to: '2020-06-07 23:59:59'

actions:  
 - data_entity: order-expense
    destination: '{data_entity}s_DE.{extension}'
    filter_criteria:
        <<: *default_filter_criteria
        store_name: DE
  - data_entity: order-expense
    destination: '{data_entity}s_AT.{extension}'
    filter_criteria:
        order_created_at:  
          type: between
          from: '2020-05-15 00:00:00'
          to: '2020-05-15 23:59:59'
        store_name: AT
  ...

After running the command with the changed filter criteria for order-expense, the order-expenses_AT.csv file only contains the orders created on May 15th, 2020. The other files contain the orders created from May 1st till July 6th, as specified in &default_filter_criteria of the defaults section.

Overwriting existing CSV files upon repeated command run

When exporting data, the newly generated CSV files overwrite the existing ones. Currently, this behavior is not configurable.

If you want to generate new CSV files without overwriting eventual existing ones, you may use a {timestamp} tag in the name of the file to be generated. For example, if you use the default structure of the YAML export configuration file, upon repeated launch of the console data:export --config file-name.yml, the already existing export CSV files are generated with different file names according to the {timestamp} on the moment of its creation, and therefore are not overwritten.

And vice versa: if you want to overwrite the existing files, remove {timestamp} from the destination parameter of the YML file for the necessary data_entity items—for example:

Initial file:

defaults:
  filter_criteria: &default_filter_criteria
        order_created_at:  
            type: between
            from: '2020-05-01 08:00:00'
            to: '2020-06-07 08:00:00'

actions:  
   - data_entity: order-expense
      destination: '{data_entity}s_DE_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: DE
    - data_entity: order-expense
      destination: '{data_entity}s_AT_{timestamp}.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: AT
    ...

File with the removed {timestamp}:

defaults:
  filter_criteria: &default_filter_criteria
        order_created_at:  
            type: between
            from: '2020-05-01 08:00:00'
            to: '2020-06-07 08:00:00'

actions:  
   - data_entity: order-expense
      destination: '{data_entity}s_DE.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: DE
    - data_entity: order-expense
      destination: '{data_entity}s_AT.{extension}'
      filter_criteria:
          <<: *default_filter_criteria
          store_name: AT
    ...