The Data Extractor feature lets you extract data into custom fields.
What problem does it solve?
Clients structure their data differently and often provide it in a form that is not optimized for Search:
Product attributes are stored in Tag1 or another field together with other attributes;
Attribute values are provided as a string like Field:Value1,Value2 or "Value1,Value2";
Attribute values are provided as a JSON array like ["Value1", "Value2"].
In this form, the data can't be used in facets, merchandising, and other areas. The Data Extractor generates the attributes as separate fields that can later be used in Search & Categories.
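To make the value formats above concrete, here is a minimal Python sketch of how a comma-separated string, a quoted string, and a JSON array can all be normalized into the same list of values. The function name parse_values is hypothetical and this only illustrates the idea; it is not how the Data Extractor is implemented.

```python
import json


def parse_values(raw: str) -> list[str]:
    """Turn one raw attribute value into a list of individual values.

    Hypothetical helper for illustration only, not the product's code.
    """
    text = raw.strip()
    if text.startswith("["):
        # JSON array such as ["Value1", "Value2"]
        return [str(value) for value in json.loads(text)]
    # Plain or quoted comma-separated string such as "Value1,Value2"
    text = text.strip('"')
    return [part.strip() for part in text.split(",") if part.strip()]


print(parse_values('["Value1", "Value2"]'))  # ['Value1', 'Value2']
print(parse_values('"Value1,Value2"'))       # ['Value1', 'Value2']
print(parse_values("Value1,Value2"))         # ['Value1', 'Value2']
```

All three inputs end up as the same two separate values, which is the shape that facets and merchandising rules need.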
Use cases today:
Create more facets
Merchandise by various attributes, e.g. promotion type
Add attributes as Searchable Fields for better search relevance
Build custom frontend template elements
Decide which attributes you would like to extract into separate data fields
Go to Catalog Explorer → Settings → Data Extractor
Name your new data extractor, e.g. Attributes from Tag1
Select the type of your Data Extractor based on the field's data structure
Select the field that currently holds the attributes you want
(If needed) Enter a separator: the character that separates the attribute's name from its value
e.g. if the attribute is Style:Oversized, then your separator is :
Extracting more attributes gives you more to work with, but you can also use the optional settings to narrow the extraction:
If you'd like to extract only the Style attribute, set Name Filter to Style
If you'd like to extract only certain attribute values, e.g. Oversized, set Value Filter to Oversized
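The effect of the separator, Name Filter, and Value Filter can be sketched in Python as follows. This is an illustration under assumptions: the function extract_attributes, its parameters, the sample tag1 data, and the exact-match behavior of the filters are all hypothetical and may differ from what the Data Extractor actually does.

```python
from typing import Optional


def extract_attributes(
    entries: list[str],
    separator: str = ":",
    name_filter: Optional[str] = None,
    value_filter: Optional[str] = None,
) -> dict[str, list[str]]:
    """Split entries like "Style:Oversized" into {name: [values]}.

    Hypothetical sketch: filters are assumed to be exact matches.
    """
    fields: dict[str, list[str]] = {}
    for entry in entries:
        name, found, raw_values = entry.partition(separator)
        if not found:
            continue  # no separator, so this entry is not a name/value pair
        name = name.strip()
        if name_filter is not None and name != name_filter:
            continue  # Name Filter keeps only the requested attribute
        for value in raw_values.split(","):
            value = value.strip()
            if not value:
                continue
            if value_filter is not None and value != value_filter:
                continue  # Value Filter keeps only the requested value
            fields.setdefault(name, []).append(value)
    return fields


# Hypothetical contents of a product's Tag1 field
tag1 = ["Style:Oversized,Cropped", "Promotion:Sale", "new-arrival"]

print(extract_attributes(tag1))
# {'Style': ['Oversized', 'Cropped'], 'Promotion': ['Sale']}
print(extract_attributes(tag1, name_filter="Style"))
# {'Style': ['Oversized', 'Cropped']}
print(extract_attributes(tag1, name_filter="Style", value_filter="Oversized"))
# {'Style': ['Oversized']}
```

Without filters every name/value pair becomes its own field; adding a Name Filter or Value Filter narrows the result to just the attribute or value you asked for.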
Once the data extractor is created, the new data fields become available after the next re-index, which happens 4 times a day. To check that a new field is available and to use it in search:
Go to Search & Categories → Settings → Indexed Fields and add the new field that you've created
Once it's added, wait for another re-index
Finally, your new attributes can be used in facets, merchandising, and searchable fields
Later on, this data will also become available to the rest of the Nosto Platform.