
{"type":"doc","content":[{"type":"paragraph","content":[{"text":"Plugin version: 2.11.0","type":"text"}]},{"type":"paragraph","content":[{"text":"The XML Parser Transform uses XPath to extract fields from a complex XML event. This plugin should generally be used in conjunction with the XML Reader Batch Source. The XML Reader will provide individual events to the XML Parser, which will be responsible for extracting fields from the events and mapping them to the output schema.","type":"text"}]},{"type":"paragraph","content":[{"text":"The transform takes an input record that contains an XML event or record, parses it using the specified XPaths, and returns a structured record according to the specified schema. For example, this plugin can be used in conjunction with the XML Reader Batch Source to extract values from XMLNews documents and create structured records that are easier to query.","type":"text"}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Configuration","type":"text"}]},{"type":"table","attrs":{"layout":"default","localId":"1eff71a9-89f4-469a-99e8-1ff3fda8cbbf"},"content":[{"type":"tableRow","content":[{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Property","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"Macro Enabled?","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Description","type":"text","marks":[{"type":"strong"}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Input field to parse as an XML 
record","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"Yes","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Required. The field in the input record that is the source of the XML event or record. ","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"XML encoding","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"Yes","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Required. The source XML character set encoding.","type":"text"}]},{"type":"paragraph","content":[{"text":"Default is UTF-8.","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"XPath Mappings","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Required. Mapping of the field names to the XPaths of the XML record. A comma-separated list, each element of which is a field name, followed by a colon, followed by an XPath expression. XPath 1.0 is supported, and location paths can include predicates. 
Example: ","type":"text"},{"text":"<field-name>:<XPath expression>","type":"text","marks":[{"type":"code"}]},{"text":".","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Field Name Schema Type Mapping","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Required. Mapping of field names in the output schema to data types. Consists of a comma-separated list, each element of which is a field name followed by a colon and a type, where the field names are the same as used in the xPathMappings, and the type is one of: boolean, int, long, float, double, bytes, or string. Example: ","type":"text"},{"text":"<field-name>:<data-type>","type":"text","marks":[{"type":"code"}]},{"text":".","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Error handling","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Required. 
The action to take in case of an error.","type":"text"}]},{"type":"bulletList","content":[{"type":"listItem","content":[{"type":"paragraph","content":[{"text":"Ignore error and continue","type":"text"}]}]},{"type":"listItem","content":[{"type":"paragraph","content":[{"text":"Exit on error: Stops processing upon encountering an error.","type":"text"}]}]},{"type":"listItem","content":[{"type":"paragraph","content":[{"text":"Write to error dataset: Writes the error record to an error dataset and continues.","type":"text"}]}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Fail on Array","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Optional. Whether to allow XPaths that are arrays. If false, the first element will be chosen. ","type":"text"}]},{"type":"paragraph","content":[{"text":"Default is false.","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Disallow Doctype DTD","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Optional. Prevents processing of any DTDs while reading XML files. The plugin itself defaults to ","type":"text"},{"text":"false","type":"text","marks":[{"type":"code"}]},{"text":", but configuring the plugin through the UI sets this to true. This prevents XXE-based XML vulnerabilities while reading the XML file. 
For details, see the OWASP page on ","type":"text"},{"text":"XML External Entity (XXE) Processing","type":"text","marks":[{"type":"link","attrs":{"href":"https://owasp.org/www-community/vulnerabilities/XML_External_Entity_(XXE)_Processing"}}]},{"text":".","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Load external DTD","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Optional. Enables loading of an external DTD while reading the XML file. Sets ","type":"text"},{"text":"http://apache.org/xml/features/load-external-dtd","type":"text","marks":[{"type":"link","attrs":{"href":"http://apache.org/xml/features/load-external-dtd"}}]},{"text":" ","type":"text"}]},{"type":"paragraph","content":[{"text":"Default is Off.","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Enable External Parameter Entities","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Optional. Enables external parameter entities while reading the XML file. 
Sets ","type":"text"},{"text":"http://xml.org/sax/features/external-parameter-entities","type":"text","marks":[{"type":"link","attrs":{"href":"http://xml.org/sax/features/external-parameter-entities"}}]},{"text":" ","type":"text"}]},{"type":"paragraph","content":[{"text":"Default is Off.","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Enable External General Entities","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Optional. Enables external general entities while reading the XML file. Sets ","type":"text"},{"text":"http://xml.org/sax/features/external-general-entities","type":"text","marks":[{"type":"link","attrs":{"href":"http://xml.org/sax/features/external-general-entities"}}]}]},{"type":"paragraph","content":[{"text":"Default is Off.","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[253.0]},"content":[{"type":"paragraph","content":[{"text":"Output Schema","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[103.0]},"content":[{"type":"paragraph","content":[{"text":"No","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[403.0]},"content":[{"type":"paragraph","content":[{"text":"Required. 
The output schema for the data.","type":"text"}]}]}]}]},{"type":"heading","attrs":{"level":2},"content":[{"text":"Example","type":"text"}]},{"type":"paragraph","content":[{"text":"This example parses an XML record received in the \"body\" field of the input record following the ","type":"text"},{"text":"XPath Mappings ","type":"text","marks":[{"type":"strong"}]},{"text":"for each field name. The output structured record will be created using the type specified for each field in the ","type":"text"},{"text":"Field Name Schema Type Mapping","type":"text","marks":[{"type":"strong"}]},{"text":". Only years and prices will be passed on for books with a price over 35.00:","type":"text"}]},{"type":"table","attrs":{"layout":"default","localId":"e1d784ad-4753-4a16-96f0-23168eef1fe2"},"content":[{"type":"tableRow","content":[{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[310.0]},"content":[{"type":"paragraph","content":[{"text":"Property","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[449.0]},"content":[{"type":"paragraph","content":[{"text":"Value","type":"text","marks":[{"type":"strong"}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[310.0]},"content":[{"type":"paragraph","content":[{"text":"Input field to parse as an XML record","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[449.0]},"content":[{"type":"paragraph","content":[{"text":"body","type":"text","marks":[{"type":"code"}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[310.0]},"content":[{"type":"paragraph","content":[{"text":"XML 
encoding","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[449.0]},"content":[{"type":"paragraph","content":[{"text":"UTF-8","type":"text","marks":[{"type":"code"}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[310.0]},"content":[{"type":"paragraph","content":[{"text":"XPath Mappings","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[449.0]},"content":[{"type":"codeBlock","content":[{"text":"category://book/@category\ntitle://book/title\nyear:/bookstore/book[price>35.00]/year\nprice:/bookstore/book[price>35.00]/price\nsubcategory://book/subcategory","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[310.0]},"content":[{"type":"paragraph","content":[{"text":"Field Name Schema Type Mapping","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[449.0]},"content":[{"type":"codeBlock","content":[{"text":"category:string\ntitle:string\nyear:int\nprice:double\nsubcategory:string","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[310.0]},"content":[{"type":"paragraph","content":[{"text":"Error handling","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[449.0]},"content":[{"type":"paragraph","content":[{"text":"Ignore error and continue","type":"text","marks":[{"type":"code"}]}]}]}]}]},{"type":"paragraph","content":[{"text":"For example, suppose the transform receives these input 
records:","type":"text"}]},{"type":"table","attrs":{"layout":"default","localId":"fc504400-7365-4652-9749-5f5b7b4b0501"},"content":[{"type":"tableRow","content":[{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[131.0]},"content":[{"type":"paragraph","content":[{"text":"offset","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[628.0]},"content":[{"type":"paragraph","content":[{"text":"body","type":"text","marks":[{"type":"strong"}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[131.0]},"content":[{"type":"paragraph","content":[{"text":"1","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[628.0]},"content":[{"type":"paragraph","content":[{"text":"ContinentalEuropean cuisinesEveryday ItalianGiada DeLaurentiis200530.00","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[131.0]},"content":[{"type":"paragraph","content":[{"text":"2","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[628.0]},"content":[{"type":"paragraph","content":[{"text":"Seriesfantasy literatureHarry PotterJ. K. 
Rowling200549.99","type":"text"}]}]}]}]},{"type":"paragraph","content":[{"text":"The output records will contain:","type":"text"}]},{"type":"table","attrs":{"layout":"default","localId":"3b521daf-7b41-46fb-b158-efde858be4ba"},"content":[{"type":"tableRow","content":[{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[151.0]},"content":[{"type":"paragraph","content":[{"text":"category","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[152.0]},"content":[{"type":"paragraph","content":[{"text":"title","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[80.0]},"content":[{"type":"paragraph","content":[{"text":"year","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[163.0]},"content":[{"type":"paragraph","content":[{"text":"price","type":"text","marks":[{"type":"strong"}]}]}]},{"type":"tableHeader","attrs":{"colspan":1,"rowspan":1,"colwidth":[213.0]},"content":[{"type":"paragraph","content":[{"text":"subcategory","type":"text","marks":[{"type":"strong"}]}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[151.0]},"content":[{"type":"paragraph","content":[{"text":"cooking","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[152.0]},"content":[{"type":"paragraph","content":[{"text":"Everyday Italian","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[80.0]},"content":[{"type":"paragraph","content":[{"text":"null","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[163.0]},"content":[{"type":"paragraph","content":[{"text":"null","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[213.0]},"content":[{"type":"paragraph","content":[{"text":"ContinentalEuropean 
cuisines","type":"text"}]}]}]},{"type":"tableRow","content":[{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[151.0]},"content":[{"type":"paragraph","content":[{"text":"children","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[152.0]},"content":[{"type":"paragraph","content":[{"text":"Harry Potter","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[80.0]},"content":[{"type":"paragraph","content":[{"text":"2005","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[163.0]},"content":[{"type":"paragraph","content":[{"text":"49.99","type":"text"}]}]},{"type":"tableCell","attrs":{"colspan":1,"rowspan":1,"colwidth":[213.0]},"content":[{"type":"paragraph","content":[{"text":"Seriesfantasy literature","type":"text"}]}]}]}]},{"type":"paragraph","content":[{"text":"Because the subcategory contains child nodes, the plugin returns the complete subcategory node (along with its child elements) as a string: ","type":"text"},{"text":"ContinentalEuropean cuisines","type":"text","marks":[{"type":"code"}]},{"text":". This ensures that the plugin returns a single XML event for a structured record instead of the two child events: ","type":"text"},{"text":"Continental","type":"text","marks":[{"type":"code"}]},{"text":" and ","type":"text"},{"text":"European cuisines","type":"text","marks":[{"type":"code"}]},{"text":".","type":"text"}]},{"type":"paragraph","content":[{"type":"hardBreak"}]}],"version":1}
