[META] Add missing unicode generation for IWSLTs #1607

Closed
@erip

Description

With #1599, most tests now have UTF-8 roundtrip tests; the IWSLTs are the exception. It seems that xml.etree.ElementTree.fromstring has no mechanism for recovering from error-inducing Unicode, which is likely to appear in randomly generated strings. lxml has such a mechanism, but I'd hate to add a userland dependency for the sake of testing.

This issue records the TODO and acts as a place to brainstorm alternative mechanisms for testing UTF-8 encode/decode roundtrips in IWSLT.
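
One possible stdlib-only direction, sketched below, is to restrict the randomly generated strings to code points that XML 1.0 permits, so that the parser never sees error-inducing input in the first place. This is a minimal sketch under stated assumptions, not what the test suite does: the names `XML_SAFE_RANGES`, `random_xml_safe_string`, and `roundtrip` are hypothetical, and the `<seg>` wrapper element is used only for illustration. The ranges follow the XML 1.0 `Char` production, conservatively minus `\r` (parsers normalize it to `\n`, which would break an exact roundtrip comparison) and the discouraged U+007F to U+009F controls.

```python
import random
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# Hypothetical helper data: code point ranges allowed by the XML 1.0 `Char`
# production, minus "\r" (normalized to "\n" on parse) and the discouraged
# U+007F-U+009F control characters. Surrogates, U+FFFE and U+FFFF are excluded.
XML_SAFE_RANGES = [
    (0x09, 0x0A),        # tab, line feed
    (0x20, 0x7E),        # printable ASCII
    (0xA0, 0xD7FF),      # BMP below the surrogate block
    (0xE000, 0xFFFD),    # BMP above the surrogate block
    (0x10000, 0x10FFFF), # supplementary planes
]


def random_xml_safe_string(rng, length):
    """Draw `length` code points uniformly from XML_SAFE_RANGES."""
    weights = [hi - lo + 1 for lo, hi in XML_SAFE_RANGES]
    picked = rng.choices(XML_SAFE_RANGES, weights=weights, k=length)
    return "".join(chr(rng.randint(lo, hi)) for lo, hi in picked)


def roundtrip(text):
    """Escape markup characters, encode as UTF-8, parse, and return the text."""
    xml_bytes = f"<seg>{escape(text)}</seg>".encode("utf-8")
    return ET.fromstring(xml_bytes).text


if __name__ == "__main__":
    rng = random.Random(0)
    sample = random_xml_safe_string(rng, 16)
    assert roundtrip(sample) == sample

    # A character outside the allowed ranges still fails to parse.
    try:
        ET.fromstring("<seg>\x0b</seg>")
    except ET.ParseError as err:
        print("rejected as expected:", err)
```

The trade-off is that the test no longer exercises arbitrary Unicode, only the subset that can legally appear in an XML document. Since the IWSLT data is distributed as XML, that subset may be the relevant one anyway.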
