Add an argument to enable/disable table de-nesting #1

@DavidGinzberg

Description

As described in source:

...
    #Removes nested tables. for handling the sins of 1990's web pages.
    #TODO: Add an argument to enable/disable table de-nesting
    [t.extract() for t in table.findAll("table")]
    #This would grab all TRs regardless of depth without the above line removing nested tables
    for row in table.findAll('tr'):
...

The table de-nesting hack is currently the default behavior. For better backward compatibility and flexibility, it may be better to make it off by default and add a flag to enable it.
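
A rough sketch of what the opt-in flag could look like, assuming the surrounding code works on a Beautiful Soup 4 `Tag` as in the snippet above. The function name `collect_rows` and the parameter name `denest` are hypothetical, not taken from the project, and `find_all` is used in place of the legacy `findAll` alias:

```python
from bs4 import BeautifulSoup


def collect_rows(table, denest=False):
    """Return the <tr> elements found under a bs4 <table> Tag.

    `denest` is a hypothetical flag name. It is off by default, as
    proposed in this issue; when True, nested tables are removed first.
    """
    if denest:
        # Note: this mutates the parsed tree by detaching nested tables.
        for nested in table.find_all("table"):
            nested.extract()
    # Without de-nesting, this also returns rows of any nested tables.
    return table.find_all("tr")


html = """
<table>
  <tr><td>a</td></tr>
  <tr><td><table><tr><td>inner</td></tr></table></td></tr>
</table>
"""
soup = BeautifulSoup(html, "html.parser")
outer = soup.find("table")
all_rows = collect_rows(outer)                # nested rows included
top_rows = collect_rows(outer, denest=True)   # nested table removed first
```

With the flag off, the nested table's row is counted along with the two outer rows; with it on, only the outer rows remain. Because `extract()` changes the tree in place, callers that need the original markup afterwards would have to re-parse it.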

Labels: enhancement (New feature or request)
