Filter HTML tags #29
Miroslavsckaya/tg_rss_bot#29
We need to filter unsupported HTML tags
https://core.telegram.org/bots/api#html-style
Possible options:
- `get_text()`
- `unwrap()`
- html-sanitizer
- `Bleach.clean()`

It's also possible to sanitize HTML right in the `RssReader` with proper `FeedParser` configuration. But it's better to sanitize it in `telegram.Notifier` because those are platform-specific restrictions.

`lxml` didn't work out well because it was constantly wrapping the provided string or document in the `div` element as a 'container'.
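
To make the allow-list idea concrete, here is a minimal sketch of tag filtering that could sit on the notifier side. It uses the standard library's `html.parser` rather than any of the options listed above, purely so the example has no dependencies. The names `ALLOWED_TAGS`, `TelegramHtmlFilter` and `filter_html` are hypothetical, and the tag set is an assumed subset that should be checked against the Telegram documentation linked above.

```python
from html import escape
from html.parser import HTMLParser

# Assumption: a subset of the tags Telegram accepts in HTML parse mode.
# Verify against https://core.telegram.org/bots/api#html-style before use.
ALLOWED_TAGS = {
    "b", "strong", "i", "em", "u", "ins",
    "s", "strike", "del", "a", "code", "pre",
}


class TelegramHtmlFilter(HTMLParser):
    """Drop tags outside ALLOWED_TAGS while keeping the text inside them."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag not in ALLOWED_TAGS:
            return  # unsupported tag: skip the tag, keep its children
        if tag == "a":
            # Keep only href; other attributes (class, style, ...) are dropped.
            href = dict(attrs).get("href") or ""
            self.parts.append(f'<a href="{escape(href, quote=True)}">')
        else:
            self.parts.append(f"<{tag}>")

    def handle_endtag(self, tag):
        if tag in ALLOWED_TAGS:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        # Character references were decoded by the parser; re-escape the text.
        self.parts.append(escape(data, quote=False))


def filter_html(text: str) -> str:
    """Return text with every tag outside ALLOWED_TAGS removed."""
    parser = TelegramHtmlFilter()
    parser.feed(text)
    parser.close()  # flush any buffered trailing text
    return "".join(parser.parts)


if __name__ == "__main__":
    print(filter_html('<div><p>Hello <b>world</b></p></div>'))
```

This behaves like `unwrap()` applied to every unsupported tag: the markup goes away but the text stays, and no container element is added around the result. The sketch does not balance unclosed tags, and it also keeps the text content of elements such as `script` or `style`, so a real implementation would likely need extra handling for those.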