Skip to content

Posts are no longer scanned for additional urls after recent API update #282

@AlexCSDev

Description

@AlexCSDev

Recently patreon updated their API to represent post contents as json (content_json_string field) instead of html (content field, now gone). This broke additional url extraction using PluginManager.ExtractSupportedUrls because it expects html data.

In order to fix that PluginManager and plugins are likely need to be updated with additional enum input variable which will tell plugin what kind of input data is provided (HTML or Text) and PatreonPageCrawler will likely need to pre-process json to make a raw text out of it.

Update:

Quick glance at the json format shows that all links are possibly being provided as "type":"link" entries in the json, so there might be a better way of handling it - finding all of those entries in json and creating a new subentry with the type of PatreonCrawledUrlType.ExternalUrl for each of them might work. That will make previous call of PluginManager.ExtractSupportedUrls completely obsolete.

Please note that contributions created with the help of AI agents are not accepted.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workinghelp wantedExtra attention is needed

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions