FriendlyBeagleDog@lemmy.blahaj.zone to Selfhosted@lemmy.world • I just developed and published a script to clear your pict-rs object storage from potential CSAM.
1 year ago
I'm not well versed in the field, but I understand that large tech companies that host user-generated content match the hashes of uploaded files against a list of known-bad hashes as part of their strategy to detect and tackle such content.
Would it be possible to adopt a strategy like that as a first pass, both to improve detection and to reduce the compute load of running every file through an AI model?
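To make the idea concrete, here is a rough Python sketch of a hash-first check. It is not what the script does: `should_flag`, `known_bad`, and `classify` are hypothetical names, the hash list is assumed to be available as a set of SHA-256 hex digests, and `classify` stands in for whatever AI model is used. Note that an exact cryptographic hash like this only catches byte-identical copies of a file.

```python
import hashlib
from typing import Callable, Set


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the file contents as a hex string."""
    return hashlib.sha256(data).hexdigest()


def should_flag(
    data: bytes,
    known_bad: Set[str],               # hypothetical set of known-bad SHA-256 digests
    classify: Callable[[bytes], bool],  # hypothetical stand-in for the AI model
) -> bool:
    """Flag a file, trying the cheap hash lookup before the expensive model."""
    if sha256_hex(data) in known_bad:
        # Exact match against the list: no need to run the model at all.
        return True
    # No match, so fall back to the model for this file.
    return classify(data)
```

Only files that miss the hash list ever reach the model, which is where the compute saving would come from.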
Ah, of course - that’s unfortunate, but thanks for the pointer.