• 0 Posts
  • 97 Comments
Joined 10 months ago
cake
Cake day: February 15th, 2025

help-circle

  • I guess my question should be how they managed to do this without having to create an account and profile users

    cookies

    If it’s a cookie, the history should not be there if you clear your cookies or open it in a private tab.

    The whole chat could be stored “in the browser” but more likely they have it on their server and associate it to you via the cookie. *edit: I guess it is this then according to what afk posted.

    If it is not a cookie, there is also your IP and lots of alternative fingerprinting ways of uniquely identifying your browser (see creepjs ). You could use a VPN and disable js in your browser, but that breaks half the internet nowadays.


  • the protocol is text only, to embed media, you need to host it on the regular ( Centralized ) internet

    except we already figured out how to encode images (or any file) as text when E-Mail was created. That is how images in E-Mails, attachment or embedded, are done. I can easily imagine a userJS script that will render them in the browser, but even if not you just copy the text and decode.

    if a community is badly moderated, the user will never see it, it wont be recommended to him. the user can visit bad communities directly just like you can visit a bad website directly, but it’s not recommended to you so it’s safe to use.

    Ah… so you’re guaranteed to have a dark CSAM subculture on there at some point.

    being p2p, seedit is not private, so it can’t really be used for illegal activity

    As if that has ever stopped anybody. See all the people that got caught for sharing it on the clearnet. Or on Signal, Telegram or similar, where you have to enter your phone number, which is personally tied to you.


    All in all - Great way to adress the concerns, by admitting they are in fact possible. “Hurray crypto” or whatever.



  • to add to what Elvith wrote:

    you can read the HTML like structures inside a PDF and then find out details about the elements you want to remove and then remove them by using that found common property.

    This is very hard to do by hand. But if you are curious you can download https://file-examples.com/wp-content/storage/2017/10/file-sample_150kB.pdf

    and open it with a text editor like kate. You will see a lot of encoded content data, but also the “html-like” structure in plaintext (in between the encoded stuff but also more at the bottom)

    You might find that editing the PDF by hand will break it completely, that is because it is complicated. Iirc you’d need to fix the index, recalculate the checksum or do some other magic bullshit. But that is often taken care of by the library.

    Here is a shitty python example for that demo pdf that redacts the image at the last page by drawing a white rectangle over it. There is no way in pymupdf to delete an image or a textblock … but this is just an example. Other libraries might be able to do it (the one I used a decade ago in java could). I just wanted to point you in the general direction, hope you can see from here how iterating over all the pages, picking the right element and redacting it would work.

    import pymupdf  # PyMuPDF
    
    # Open the PDF
    doc = pymupdf.open("./file-sample_150kB.pdf")
    
    # Get the last page
    page = doc[-1]
    
    # Get all images on the page
    images = page.get_images(full=True)
    
    if images:
        # Get the xref of the first image
        xref = images[0][0]
    
        # Find all instances of the image and redact their bounding boxes
        for info in page.get_image_info(xrefs=True):
            if info["xref"] == xref:
                rect = pymupdf.Rect(info["bbox"])
                page.add_redact_annot(rect, fill=(1, 1, 1))  # white fill
    
        page.apply_redactions()
    
    # Save the modified PDF
    doc.save("./modified.pdf")
    doc.close()
    

    A way simpler approach might be to crop all pages at the bottom.

    import pymupdf  # PyMuPDF
    
    doc = pymupdf.open("input.pdf")  # open the PDF
    
    for page in doc:
        rect = page.rect  # original page size
        new_rect = pymupdf.Rect(rect.x0, rect.y0 + 100, rect.x1, rect.y1)  # crop bottom 100px
        page.set_cropbox(new_rect)
    
    doc.save("output.pdf")  # save the cropped PDF
    doc.close()
    

    Here are the docs: https://pymupdf.readthedocs.io/en/latest/the-basics.html







  • It was posted 3x to the [email protected] community. Or at least it looks to me like 3 different accounts posted the same thing to this very community.

    I don’t really care about how it works, I’m just tired of the chan-esque experience where I have to question my sanity because I see the same posts every day.

    Just because people that don’t actually participate in a given community, thus not seeing the older posts, share the same article because they look for a community that fits and dump it there.

    Some subreddits had bots that detected and removed reposts and guided OP to the original post for them to add their discussion points.