Dendarii
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
☆ Yσɠƚԋσʂ ☆@lemmy.ml to Open Source@lemmy.mlEnglish · 5 months ago

Microsoft open-sourced a Python tool for converting files and office documents to Markdown

github.com

external-link
message-square
23
fedilink
121
external-link

Microsoft open-sourced a Python tool for converting files and office documents to Markdown

github.com

☆ Yσɠƚԋσʂ ☆@lemmy.ml to Open Source@lemmy.mlEnglish · 5 months ago
message-square
23
fedilink
GitHub - microsoft/markitdown: Python tool for converting files and office documents to Markdown.
github.com
external-link
Python tool for converting files and office documents to Markdown. - microsoft/markitdown
  • utopiah@lemmy.ml
    link
    fedilink
    arrow-up
    0
    ·
    5 months ago

    Thanks for the clarification. I checked the code you linked and noticed recognize_google and seems it’s relying on https://github.com/Uberi/speech_recognition which then seems to rely on https://github.com/Uberi/speech_recognition/blob/master/speech_recognition/recognizers/google.py so basically are they using an API, sending all the audio data to Google servers?

    • django@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      1
      ·
      5 months ago

      Yes, this is how I read it as well. The library would support to use a local model, but they decided to just send the audio data to Google.

      • utopiah@lemmy.ml
        link
        fedilink
        arrow-up
        3
        ·
        5 months ago

        Might open up a GDPR related issue there. I don’t think people using such a library assume they need connectivity nor that their data would be send to a 3rd party.

Open Source@lemmy.ml

opensource@lemmy.ml

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

  • Open Source Initiative
  • Free Software Foundation
  • Electronic Frontier Foundation
  • Software Freedom Conservancy
  • It’s FOSS
  • Android FOSS Apps Megathread

Rules

  • Posts must be relevant to the open source ideology
  • No NSFW content
  • No hate speech, bigotry, etc

Related Communities

  • [email protected]
  • [email protected]
  • [email protected]
  • [email protected]
  • [email protected]

Community icon from opensource.org, but we are not affiliated with them.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 250 users / day
  • 1.26K users / week
  • 3.76K users / month
  • 10.5K users / 6 months
  • 1 local subscriber
  • 36.7K subscribers
  • 1.66K Posts
  • 25.7K Comments
  • Modlog
  • mods:
  • Evan@lemmy.ml
  • kevincox@lemmy.ml
  • CrypticCoffee@lemmy.ml
  • Lettuce eat lettuce@lemmy.ml
  • BE: 0.19.9
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org