The “em-dashes” (—) come up a lot in online translations of books like the Bible and the Quran.
The normal keyboard “-” and a doubled “--” are different from “—”, but Microsoft Office auto-formats “--” to that.
I kinda assumed it was ALL Microsoft Word data in the training set that caused AI to pick that up.
I am only now realizing AI stole from even the religious texts and was influenced by them as well.
Not all are; it’s just that many translations are old enough to be public domain. But some, like the English Standard Version of the Bible, aren’t public domain, vs. the Geneva Bible, which is.