mouthporn.net
#computer science – @folatefangirl on Tumblr

Fangirling and Writer-Nerd Chaos

@folatefangirl / folatefangirl.tumblr.com

I'm Cinnia, late 20s, she/her, a fan of the health sciences and many other things, and a former quiet kid who was abducted by the theater people. This blog is a semi-queued experiment to vent my endless energy for fandoms, LGBT+ content, writing, languages, religion analysis and ExMormon content, dancing, mental health, etc. I also run the Grate Scoff food blog as well as the Incorrect Rings of Power and Incorrect Thornfruit Quotes blogs.
Avatar
Avatar
ibetitdoes

not to enforce gender roles but a computer should NOT fucking have apps okay. if I wanted an app I'd go on my phone my laptop is for Programs. I mean this.

Avatar
mlembug

bringing the ancient meme back

Avatar
cazort

really glad to see hate for this concept.

count me among the haters. i've been a hater since i first encountered this conflation of many different well-defined concepts into one ill-defined one

it's part of the general trend of hiding the inner workings of computers from the user, which is something i hate since it tends to promote computer illiteracy which tends to promote incompetence and dysfunction in society as a whole

Avatar

There was a paper in 2016 exploring how an ML model was differentiating between wolves and dogs with a really high accuracy, they found that for whatever reason the model seemed to *really* like looking at snow in images, as in thats what it pays attention to most.

Then it hit them. *oh.*

*all the images of wolves in our dataset has snow in the background*

*this little shit figured it was easier to just learn how to detect snow than to actually learn the difference between huskies and wolves. because snow = wolf*

Shit like this happens *so often*. People think trainning models is like this exact coding programmer hackerman thing when its more like, coralling a bunch of sentient crabs that can do calculus but like at the end of the day theyre still fucking crabs.

non-sentient crabs that can do calculus, even

Avatar
hasufin

That's the thing we learned about genetic algorithms back when, too. They're waaaaay better at hacking the criteria than solving the problem.

(There were a bunch of other issues, too. They were good for some specific things, but... aggressively not, for others.)

Avatar

remember when you were 10 and you would hang out with your friends in order to Look At The Computer together like you went to their house and experienced the information superhighway together. and then leave

How fucking old are you people?

normal amount

Avatar
kunou78

You see, there used to be a time (not all that long ago) when being offline was the default. And going online was the rare and wonderful thing that we (briefly) enjoyed.

It even came with happy modem noises.

They weren't happy noises.

They were polite and reasonable noises! The sound of protocol being followed! Negotiation and compromise!

The box would scream

Avatar

Fixing Windows 11 - A Masterpost

Windows 11 is absolute crap for a good bunch of stuff so yeah, have a masterpost with most of the things I've found around to fix it haha (they're not all here because I did stuff around and I don't remember what)

Stopping W11 from making you create an account/setting up your computer without internet (with a way to create gpedit.msc if your version doesn't have it)

Fixing the Taskbar (in case you also don't like the grouped Battery/internet/sound)

Fixing the Start Menu

How to show all options right away

How to remove Copilot if you have it (I don't, but it might be because I'm in Europe??)

Other stuff not specifically related to the shit W11 did

Changing the size of the scroll bars

Avatar

The fact that there’s an actually functional website for the library of Babel is one of those things that fucks me up more and more the more I think about the implications.

So, if anyone hasn’t encountered the concept of the library of Babel, the idea comes from a story of the same name by Jorge Luis Borges, which is set inside a seemingly infinite library which contains every possible combination of letters, periods, commas and spaces that fits within 410 pages.

So like… It isn’t THAT out there that someone was able to make a digital version of it. Making an algorithm that randomly generates every possible combination of those 29 characters within that space and making a website that lets you explore those combinations are things that are pretty squarely within the scope of things you’d expect someone to be able to make a computer do.

But it begins to get pretty out there when you start thinking about all the things that are technically contained there (and that someone randomly browsing it could THEORETICALLY stumble upon) just by virtue of being one of those possible combinations of letters, spaces, commas, and periods.

Somewhere in that website there IS a book that specifically mentions me by full name before giving an accurate, excruciatingly detailed, 410-page long physical description of me. There’ also many more books that SEEM to be that but are actually factually inaccurate. There’s also versions of all of those containing every possible combination of every possible typo, spelling mistake, and grammatical error.

Somewhere in that website there IS a book that’s a perfectly accurate prediction of how and when I will die narrated in third person over the course of 410 pages. There’s also a book that contains the exact same events narrated in first person. Not only for me, but for every person in the world. There are many more that claim to be that but are actually inaccurate.

Somewhere in that website there IS a book that’s completely blank except for the world’s funniest dick joke written right at the end of the very last page.

But chances are no one browsing that website is EVER going to see any of that because for every book we would consider useful, interesting, or even intelligible there are millions upon millions upon millions more that are just completely full of gibberish from cover to cover.

Every single thing I will ever write (barring punctuation marks that arent periods or commas and the letter ñ) is already contained somewhere on that website.

I have a volume from the Library of Babel! it’s one of my most treasured books.

on the second to last page, about halfway down it reads “OH TIME THY PYRAMIDS” a singular grain of order in the sea of chaos.

The library of babel contains every book to ever exist and moreover it contains all information that can be encoded in a finite string of characters from its alphabet.

I cannot overstate how much I love the Library of Babel. it’s wonderful, it is my heart and soul.

at last we created the perplexing nexus, from the novel “wouldnt it be weird if there was a perplexing nexus?”

Avatar
Avatar
feyosha

Computers are very simple you see we take the hearts of dead stars and we flatten them into crystal chips and then we etch tiny pathways using concentrated light into the dead star crystal chips and if we etch the pathways just so we can trick the crystals into doing our thinking for us hope this clears things up.

AFAIK The only technologies that are more advanced are artisanal, small batch type manufacturing done explicitly for research and development purposes.

Semiconductors are the only mass-manufactured product that regularly has to take quantum mechanical effects into consideration as part of day-to-day operation, as far as I know.

So. Yea.

Avatar
Avatar
shithowdy

i thought my laptop was on its last leg because it was running at six billion degrees and using 100% disk space at all times and then i turned off shadows and some other windows effects and it was immediately cured. i just did the same to my roommate's computer and its performance issues were also immediately cured. okay. i guess.

so i guess if you have creaky freezy windows 10/11 try searching "advanced system settings", go to performance settings, and uncheck "show shadows under windows" and anything else you don't want. hope that helps someone else.

hey this is apparently helping a lot of people! adding that on top of this you can also go to settings > personalization > colors and turn off transparency to also boost performance. this wasn't the Big Fix for me but might as well do that too if you're trying to optimize.

past a certain note threshold on tumblr posts you unlock a bloodborne-esque insight and the strange lives of this site's users become visible to you

Avatar
Avatar
sniperct

They really don't. They grew up on tablets and phones and other closed systems that are completely different and much more simplified then a personal computer. They don't have the knowledge because it wasn't taught to them and they never used it in day to day life.

I work in tech support and I can back this up with personal experience; they're as bad as boomers are only instead of not understanding tech at all, they refuse to learn or dig into the tech. They can use it but they don't have the curiosity to understand it. And when tech breaks, they don't have the know-how to fix it on their own or to even look for how to fix it (a task made all the harder by the enshitification of the internet and AI articles that litter search results)

They consistently act surprised when 'turn it off and back on again' actually works because that's become a meme and not understood to be a valid first troubleshooting step.

The blame lies with the big tech companies, and how strongly they've pushed hardware and software that has been designed to be obtuse at best.

Avatar
Avatar
ms-demeanor

Things that will make your computer meaningfully faster:

  • Replacing a HDD with an SSD
  • Adding RAM
  • Graphics cards if you're nasty
  • Uninstalling resource hogs like Norton or McAfee (if you're using Windows then the built-in Windows Security is perfectly fine; if you're using a mac consider bitdefender as a free antivirus or eset as a less resource intensive paid option)
  • Customizing what runs on startup for your computer

Things that are likely to make internet browsing specifically meaningfully faster:

  • Installing firefox and setting it up with ublock origin
  • adding the Auto Tab Discard extension to firefox to sleep unused tabs so that they aren't constantly reloading
  • Closing some fucking tabs bud I'm sorry I know it hurts I'm guilty of this too

Things that will make your computer faster if you are actually having a problem:

  • Running malwarebytes and shutting down any malicious programs it finds.
  • Correcting disk utilization errors

Things that will make your computer superficially faster and may slightly improve your user experience temporarily:

  • Clearing cache and cookies on your browser
  • Restarting the computer
  • Changing your screen resolution
  • Uninstalling unused browser extensions

Things that do not actually make your computer faster:

  • Deleting files
  • Registry cleaners
  • Defragging your drive
  • Passively wishing that your computer was faster instead of actually just adding more fucking RAM.

This post is brought to you by the lady with the 7-year-old laptop that she refuses to leave overnight for us to run scans on or take apart so that we can put RAM in it and who insists on coming by for 30-minute visits hoping we can make her computer faster.

There are people saying "what does this mean" so:

Meaningfully Faster

  • HDD means "hard disk drive" SSD means "solid state drive." An HDD has a spinning platter and a mechanical arm that has to wiggle back and forth very quickly to read the information written on the platter, while an SSD is a silicon chip that has the information "flashed" into it (basically it takes a snapshot of bits with electricity - it's unimportant how it works, just that it doesn't require mechanical movement to access data, and is therefore MUCH faster).
  • Installing an SSD does *not* require an M.2 slot and does *not* require a ton of expertise on most computers. There are many SATA SSDs available and if you want to replace your HDD all that you need to do is clone your hard drive and look up how to install your new SSD on ifixit (link in a bit).
  • Your storage drive (SSD or HDD) is the drive that stores the files and programs on your computer. People often confuse long-term drive storage with "memory" and when they get low-memory warnings they think they need to delete stored files. This is inaccurate because:
  • RAM is Random Access Memory - if is basically the "working memory" of your computer, it keeps track of all the things that your computer does/is doing. For example: if you are working in your browser, the browser is not saving everything that you're doing to your desktop so everything that is going on in your browser is using RAM to remember what pages you've got open and what images are on them, etc. Same if you're working in any other program: if you're writing a word doc and you haven't saved it in a while, all the changes are being tracked by the RAM but are not yet saved on your storage drive. "Low Memory" doesn't mean you're out of room for your computer to hold onto files, it means your computer is low on memory to think about stuff. Because programs are written to use more and more memory as time passes (which is a good thing, for the most part, but causes frustrating problems for people with older devices) older computers will struggle to run modern programs with the amount of RAM the older computers have in them.
  • RAM comes in different speeds, and the speed of your RAM does matter but if you get RAM that matches the speed of your installed RAM you should experience better speeds from your computer. Some computers have the RAM directly soldiered to the motherboard and cannot have RAM added (this is pretty much true of all macs. I do not recommend buying macs.) but if your computer DOES allow you to add RAM it is generally an easy process - check IFixIt.Com for instructions: just search the model number of your computer and look at the replacement guide for RAM and you will be able to see if you can replace your RAM and see the steps to follow along if you want to do that. There are a large number of tools to look up what RAM will work in your computer. Personally I like the Crucial RAM Configurator tool. Just look up your computer and it will recommend compatible RAM. You don't have to buy the Crucial RAM, you can use the Crucial RAM configurator to get the specs you need and then search for yourself by Size, Memory Technology, Speed, Format, and number of Pins.
  • Graphics cards are for graphics processing and they're "if you're nasty" because adding a dedicated graphics card is kind of a big hurdle these days: computers are smaller inside than they used to be and graphics cards are big and they are also very expensive right now. Unless you are using a gaming laptop you likely cannot add a graphics card to your laptop, and unless you have at least a couple hundred dollars to spare you can't add a graphics card to your desktop either. But also they will only speed up a very specific subset of applications; if you don't do anything that requires a GPU to get shit done (video rendering, really complicated math, etc) then it isn't going to make your computer faster.
  • Norton and McAfee run in the background of your computer at all times and can suck up a lot of valuable memory, especially if they decide to randomly start a scan or check back in with the mothership to make sure you've paid for another seven years. They are annoying and they can slow everything else down - McAfee in particular is notorious for this. Modern Windows Operating systems (10 and 11) have a lot of security features built in at baseline and Windows Defender is a perfectly functional whole-device antivirus for almost all users. It was never true that macs couldn't have software infections or couldn't be infected with malware, it's just that *people used to write fewer malicious softwares for mac( and NOW there are a lot more infections written for OSX *and* mac users often think they don't need antivirus. Buds. You do need antivirus. Since Bitdefender only has a free virus scanner for Mac, not antivirus, and Sophos Free is only good for a month, and Avast has recently sold customer data, try Avira.
  • If you hit Ctrl+Alt+Del and click on "task manager" you will find a list of applications that run on startup. Programs that run on startup are turned on and active every time you start your computer, regardless of whether you use the program or not. So, for instance, let's say you have an HP printer and there are drivers for the printer on your computer but you only print things maybe twice a year. If your HP Printer Print Utility is set to run on startup that means that every time you start your computer it wakes up and turns on this program, which slows down your startup time and takes processing power in the background. So just disable it on startup and only start the program when you need it. The only difference here is that it'll take a few seconds to start the utility when it's time to print. Things that I've got disabled on startup include: Microsoft Edge, Adobe, and Cortana.

Faster Browsing

  • You should be using Firefox anyway because Firefox rules and in literally hundreds of tests performs as well or better than chrome. If someone says that firefox is ridiculously slow it is because they've either set up a really weird configuration, have had the install go wrong, or just don't understand what they're doing. But ASIDE from that you should use Firefox with Ublock Origin because Ublock Origin is an *amazing* adblocker that doesn't whitelist ads and lets you do things like manage cookies and block annoying popups and a bunch of other stuff. And as of 2023 it will be impossible to install Ublock Origin on Chrome so you may as well switch to Firefox now and live that ad-free life. Ad-blocked browsing speeds up your browser because your computer isn't spending extra energy on loading ads and less shit loading means less RAM utilization means faster computing.
  • Auto Tab Discard is an extension that allows you to set tabs to sleep after an amount of time that you determine. You can customize your experience so that it doesn't sleep tabs with partially filled forms, or doesn't sleep tabs with media playing in them, or doesn't sleep a specific tab for a session, or doesn't ever sleep tabs from a specific website. Many people use OneTab, which basically turns your tabs into a bookmark list that opens the page when you click in OneTab, but I personally prefer Auto Tab Discard - neither of those are "better" than the other, it's just a question of what works for you.

Faster if you are having an actual problem

  • Open tabs eat RAM. Your computer is expending memory thinking about what's on all of those pages and even if most of them are slept you probably have more awake tabs than you need.
  • Malwarebytes has a free virus/malware scanning and removal tool that you can download and run to identify and remove malware or viruses that may be stealing your information and running background programs that are fucking up your speeds. If you think your computer might be slow because you might have a virus, go to malwarebytes, download the free version, run a scan, and follow the instructions that it gives you.
  • Disk utilization errors happen for a number of reasons and they will all slow down your computer. Here are some of the reasons they happen and some ways to fix them. If you hare having a persistent disk usage error issue *DO NOT* replace your HDD with an SSD until it is resolved because these errors can destroy an SSD.

Superficially Faster

  • Your browser cache and cookies are the things that your browser remembers between sessions: it might be logins for various sites or the logos of your homepage or a bunch of other stuff. If your browser is running super slow because you haven't cleared cache/cookies for a long time, doing this can help because it will basically make your browser forget about the cookies it has been carrying around for five years for a site you haven't logged into since four years and seven months ago. These are tiny pieces of data but they can add up over time which is why they will make your browsing experience faster if you haven't done it in years but won't make a difference if you clear your cookies and cache monthly. Doing this will log you out of all the sites you're logged into (which shouldn't be a huge deal, but you do need to be ready to log in a bunch, so you should HAVE A PASSWORD MANAGER TO MAKE THAT EASIER). This will make your browser run a bit faster, but if your computer is slow for non-browser reasons or if your cache/cookies have been cleared recently it will have no impact on your computer speeds.
  • Restarting your computer will speed your computer up for a bit because it will shut down all the programs you had running and clear your computer's RAM of all the stuff it was remembering for as long as you had those programs open. This is, again, a superficial improvement, and if you have, say, restarted your computer three times today it is unlikely that restarting again will do anything for you. But yeah if it's been a couple weeks, friend, restart your computer it is tired and it wants to run patches and it needs to forget the twenty docs you're editing for a minute so it can get its head on straight.
  • Reducing your screen resolution can make your computer run a bit faster if you're doing graphics-heavy stuff and have really shit built-in graphics support, but realistically all it is likely to do is make your computer look weird.
  • Your browser extensions run on startup and cause your browser to use more RAM at baseline; you should be installing minimal browser extensions and uninstalling any that you don't need. If it's been a while since you checked your extensions go uninstall the unused ones right now and restart the browser and you may speed up a bit. But the majority of people in the world run pretty minimal extensions so this is unlikely to fix most people's problems.

Will not make your computer faster

  • Unless your storage drive is within 10% of its maximum capacity it is in no way fucking up your speed. If it is within 10% of capacity go delete whatever's in your downloads folder, clear out your temp files, and empty your recycle bin. If that doesn't take care of it, use a backup drive or a thumb drive and copy off the photos, videos, and music you don't use very often, and only keep what you're using on your computer. If you've got an SSD having too little storage available can fuck with the lifespan, so aim to keep at least 10% free.
  • Registry cleaners are supposed to clean your computer's registry; basically they're supposed to look at the index of all the stuff your computer knows and delete redundant or outdated entries on the index. However it has been literal decades since a registry cleaner would be able to meaningfully improve performance while they can still absolutely fuck your computer up bad enough that it needs a reinstall and a lot of them are just straight up malware, some of them are actual for-realsies ransomware, a bunch of them sell your data, and you don't need ANY of them. If you don't know enough to fuck around with your registry by hand you shouldn't trust some random ass software that has a history of data breaches and virus infections associated with it. Fuck all registry cleaners.
  • Manual drive defragging has been obsolete on Windows Machines since Windows 7 shipped with an automatic defragging utility in 2011. In the past when you saved a file to a hard drive it might save that file in a bunch of little pieces scattered all over the platter, so accessing that file meant accessing all the *fragments* of that file and "defragging" meant taking some time every once in a while to put files together into whole files instead of fragments. But then eleven years ago Windows made that process automatic, so unless you have *seriously* fucked up your computer somehow it shouldn't be an issue, and if it is an issue you can run a defrag and then make sure automatic defragging is enabled. But that whole conversation is probably moot since SSDs don't need to be defragmented (and defragging them actually adds to the device wear). So basically if your computer actually DOES need a defrag it is probably more fucked than you initially thought and unless that is the case then it is already defragging so doing an additional defrag won't make anything faster and/or might cause extra wear to an SSD.

If you are using an older computer and it is tortuously slow and you can't upgrade the RAM or swap in an SSD but you *can* install software, I would strongly recommend installing Linux.

The vast majority of people these days use computers primarily for web browsing and occasionally use an office suite. Most people don't even store their music or photos on their computers these days. If your computer is basically an internet machine that you don't use for anything other than browsing and basic word processing/spreadsheets, then there is no reason not to switch to Linux.

I would recommend installing Linux Lite, which is a well-supported, widely-used distro that most computers built within the last twenty years should be able to run. It's called "Lite" because it is an operating system that requires minimal processing, storage, and memory to run.

To do this you will need:

  • A 4GB USB stick or a blank DVD to create installation media
  • An internet connection to download the software
  • Another computer or phone that you can use to follow the instructions, or a printed/written copy of the installation guide
  • A computer on which to install Linux. Installing Linux as the sole operating system on this computer (which is my recommendation here) will mean that everything else on the computer is deleted. Save all of your files/movies/music/etc. to a backup drive or another computer before you install Linux, because installing a new OS will delete literally everything on the computer and make it impossible to get back.

Here is the Linux Lite help manual, I've linked you directly to the step-by-step installation guide. If you get to a step that you don't understand, search the terms that you're unfamiliar with phrased as questions like "why do I need an ISO to install linux?" or "how do I enter UEFI BIOS on my version of Windows?"

Follow the instructions for installing Linux Lite, and then you can install software on your new OS. Linux Lite comes bundled with Libre Office, which is a free office suite like Microsoft Office that has programs similar to Word, Excel, and Powerpoint. It installs with Chrome, so I would recommend installing Firefox and uninstalling Chrome as soon as you're able to get on the internet.

After that you can pick and choose from a wide variety of linux-compatible software and you can use your computer as a regular computer. I'd say that it's probably a good idea to be picky about what software you install, and to try to keep your computer as lean as possible if you're trying to get a longer life out of an old device.

There are a lot of people in the notes who are saying "my computer can't even load four tabs" or "my computer can't even open the software I need to do this" and if that is your computer I think you've pretty much got nothing to lose from installing Linux. If your computer is essentially unusable in Windows then it's probably not going to be *less* functional in Linux, unless you're keeping the computer the way that it is for some very specific software you're using.

A lot of people think that they can't uses Linux because it's entirely in the command line or doesn't have a graphic interface or something, but most linux distros only look about as different to users as windows does to mac or vice versa.

This is what the Linux Lite default desktop looks like:

It's got a start menu and a task bar and folders on the desktop, same as any other computer. It's very easy to use and has the help manual installed right there as soon as you're up and running so that you can troubleshoot your way through any issues.

Really, seriously: if your computer is slow as fuck and all you use it for is web browsing, this will make your computing experience significantly better without having to buy anything but a thumb drive.

People are reblogging just the first section of this post and asking what these things mean, so I'm reblogging this in hopes that they'll see the comprehensive update in the notes.

Avatar
reblogged
Avatar
dedalvs
Anonymous asked:

can you make a translator for firish i want to use it in my rps i have with friends

I've actually gotten this question a couple times, which is great! But this type of thing just isn't possible with a conlang. It has nothing to do with the quality of the conlang or the level of completion (i.e. the amount of vocabulary, how much of the grammar has been recorded, etc.), and I'll tell you specifically why.

First, you may have seen "translators" for various languages online like LingoJam. LingoJam not only has translators for a bunch of different languages, but allows you to make your own translators. The way these work, though, is you write down a word in one language and write its translation into another—something like:

English > Spanish

  • I > yo
  • am > soy
  • to > a
  • the > el
  • store > tienda
  • going > yendo

That is, you put in one to one correspondences, and that's what it has to work with. Once you're done, if you ask for a translation, it looks up the words and sees what's available and it spits back what it has, in order. If we had this very minimal English to Spanish dictionary (which is 100% accurate, by the way! That is, all of these English words can be translated as all of these Spanish words), you could ask LingoJam to translate the following into Spanish...

I am going to the store.

...and you would get...

Yo soy yendo a el tienda.

Now, if you speak Spanish, you'll see all the places this went wrong. (Short version: You don't always need subjects pronouns in Spanish; you use a different helping verb for "to be x'ing" in Spanish; you rarely actually use this "to be x'ing" construction in Spanish; the present tense is sufficient; though el means "the", it's the wrong gender for tienda—analogous to saying "an store" as opposed to "a store" in English.) And you can actually avoid this in LingoJam by adding phrases on top of single words:

English > Spanish

  • the store > la tienda
  • I am going > voy

But you can imagine how much work that would be...

The reason why things like LingoJam are so popular, though, is because imagine if you knew nothing about Spanish. Typing in "I am going to the store" and having it instantly spit out "Yo soy yendo a el tienda" is pretty darn satisfying! If you don't know it's wrong but you're happy with it, what's the problem?

Now, a language like Spanish is huge, so it's easier to get accurate Spanish translations online than it is to get accurate Korean translations online—and it's easier to get accurate Korean translations online than accurate Tigrinya translations online, etc. The reason for that takes us to Google Translate.

I think most people know that with LingoJam, you get what you pay for. Google Translate, on the other hand, is much more sophisticated, and much more accurate. It's not 100%, but it's pretty darn good—for widely spoken languages. This is why.

Way back when, Syfy facilitated a chat between me and the folks at Google Translate because they wanted to see if Google and I could work together to create a translator for a couple of my Defiance languages at TED in 2013. After all, we had a full two weeks. We could bang something like that out in two weeks, right? (lol no)

I learned then how Google Translate works. Google Translate doesn't actually know anything about the specific grammar of a language—maybe a couple language specific tweaks, but it's not as if you can go under the hood and find a full grammar of Spanish that tells you when to use the subjunctive, what all the conjugations are, etc. Instead, what Google Translate has is a database (i.e. Google, along with Google Books, Google Scholar, etc.) with tons of, presumably, fluent documents written in the various target languages offered on Google Translate. They also have faithful translations of those documents—not all, but a percentage. Google Translate uses that information to predict what a given sentence in one language will turn into in another.

In order to do this successfully, Google Translate needs BILLIONS of documents to troll. And it has that. It has BILLIONS of articles written in Spanish and translated to English. That's why the English to Spanish translation is as good as it is.

Now, having said that, anyone who's bilingual in English and Spanish knows that Google Translate isn't perfect. Sometimes it's pretty good, but sometimes it produces a lot of clunky, unnatural, or even incorrect translations. This is because there isn't a human back there calling the shots.

But that's its best translator. Now imagine translating between English and Samoan (one of the other languages it offers). There are EXPONENTIALLY more online articles in Spanish than Samoan. Consequently, the translations you get between English and Samoan on Google Translate are absolutely no guarantee.

And bear in mind, there's a kind of minimum threshold they work with before adding a language to Google Translate. If Samoan is on there and not Fijian, it's because there's that much more Samoan online than Fijian.

Now let's go back to conlangs. What Google Translate wants is BILLIONS of articles written online in the target language. Forget how complete the grammar of a conlang is, whether you can find that description online, or how many thousands of words the conlang has. How many fluent articles are there written in that conlang that are online? How many can one person to? How about a team of people? And how many conlangs have that?

This is why Google Translate has Esperanto and nothing else. Esperanto has been around for 136 years, and in that time there have been a good number of people who have learned to speak it fluently, and have written things (poems, articles, books) that are now online. It is as much as Spanish? Certainly not, but it is enough to hit Google Translate's minimum threshold, and so it's available.

Assuming you have a conlang with a full grammar and a good amount of vocab, if it were popular, it might have enough available material for Google Translate to work with 125 years from now. But at the moment, it's not possible. That says nothing about the language: It's about how Google Translate works.

And bear in mind, Google Translate is, at the moment, our best non-human translator.

If predictive-AI gets good enough that it can learn the grammar of a language, then it may be possible to produce a translator for a new conlang. That, though, is not the goal of Google Translate. Maybe ChatGPT and things like it will get there one day, but even that isn't a dedicated language learning AI. We need an AI that doesn't work with billions of fluent articles, but works with two books: a complete grammar and a dictionary. If an AI can one day work with those two tiny (by comparison) resources and actually produce translations that are as good as or better than Google Translate, then we'll be at a "translation-on-demand" place that will be good enough to feed a new conlang to. At that point, it will simply be a matter of producing a grammar and lexicon of sufficient size for the AI to do its thing.

So, no, right now we can't do a Ts'íts'àsh translator. :( We can go over things like the sound system and basic grammar and you can create your own words to work with it... A lot more work, but hey, we don't have to churn our own butter or milk our own cows anymore! We've got time!

Avatar
Avatar
paradoxcase

So, the thing is, it is possible to make a translator that can do on demand translation of a conlang, or any other language that you have a complete grammar and dictionary for, that does not need billions of parallel texts to work, and which will actually be much more accurate that Google translate could ever hope to be, even with translation between English and Spanish. This style of NLP that is statistics based, that relies on huge amounts of data, is not the only kind of NLP there is, it's just the kind that is overwhelmingly favored by companies like Google and Microsoft, because it has the advantage of always being able to give an answer, even if the answer is wrong. However, there are other types of systems that don't rely on statistics, which are much more strongly based in actual linguistics, which will not always have an answer for you, but when they do have an answer it is guaranteed to be correct. Since it doesn't rely on statistics, it doesn't need all that data - it just needs a linguist dedicating a significant amount of time to manually encoding all of the rules of the language.

When I was in graduate school, I helped encode the grammar of a language into such a system. The system is called the LinGO Grammar Matrix, which allows you to encode the grammar in a coding language called tdl according to a head-driven phrase structure grammar analysis. HPSG is defined in such a way that a computer can analyze a sentence, use HPSG-specified rules to generate an actual language-independent semantic representation of the sentence, and then use the HPSG-specified rules of another language to generate the sentence in that other language that matches the semantic representation. The language I was working with was an Arawakan language called Nanti, for which very little data existed at the time - there was a grammar from a field linguist, and a few other papers that that guy had written (and now, after the work we did with it, there's another short paper on its negation system co-authored by me, and I think there may now be at least one other paper written by the guy I worked with, I know he continued to do some research there). But the point is, this is exactly the kind of language Google translate cannot handle, there are no parallel texts at all, but at the end of that course, we had a system that could automatically, accurately, translate between a number of languages of varying obscurity, including Nanti.

So, it's very much possible. ChatGPT and its ilk are never going to get there, though, those AIs are based on the compete opposite principle and will never not need massive amounts of training data to function properly. They are driving away from the ability to do things like this as fast as they possibly can.

This is really cool! HPSG was the one syntactic framework I could get behind. I wish it got more attention, because it would be great to have a tool like this available generally for the public. Would love to hear more about this!

This was almost a decade ago that I did this, so I don't have a lot of specific memories of how it all worked, but here is what I remember:

I believe that this system is basically available generally to the public, although you will probably not have a good time with it if you aren't pretty familiar with HPSG and linguistics in general. You start by going to this customization page: https://matrix.ling.washington.edu/customize/matrix.cgi You answer the questions based on how the language works, and once you've answered all of the required questions it will generate a grammar for you. If you want to complete the questions in several sittings, you can have it convert the partially completed form into a "choices" file, which you can upload again later to reenter the choices from before. However, this is still a work in progress, so the grammar it generates is not complete, and you will have to manually code a good deal of stuff after generating it.

There was a very useful book for this course that explained the tdl language, and how to define rules in the grammar, how to run a test suite of sample sentences, etc. This all ran as some kind of emacs plugin. I believe this was the book: https://www.amazon.com/Implementing-Typed-Feature-Structure-Grammars/dp/1575862603/ref=sr_1_1?qid=1688611024&refinements=p_27%3AAnn+Copestake&s=books&sr=1-1

If it's of any help, I still have the grammar I did for this course: https://github.com/faiuwle/Nanti Our grammar is in the cox folder, possibly of more use to you is the eng folder which has a very stripped-down version of an English grammar that we used for translation purposes, and frr is Frisian. The files and folder structure of these grammars are generated for you by the customization page, and building out the grammar is a matter of editing and adding to the files. The tdl language is mainly declarative and not super program-y, I think it should be easy to figure out if you're familiar with HPSG, even if you aren't necessarily a programmer. There exists a very large, far more complete English grammar called the ERG, but I'm unsure about its availability outside of academic institutions.

Okay, programmers. There’s the stuff. What can we do with this?

Avatar
reblogged
Avatar
sreegs
Anonymous asked:

What is DDOSing? And i guess why is it bad that it's how Twitter is stopping people from using it?

'DOS' stands for Denial of Service. It's an attack where a malicious actor uses one machine to repeatedly make requests of an online service in order to tie up its resources and degrade its service.

'DDOS' means Distributed Denial of Service. It's the same thing except the attacker uses multiple machines or at least multiple clients, usually distributed across multiple locations, to do the same thing except now it's harder to shut out the attacker. It's also usually much higher volume of requests so the scale is bigger.

Basically what happened is Twitter rate limited users from reading too many tweets. However they didn't change their own client (the website or app) to account for this, so the twitter clients kept getting errors and automatically retrying. They DDOSed themselves because their own client was repeatedly making requests that were being blocked for rate limiting. And whenever the client received a rate limit error they would retry the same request again and again and again. Their own users were doing this without being aware and not at any fault of their own. All they had to do is just go to twitter.

Avatar
You are using an unsupported browser and things might not work as intended. Please make sure you're using the latest version of Chrome, Firefox, Safari, or Edge.
mouthporn.net