Natural Language Programming AIs are taking the drudgery out of coding

But how long until they take the human out of it as well?

Former Senior Editor

Updated Fri, Jun 23, 2023, 10:00 AM·8 min read

Kunakorn Rassadornyindee via Getty Images

“Learn to code.” That three-word pejorative is perpetually on the lips and at the fingertips of internet trolls and tech bros whenever media layoffs are announced. A useless sentiment in its own right, but with the recent advent of code generating AIs, knowing the ins and outs of a programming language like Python could soon be about as useful as knowing how to fluently speak a dead language. In fact, these genAIs are already helping professional software developers code faster and more effectively by handling much of the programming grunt work.

How coding works

Two of today’s most widely distributed and written coding languages are Java and Python. The former almost single handedly revolutionized cross-platform operation when it was released in the mid-’90s and now drives “everything from smartcards to space vehicles,” as Java Magazine put it in 2020 — not to mention Wikipedia’s search function and all of Minecraft. The latter actually predates Java by a few years and serves as the code basis for many modern apps like Dropbox, Spotify and Instagram.

They differ significantly in their operation in that Java needs to be compiled (having its human-readable code translated into computer-executable machine code) before it can run. Python, meanwhile, is an interpreted language, which means that its human code is converted into machine code line-by-line as the program executes, enabling it to run without first being compiled. The interpretation method allows code to be more easily written for multiple platforms while compiled code tends to be focused to a specific processor type. Regardless of how they run, the actual code-writing process is nearly identical between the two: Somebody has to sit down, crack open a text editor or Integrated Development Environment (IDE) and actually write out all those lines of instruction. And until recently, that somebody typically was a human.

The “classical programming” writing process of today isn’t that different from the process those of ENIAC, with a software engineer taking a problem, breaking it down into a series of sub-problems, writing code to solve each of those sub-problems in order, and then repeatedly debugging and recompiling the code until it runs. “Automatic programming,” on the other hand, removes the programmer by a degree of separation. Instead of a human writing each line of code individually, the person creates a high-level abstraction of the task for the computer to then generate low level code to address. This differs from “interactive” programming, which allows you to code a program while it is already running.

Today’s conversational AI coding systems, like what we see in Github’s Copilot or OpenAI’s ChatGPT, remove the programmer even further by hiding the coding process behind a veneer of natural language. The programmer tells the AI what they want programmed and how, and the machine can automatically generate the required code.

Building the tools to build the tools allowing any tool to build tools

Among the first of this new breed of conversational coding AIs was Codex, which was developed by OpenAI and released in late 2021. OpenAI had already implemented GPT-3 (precursor to GPT-3.5 that powers BingChat public) by this point, the large language model remarkably adept at mimicking human speech and writing after being trained on billions of words from the public web. The company then fine-tuned that model using 100-plus gigabytes of GitHub data to create Codex. It's capable of generating code in 12 different languages and can translate existing programs between them.

Codex is adept at generating small, simple or repeatable assets, like “a big red button that briefly shakes the screen when clicked” or regular functions like the email address validator on a Google Web Form. But no matter how prolific your prose, you won’t be using it for complex projects like coding a server-side load balancing program — it’s just too complicated an ask.

Google’s DeepMind developed AlphaCode specifically to address such challenges. Like Codex, AlphaCode was first trained on multiple gigabytes of existing GitHub code archives, but was then fed thousands of coding challenges pulled from online programming competitions, like figuring out how many binary strings with a given length don’t contain consecutive zeroes.

To do this, AlphaCode will generate as many as a million code candidates, then reject all but the top 1 percent to pass its test cases. The system then groups the remaining programs based on the similarity of their outputs and sequentially test them until it finds a candidate that successfully solves the given problem. According to a 2022 study published in Science, AlphaCode managed to correctly answer those challenge questions 34 percent of the time (compared to Codex’s single-digit success on the same benchmarks, that’s not bad). DeepMind even entered AlphaCode in a 5,000-competitor online programming contest, where it surpassed nearly 46 percent of the human competitors.

Now even the AI has notes

Just as GPT-3.5 serves as a foundational model for ChatGPT, Codex serves as the basis for GitHub’s Copilot AI. Trained on billions of lines of code assembled from the public web, Copilot offers cloud-based AI-assisted coding autocomplete features through a subscription plugin for the Visual Studio Code, Visual Studio, Neovim, and JetBrains integrated development environments (IDEs).

Initially released as a developer’s preview in June of 2021, Copilot was among the very first coding capable AIs to reach the market. More than a million devs have leveraged the system in the two years since, GitHub's VP of Product Ryan J Salva, told Engadget. With Copilot, users can generate runnable code from natural language text inputs as well as autocomplete commonly repeated code sections and programming functions.

Salva notes that prior to Copilot’s release, GitHub’s previous machine-generated coding suggestions were only accepted by users 14 to 17 percent of the time. “Which is fine," he said. "It means it was helping developers along.” In the two years since Copilot’s debut, that figure has grown to 35 percent, “and that's netting out to just under half of the amount of code being written [on GitHub] — 46 percent by AI, to be exact.”

“[It’s] not a matter of just percentage of code written,” Salva clarified. “It's really about the productivity, the focus, the satisfaction of the developers who are creating.”

As with the outputs of natural language generators like ChatGPT, the code coming from Copilot is largely legible, but like any large language model trained on the open internet, GitHub made sure to incorporate additional safeguards against the system unintentionally producing exploitable code.

“Between when the model produces a suggestion and when that suggestion is presented to the developer,” Salva said, “we at runtime perform […] a code quality analysis for the developer, looking for common errors or vulnerabilities in the code like cross-site scripting or path injection.”

That auditing step is meant to improve the quality of recommended code over time rather than monitor or police what the code might be used for. Copilot can help developers create the code that makes up malware, the system won’t prevent it. “We've taken the position that Copilot is there as a tool to help developers produce code,” Salva said, pointing to the numerous White Hat applications for such a system. “Putting a tool like Copilot in their hands […] makes them more capable security researchers,” he continued.

As the technology continues to develop, Salva sees generative AI coding to expand far beyond its current technological bounds. That includes “taking a big bet” on conversational AI. “We also see AI-assisted development really percolating up into other parts of the software development life cycle,” he said, like using AI to autonomously repair a CI/CD build errors, patch security vulnerabilities, or have the AI review human-written code.

“Just as we use compilers to produce machine-level code today, I do think they'll eventually get to another layer of abstraction with AI that allows developers to express themselves in a different language,” Salva said. “Maybe it's natural language like English or French, or Korean. And that then gets ‘compiled down’ to something that the machines can understand,” freeing up engineers and developers to focus on the overall growth of the project rather than the nuts and bolts of its construction.

From coders to gabbers

With human decision-making still firmly wedged within the AI programming loop, at least for now, we have little to fear from having software writing software. As Salva noted, computers already do this to a degree when compiling code, and digital gray goos have yet to take over because of it. Instead, the most immediate challenges facing programming AI mirror those of generative AI in general: inherent biases skewing training data, model outputs that violate copyright, and concerns surrounding user data privacy when it comes to training large language models.

GitHub is far from alone in its efforts to build an AI programming buddy. OpenAI’s ChatGPT is capable of generating code — as are the already countless indie variants being built atop the GPT platform. So, too, is Amazon’s AWS CodeWhisperer system, which provides much of the same autocomplete functionality as Copilot, but optimized for use within the AWS framework. After multiple requests from users, Google incorporated code generation and debugging capabilities into Bard this past April as well, ahead of its ecosystem-wide pivot to embrace AI at I/O 2023 and the release of Codey, Alphabet’s answer to Copilot. We can’t be sure yet what generative coding systems will eventually become or how it might impact the tech industry — we could be looking at the earliest iterations of a transformative democratizing technology, or it could be Clippy for a new generation.

If you buy something through a link in this article, we may earn commission.

Engadget
Latest UN report demands 'unprecedented' emissions cuts to salvage climate goals
The United Nations' Environmental Program has released a new report with yet more dire news about our odds of avoiding climate disaster caused by greenhouse gas emissions.
Engadget
Good Omens’ final season will have only one episode
David Tennant and Michael Sheen are returning to Amazon comedy drama for one more season.
Engadget
President Biden sets up new AI guardrails for military, intelligence agencies
The White House's new guidelines for artificial intelligence use prohibits uses such as nuclear launches and granting asylum to immigrants.
Engadget
iOS 18.2 has a child safety feature that can blur nude content and report it to Apple
Apple’s iOS 18.2 introduces machine learning-based nude content detection that is entirely on-device and preserves end-to-end encryption. Initially arriving in Australia, this expansion of the Communication Safety feature aims to protect childrenfrom explicit content.
Engadget
Bluesky’s upcoming premium plan won’t give paid users special treatment
Bluesky says it's working on premium subscriptions as it looks to start generating revenue. The platform added that it won't add support for the likes of crypto trading or NFTs.
Engadget
Yooka-Laylee remaster rolling to all consoles, including Nintendo’s next system
Playtonic Games' Yooka-Re-playlee could be headed to Nintendo's next Switch console.
Engadget
Your Balatro deck can now feature Binding of Isaac characters
Balatro is back with another update, bringing four new crossover decks to the poker-inspired roguelike.
Engadget
Samsung Galaxy S24 FE review: A great phone, but I wish it was cheaper
Samsung's Galaxy S24 has a lot going for it, but the new AI features aren't enough to make it standout in a competitive market.
Engadget
Google Photos will show when images have been modified with AI
Google is adding new language for increased transparency about when AI has been used in the Google Photos app.
Engadget
Overwatch 2's long-awaited 6v6 tests start in December
Blizzard has revealed some details for its tests of six-player teams in Overwatch 2. When the transition from the original game took place just over two years ago, Blizzard switched to a 5v5 format.
Engadget
Google Calendar's web client finally gets a dark mode
Google Calendar's web client just got a redesign with new buttons, dialogs and sidebars. Users can also now toggle between light and dark mode.
Engadget
The UK’s antitrust regulator will formally investigate Alphabet’s $2.3 billion Anthropic investment
The UK’s competition regulator is investigating whether Alphabet’s $2.3 billion investment in AI startup Anthropic harms UK competition. The CMA will open its preliminary probe on Friday.
Engadget
Apple teases new Macs in a 'week of announcements' that starts Monday
Apple is set to reveal new Macs next week as part of a series of announcements.
Engadget
Get a four-pack of Samsung Galaxy SmartTag2 trackers for just $64
A set of four Galaxy SmartTag2 trackers can be yours for a solid discount. The sale means that the trackers will run you $16 each.
Engadget
Of course telecom companies are suing the FTC to block the new 'click-to-cancel' rule
Telecom companies sue FTC to block the ‘click-to-cancel’ rule. The mandate requires organizations to offer simple cancellation methods for subscriptions.
Engadget
Rivian factory workers are reportedly getting seriously injured on the job
EV maker Rivian's factory in Normal, Illinois, appears to have many safety violations. Some workers have reportedly been seriously injured.
Engadget
Intel wins latest antitrust battle with EU court
Intel just won an epic battle with the European Union over a €1.06 billion ($1.1 billion) fine levied way back in 2009.
Engadget
The Apple Watch Series 10 is $30 off right now
Shop the sale on both the 42mm and 46mm models.
Engadget
EU fines LinkedIn $334 million for violating the GDPR
LinkedIn also needs to show compliance in data processing moving forward.
Engadget
Snapchat's camera is getting a shortcut on the iPhone lock screen
Snapchat's camera is getting a shortcut on the iPhone lock screen. The tool requires iOS 18, as it takes advantage of the new shortcut tools.

Natural Language Programming AIs are taking the drudgery out of coding

But how long until they take the human out of it as well?

How coding works

Building the tools to build the tools allowing any tool to build tools

Now even the AI has notes

From coders to gabbers

Latest Stories

Latest UN report demands 'unprecedented' emissions cuts to salvage climate goals

Good Omens’ final season will have only one episode

President Biden sets up new AI guardrails for military, intelligence agencies

iOS 18.2 has a child safety feature that can blur nude content and report it to Apple

Bluesky’s upcoming premium plan won’t give paid users special treatment

Yooka-Laylee remaster rolling to all consoles, including Nintendo’s next system

Your Balatro deck can now feature Binding of Isaac characters

Samsung Galaxy S24 FE review: A great phone, but I wish it was cheaper

Google Photos will show when images have been modified with AI

Overwatch 2's long-awaited 6v6 tests start in December

Google Calendar's web client finally gets a dark mode

The UK’s antitrust regulator will formally investigate Alphabet’s $2.3 billion Anthropic investment

Apple teases new Macs in a 'week of announcements' that starts Monday

Get a four-pack of Samsung Galaxy SmartTag2 trackers for just $64

Of course telecom companies are suing the FTC to block the new 'click-to-cancel' rule

Rivian factory workers are reportedly getting seriously injured on the job

Intel wins latest antitrust battle with EU court

The Apple Watch Series 10 is $30 off right now

EU fines LinkedIn $334 million for violating the GDPR

Snapchat's camera is getting a shortcut on the iPhone lock screen

About

Sections

Contribute

Buying Guides