An introduction to APIs

APIs (application programming interfaces) are a big part of the web—a part that's only getting bigger. A 2021 F5 study projected that the number of active APIs would grow from under 200 million in 2018 to upwards of 1.7 billion by 2030. Meanwhile, the latest State of APIs report found that nearly 63% of developers relied on APIs more in 2022 than they had the previous year, and nearly 70% said they expected to rely on them even more in 2023. And on the executive side, over half the CEOs surveyed by Postman's most recent State of the API Report anticipated increasing their organizations' investments in APIs in the next year.

Table of contents:

What is an API?
API protocols
API types and formats
API authentication, part 1: Basic vs. key
API authentication, part 2: OAuth
API design
Real-time API communication
API implementation

What is an API?

An API is a set of rules (interface) that two separate systems or programs—one on a publishing side and one on an accessing side—agree to follow. The company publishing the API then implements their side by writing a program and putting it on a server. In practice, lumping the interface in with the implementation is an easier way to think about it.

With so many companies investing in APIs to do things like share data with third-party apps, enable logins using third-party profiles, integrate payment processors, and support service access across multiple devices, possessing a working understanding of APIs becomes increasingly relevant to careers in the tech industry. Through this guide, we hope to give you that knowledge by building up from the very basics. In this section, we start by looking at some fundamental concepts around APIs. We define what an API is and where it lives, and then give a high-level picture of how one is used.

Servers

When talking about APIs, a lot of the conversation focuses on abstract concepts. To anchor ourselves, let's start with something physical: the server. A server is nothing more than a big computer. It has all the same parts as the laptop or desktop you use for work; it's just faster and more powerful. Typically, servers don't have a monitor, keyboard, or mouse, which makes them look unapproachable. The reality is that IT folks connect to them remotely—think remote desktop-style—to work on them.

Servers are used for all sorts of things. Some store data; others send email. The kind people interact with the most are web servers. These are the servers that give you a webpage when you visit a website. To better understand how that works, here's a simple analogy:

In the same way that a program like Solitaire waits for you to click on a card to do something, a web server runs a program that waits for a person to ask it for a webpage.

There's really nothing magical or spectacular about it. A software developer writes a program, copies it to a server, and the server runs the program continuously. APIs provide an interface between these servers and the systems tasked with accessing the data in those servers.

How do APIs work?

APIs work by using predetermined rules and protocols to communicate user requests between two separate systems.

Websites are designed to cater to people's strengths. Humans have an incredible ability to take visual information, combine it with our experiences to derive meaning, and then act on that meaning. It's why you can look at a form on a website and know that the little box with the phrase "First Name" above it means you are supposed to type in the word you use to informally identify yourself.

Yet, what happens when you face a very time-intensive task, like copying the contact info for a thousand customers from one site to another? You would love to delegate this work to a computer so it can be done quickly and accurately. Unfortunately, the characteristics that make websites optimal for humans make them difficult for computers to use.

The solution is an API. An API is the tool that makes a website's data digestible for a computer. Through it, a computer can view and edit data, just like a person can by loading pages and submitting forms.

An infographic representing how an API works

Making data easier to work with is good because it means people can write software to automate tedious and labor-intensive tasks. What might take a human hours to accomplish can take a computer seconds through an API.

How an API is used

When two systems (websites, desktops, smartphones) link up through an API, we say they are "integrated." In an integration, you have two sides, each with a special name. One side we have already talked about: the server. This is the side that actually provides the API. It helps to remember that the API is simply another program running on the server. It may be part of the same program that handles web traffic, or it can be a completely separate one. In either case, it is sitting, waiting for others to ask it for data.

The other side is the "client." This is a separate program that knows what data is available through the API and can manipulate it, typically at the request of a user. A great example is a smartphone app that syncs with a website. When you push the refresh button in your app, it talks to a server via an API and fetches the newest info.

The same principle applies to websites that are integrated. When one site pulls in data from the other, the site providing the data is acting as the server, and the site fetching the data is the client.

Recap

This section focused on providing some foundational terminology and a mental model of what an API is and how it is used.

The key terms we learned were:

Server: A powerful computer that runs an API
API: The "hidden" portion of a website that is meant for computer consumption
Client: A program that exchanges data with a server through an API

API protocols

With a solid grasp on the who, we're ready to look deeper into how these two communicate. For context, we first look at the human model of communication and compare it to computers. After that, we move on to the specifics of a common protocol used in APIs.

API protocol rules

People create social etiquette to guide their interactions. One example is how we talk to each other on the phone. Imagine yourself chatting with a friend. While they are speaking, you know to be silent. You know to allow them brief pauses. If they ask a question and then remain quiet, you know they are expecting a response and it is now your turn to talk.

Computers have a similar etiquette, though it goes by the term "protocol." A computer protocol is an accepted set of rules that govern how two computers can speak to each other. Compared to our standards, however, a computer protocol is extremely rigid. Think for a moment of the two sentences "My favorite color is blue" and "Blue is my favorite color." People are able to break down each sentence and see that they mean the same thing, despite the words being in different orders. Unfortunately, computers are not that smart.

For two computers to communicate effectively, the server has to know exactly how the client will arrange its messages. You can think of it like a person asking for a mailing address. When you ask for the location of a place, you assume the first thing you are told is the street address, followed by the city, the state, and lastly, the ZIP Code. You also have certain expectations about each piece of the address, like the fact that a ZIP Code should consist only of numbers. A similar level of specificity is required for a computer protocol to work.

HTTP: The protocol of the web

There is a protocol for just about everything, each one tailored to different jobs. You may have already heard of some: Bluetooth for connecting devices, and POP or IMAP for fetching emails.

On the web, the main protocol is the Hypertext Transfer Protocol, better known by its acronym, HTTP. When you type an address like http://example.com into a web browser, the "http" tells the browser to use the rules of HTTP when talking with the server.

With the ubiquity of HTTP on the web, many companies choose to adopt it as the protocol underlying their APIs. One benefit of using a familiar protocol is that it lowers the learning curve for developers, which encourages usage of the API. Another benefit is that HTTP has several features useful in building a good API, as we'll see later. Right now, let's brave the water and take a look at how HTTP works!

HTTP requests

Communication in HTTP centers around a concept called the Request-Response Cycle. The client sends the server a request to do something. The server, in turn, sends the client a response saying whether or not the server could do what the client asked.

An infographic representing an HTTP request

To make a valid request, the client needs to include four things:

URL (Uniform Resource Locator)
Method
List of headers
Body

That may sound like a lot of details just to pass along a message, but remember, computers have to be very specific to communicate with one another.

The HTTP specification actually requires a request to have a URI (Uniform Resource Identifier). URLs are one kind of URI, and URNs (Uniform Resource Names) are another. We chose URL because it is the acronym readers already know. The subtle differences between these three are beyond the scope of the guide.

URL

URLs are familiar to us through our daily use of the web, but have you ever taken a moment to consider their structure? In HTTP, a URL is a unique address for a thing (a noun). Which things get addresses is entirely up to the business running the server. They can make URLs for webpages, images, or even videos of cute animals.

APIs extend this idea a bit further to include nouns like customers, products, and tweets. In doing so, URLs become an easy way for the client to tell the server which thing it wants to interact with. Of course, APIs also do not call them "things", but give them the technical name "resources."

Method

The request method tells the server what kind of action the client wants the server to take. In fact, the method is commonly referred to as the request "verb."

Some of the most common API methods are:

GET: Asks the server to retrieve a resource
POST: Asks the server to create a new resource
PUT: Asks the server to replace an existing resource with an updated version
PATCH: Asks the server to partially edit/update an existing resource
DELETE: Asks the server to delete a resource

Here's an example to help illustrate these methods. Let's say there is a pizza parlor with an API you can use to place orders. You place an order by making a POST request to the restaurant's server with your order details, asking them to create your pizza. As soon as you send the request, however, you realize you picked the wrong style crust, so you make a PATCH request to change only the crust style (as opposed to a PUT request, which you would use to change the entire resource/order).

While waiting on your order, you make a bunch of GET requests to check the status. After an hour of waiting, you decide you've had enough and make a DELETE request to cancel your order.
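The pizza scenario above can be sketched with a toy, in-memory stand-in for the parlor's server. Everything here is hypothetical (the `handle` function, the `orders` dictionary, the field names), and nothing is sent over a network. The status codes 201, 204, and 405 are standard HTTP codes that this guide has not introduced yet. The point is only to show how the same order responds differently depending on the method.

```python
import itertools

orders = {}                    # order id -> order details (hypothetical storage)
ids = itertools.count(1)       # hands out 1, 2, 3, ... as new order ids

def handle(method, order_id=None, body=None):
    """Apply one request to the in-memory orders and return (status, data)."""
    if method == "POST":                       # create a new resource
        order_id = next(ids)
        orders[order_id] = dict(body)
        return 201, {"id": order_id, **orders[order_id]}
    if order_id not in orders:                 # every other method needs an existing order
        return 404, None
    if method == "GET":                        # retrieve the resource
        return 200, orders[order_id]
    if method == "PUT":                        # replace the whole resource
        orders[order_id] = dict(body)
        return 200, orders[order_id]
    if method == "PATCH":                      # change only the fields supplied
        orders[order_id].update(body)
        return 200, orders[order_id]
    if method == "DELETE":                     # remove the resource
        del orders[order_id]
        return 204, None
    return 405, None                           # method not supported

status, created = handle("POST", body={"crust": "thin", "toppings": ["cheese"]})
order_id = created["id"]
handle("PATCH", order_id, {"crust": "original"})   # fix only the crust
_, current = handle("GET", order_id)               # check on the order
handle("DELETE", order_id)                         # cancel it
gone_status, _ = handle("GET", order_id)           # the order no longer exists
```

Note that the PATCH call leaves the toppings untouched, whereas a PUT with the same body would have replaced the entire order with just a crust.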

Headers

Headers provide meta-information about a request. They are a simple list of items like the time the client sent the request and the size of the request body.

Every time you visit a website on your smartphone that's been specially rendered for mobile devices, this formatting is made possible by an HTTP header called "User-Agent." The client uses this header to tell the server what type of device you are using, and websites smart enough to detect it can send you the best format for your device.

There are quite a few HTTP headers that clients and servers deal with, so we will wait to talk about other ones until they are relevant in later sections.

Body

The request body contains the data the client wants to send the server. Continuing our pizza ordering example above, the body is where the order details go.

A unique trait about the body is that the client has complete control over this part of the request. Unlike the method, URL, or headers, where the HTTP protocol requires a rigid structure, the body allows the client to send anything it needs.

These four pieces—URL, method, headers, and body—make up a complete HTTP request.

An infographic representing a valid HTTP request
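As a rough illustration, the four pieces can be assembled with Python's standard `urllib.request.Request` class. The URL path and the order fields are made up for this sketch, and the request is only built, never sent.

```python
import json
from urllib.request import Request

order = {"crust": "original", "toppings": ["cheese", "pepperoni"]}

request = Request(
    url="http://example.com/orders",               # URL: which resource
    method="POST",                                 # method: which action
    headers={"Content-Type": "application/json"},  # headers: meta-information
    data=json.dumps(order).encode("utf-8"),        # body: the data itself
)

print(request.get_method())   # POST
print(request.full_url)
print(request.header_items())
print(request.data)
```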

HTTP responses

After the server receives a request from the client, it attempts to fulfill the request and send the client back a response. HTTP responses have a very similar structure to requests. The main difference is that instead of a method and a URL, the response includes a status code. Beyond that, the response headers and body follow the same format as requests.

Status codes

Status codes are three-digit numbers that each have a unique meaning. When used correctly in an API, this little number can communicate a lot of info to the client. For example, you may have seen this page during your internet wanderings:

An example of a not found error request when using an API

The status code behind this response is 404, which means "Not Found." Whenever the client makes a request for a resource that does not exist, the server responds with a 404 status code to let the client know: "that resource doesn't exist, so please don't ask for it again!"

There is a slew of other statuses in the HTTP protocol, ranging from 200 ("success! that request was good") to 503 ("our website/API is currently down"). We'll learn a few more of them as they come up in later sections.
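For a quick way to look up what a code means, Python's standard library ships an `HTTPStatus` enum. The small `describe` helper below is our own illustration, not part of any API. It relies on the convention that the first digit of a status code indicates its general category.

```python
from http import HTTPStatus

# The first digit of a status code indicates its general category.
CLASSES = {2: "success", 3: "redirection", 4: "client error", 5: "server error"}

def describe(code):
    """Return a short description such as '404 Not Found (client error)'."""
    status = HTTPStatus(code)
    return f"{code} {status.phrase} ({CLASSES.get(code // 100, 'other')})"

for code in (200, 404, 503):
    print(describe(code))
```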

Completing the cycle

After a response is delivered to the client, the Request-Response Cycle is completed and that round of communication is over. It is now up to the client to initiate any further interactions. The server will not send the client any more data until it receives a new request.

An infographic representing a valid HTTP response
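To watch one full cycle happen, the sketch below starts a tiny server on the local machine and sends it a request, using only Python's standard library. The `/orders/1` path, the `OrderHandler` class, and the response data are invented for illustration. A real API would do far more, but the shape of the exchange is the same: one request in, one response out.

```python
import json
import threading
from http.client import HTTPConnection
from http.server import BaseHTTPRequestHandler, HTTPServer

class OrderHandler(BaseHTTPRequestHandler):
    """Answers GET requests for a single, hard-coded order."""

    def do_GET(self):
        if self.path == "/orders/1":
            body = json.dumps({"status": "cooking"}).encode("utf-8")
            self.send_response(200)
        else:
            body = b""
            self.send_response(404)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):   # keep the example quiet
        pass

def run_cycle(path):
    """Start a local server, make one request, and return the response parts."""
    server = HTTPServer(("127.0.0.1", 0), OrderHandler)   # port 0 = any free port
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        conn = HTTPConnection("127.0.0.1", server.server_port, timeout=5)
        conn.request("GET", path, headers={"Accept": "application/json"})
        response = conn.getresponse()
        result = (response.status, response.getheader("Content-Type"), response.read())
        conn.close()
        return result
    finally:
        server.shutdown()
        server.server_close()

print(run_cycle("/orders/1"))
print(run_cycle("/orders/2")[0])   # 404
```

The server never speaks first. It only answers when `conn.request` arrives, which is the Request-Response Cycle in miniature.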

How APIs build on HTTP

By now, you can see that HTTP supports a wide range of permutations to help the client and server talk. So, how does this help us with APIs? The flexibility of HTTP means that APIs built on it can provide clients with a lot of business potential. We saw that potential in the pizza ordering example above. A simple tweak to the request method was the difference between telling the server to create a new order or cancel an existing one. It was easy to turn the desired business outcome into an instruction the server could understand. Very powerful!

This versatility in the HTTP protocol extends to other parts of a request, too. Some APIs require a particular header, while others require specific information inside the request body. Being able to use APIs hinges on knowing how to make the correct HTTP request to get the result you want.

Recap

The goal of this section was to give you a basic understanding of HTTP. The key concept was the Request-Response Cycle, which we broke down into the following parts:

Request: Consists of a URL (http://…), a method (GET, POST, PUT, PATCH, DELETE), a list of headers (User-Agent…), and a body (data)
Response: Consists of a status code (200, 404…), a list of headers, and a body

Throughout the rest of the guide, we will revisit these fundamentals as we discover how APIs rely on them to deliver power and flexibility.

API types and formats

So far, we've learned that HTTP (Hypertext Transfer Protocol) is the underpinning of APIs on the web and that to use them, we need to know how HTTP works. In this section, we explore the data APIs provide, how it's formatted, and how HTTP makes it possible.

The four types of APIs

APIs can be broken down into four types or formats: internal, external, partner, and composite. The API format you choose will depend on its intended scope and the unique specifications of your intended use case. Here's how they compare:

Internal: Also referred to as private, these APIs (as you probably guessed) are meant for connecting systems housed within one organization. This is a convenient way for growing companies to stay nimble with their data and collaboration needs.
External: Also called public or open APIs, external APIs are made accessible to the public and can be readily accessed by third parties. This is convenient for public-facing applications and systems that are intended to link to outside systems and applications.
Partner: These APIs are somewhere in between internal and external. They allow one organization to link their data only to predetermined and approved third parties. This helps organizations connect internal systems with specified organizations or applications like ERPs.
Composite: Developers can lighten the load on their servers by bringing together more than one API into a single call. This can improve server speed or link separate but related APIs into a single process.

API data formats

When sharing data with people, the possibilities for how to display the information are limited only by human imagination. Recall the pizza parlor from last section—how might they format their menu? It could be a text-only bulleted list, it could be a series of photos with captions, or it could even be only photos.

The way data is presented depends on what makes the information the easiest for the intended audience to understand.

The same principle applies when sharing data between computers. One computer has to communicate data in a format that the other will understand. Generally, this means some kind of textual representation. The most common formats found in modern APIs are JSON (JavaScript Object Notation) and XML (Extensible Markup Language).

JSON

Many APIs have adopted the newer JSON representation because it's built on the popular JavaScript programming language, which is ubiquitous on the web and usable on both the front- and back-end of a web app or service. JSON is a very simple format that is expressed using a combination of punctuation marks and real, readable words. Each object in JSON is set off between curly brackets ({}) and contains pairs of keys and values. Within a pair, the key and the value are separated by a colon (:), and pairs are separated from one another by commas. Keys are always wrapped in quotation marks (""), as are text values; other values, such as numbers and lists, are written without them.

Keys represent an attribute about the object being described and specify one or more corresponding values. For example, if a pizza order is an object, its attributes (keys) would be crust type, toppings, and order status. The selections for these attributes/keys would be options (values) like thick crust, pepperoni, and out for delivery, respectively.

Let's see how this pizza order could look in JSON:
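One way the order could be written, consistent with the walkthrough that follows, is shown here inside a short Python snippet that also parses it. The crust and toppings come from the description in this section. The "cooking" status value is an illustrative assumption.

```python
import json

# The text between the triple quotes is the JSON document itself.
order_json = """
{
  "crust": "original",
  "toppings": ["cheese", "pepperoni", "garlic"],
  "status": "cooking"
}
"""

order = json.loads(order_json)   # turn the JSON text into a Python dictionary
print(order["crust"])            # original
print(order["toppings"])         # ['cheese', 'pepperoni', 'garlic']
```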

In the JSON example above, the keys are the words on the left of the colons: toppings, crust, and status. They tell us what attributes the pizza order contains. The values are the parts to the right of the colons. These are the actual details of the order.

If you read a line from left to right, you get a fairly natural English sentence. Taking the first line as an example, we could read it as, "The crust for this pizza is original style." The second line can also be read this way—in JSON, square brackets ([]) specify a list of values, or an array. So, we read the second line of the order as, "The toppings for this order are: cheese, pepperoni, and garlic."

Sometimes, you may want to use an object as the value for a key. Let's extend our pizza order with customer details so you can see what this might look like:
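A sketch of the extended order might look like the following. The customer's "name" and "phone" fields and their values are placeholders chosen for illustration, as is the "cooking" status.

```python
import json

order_json = """
{
  "crust": "original",
  "toppings": ["cheese", "pepperoni", "garlic"],
  "status": "cooking",
  "customer": {
    "name": "Pat Example",
    "phone": "555-0100"
  }
}
"""

order = json.loads(order_json)
# The value stored under "customer" is itself an object with its own keys.
print(order["customer"]["name"])   # Pat Example
```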

In this updated version, we see that a new key, "customer," is added. The value for this key is another set of keys and values that provide details about the customer who placed the order. Cool trick, huh? You may see this called an associative array. Don't let the technical term intimidate you, though: an associative array is simply a collection of keys and values, which in this case is a nested object.

XML

XML has been around since 1996. With age, it has become a very mature and powerful data format. Like JSON, XML provides a few simple building blocks that API makers use to structure their data, expressed in words and punctuation marks. The main building block of XML is called a node, and each node represents data in tags and values, similar to JSON's keys and values.

Let's see what our pizza order might look like in XML:
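Here is one plausible rendering, wrapped in a short Python snippet that parses it with the standard `xml.etree.ElementTree` module. The "order", "crust", and "toppings" tags follow the description in this section. The "topping" tag name for each list item and the "cooking" status value are assumptions made for the sketch.

```python
import xml.etree.ElementTree as ET

# The text between the triple quotes is the XML document itself.
order_xml = """
<order>
  <crust>original</crust>
  <toppings>
    <topping>cheese</topping>
    <topping>pepperoni</topping>
    <topping>garlic</topping>
  </toppings>
  <status>cooking</status>
</order>
"""

root = ET.fromstring(order_xml.strip())       # parse the text into a tree of nodes
print(root.tag)                               # order
print(root.findtext("crust"))                 # original
toppings = [node.text for node in root.find("toppings")]
print(toppings)                               # ['cheese', 'pepperoni', 'garlic']
```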

XML always starts with a root node, which in our pizza example is represented by the "order" tag. Nodes always open with a tag inside less-than (<) and greater-than (>) brackets and then close with the same tag with a slash (/) at the front, containing every node (called "child nodes") in between.

In the pizza example, each tag denotes a specific attribute of the order (like the key in JSON), and the data between the opening and closing tags represents the related detail (like the value in JSON).

You can also infer English sentences by reading XML. Looking at the line with "crust," we could read, "The crust for the pizza is original style." Notice how in XML, every item in the list of toppings is bookended by tags. You can see how the XML format requires a lot more text to communicate than JSON does.

How different data representations are communicated in HTTP

Now that we've explored some available data formats, we need to know how to use them in HTTP. To do so, we will say hello again to one of the fundamentals of HTTP: headers. In an earlier section, we learned that headers are a list of information about a request or response. There is a header for saying what representation the data is in: Content-Type.

When the client sends the Content-Type header in a request, it is telling the server that the data in the body of the request is formatted a particular way. If the client wants to send the server JSON data, it will set the Content-Type to "application/json." Upon receiving the request and seeing that Content-Type, the server will first check if it understands that format, and if so, it will know how to read the data. Likewise, when the server sends the client a response, it will also set the Content-Type to tell the client how to read the body of the response.

Sometimes, the client can only speak one data format. If the server sends back anything other than that format, the client will fail and throw an error. Fortunately, a second HTTP header comes to the rescue. The client can set the Accept header to tell the server what data formats it is able to accept. If the client can only speak JSON, it can set the Accept header to "application/json." The server will then send back its response in JSON. If the server doesn't support the format the client requests, it can send back an error to the client to let it know the request is not going to work.

With these two headers, Content-Type and Accept, the client and server can work with the data formats they understand and need to work properly.
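The server's side of this exchange can be sketched as a small function that reads the Accept header and picks a format. This is a deliberately simplified illustration: the function names and the `SUPPORTED` list are ours, preference weights in the header are ignored, and only exact matches or the `*/*` wildcard are handled. The 406 ("Not Acceptable") status is the standard HTTP code for "I can't produce any format you accept."

```python
SUPPORTED = ["application/json", "application/xml"]   # formats this server can produce

def negotiate(accept_header, supported=SUPPORTED):
    """Pick a response format from an Accept header, or None if nothing matches."""
    for item in accept_header.split(","):
        media_type = item.split(";")[0].strip().lower()   # drop any ";q=..." parameters
        if media_type == "*/*":                            # client accepts anything
            return supported[0]
        if media_type in supported:
            return media_type
    return None

def respond(accept_header):
    """Return (status code, response headers) for the requested format."""
    chosen = negotiate(accept_header)
    if chosen is None:
        return 406, {}
    return 200, {"Content-Type": chosen}

print(respond("application/json"))   # (200, {'Content-Type': 'application/json'})
print(respond("text/csv"))           # (406, {})
```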

Recap

In this section, we learned that for two computers to communicate, they need to be able to understand the data passed to them. We were introduced to two common data formats used by APIs: JSON and XML. We also learned that the Content-Type HTTP header is a useful way to specify what data format is being sent in a request, and the Accept header specifies the requested format for a response.

The key terms we learned were:

JSON: JavaScript Object Notation
Object: A thing or noun (person, pizza order...)
Key: An attribute about an object (color, toppings...)
Value: The value of an attribute (blue, pepperoni...)
Associative array: A collection of keys and values (here, a nested object)
XML: Extensible Markup Language

API authentication, part 1 (basic vs. key)

Things are starting to pick up in our understanding of APIs. We know who the client and server are, we know they use HTTP to talk to each other, and we know they speak in specific data formats to understand each other. Knowing how to talk, though, leaves an important question: how does the server know the client is who it claims to be? In this section, we explore two ways that the client can prove its identity to the server.

What is API authentication?

You've probably registered for an account on a website before. The process involves the site asking you for some personal information, most notably a username and a password. These two pieces of information become your identifying marks. We call these your credentials. When you visit the website again, you can log in by providing these credentials.

Logging in with a username and password is one example of a technical process known as authentication. When you authenticate with a server, you prove your identity to the server by telling it information that only you know (at least we hope only you know it). Once the server knows who you are, it can trust you and divulge the private data in your account.

There are several techniques APIs use to authenticate a client. These are called authentication schemes. Let's take a look at two of these schemes now.

Basic authentication

The logging-in example above is the most basic form of authentication. In fact, the official name for it is Basic Authentication ("Basic Auth" to its friends). Though the name has not garnered any creativity awards, the scheme is a perfectly acceptable way for the server to authenticate the client in an API.

Basic Auth only requires a username and password. The client takes these two credentials, smooshes them together to form a single value, and passes that along in the request in an HTTP header called Authorization.

The Basic Authentication process involves combining the username with a colon, followed by the password, and then running the whole string through the base64 encoding algorithm. Thus "user" and "password" becomes "user:password" and, after encoding, you have "dXNlcjpwYXNzd29yZA==". The encoded value is sent in the Authorization header after the word "Basic".

When the server receives the request, it looks at the Authorization header and compares it to the credentials it has stored. If the username and password match one of the users in the server's list, the server fulfills the client's request as that user. If there is no match, the server returns a special status code (401) to let the client know that authentication failed and the request is denied.
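Both halves of Basic Auth fit in a few lines of Python. In this sketch, `basic_auth_header` plays the client and `check` plays the server. The `USERS` table is a hypothetical stand-in for however a real server stores its accounts (which would normally be hashed passwords, not plain text). One practical note: the encoded value depends on the exact bytes, so accidentally including a trailing newline after the password produces a different string ending in "Ao=".

```python
import base64

def basic_auth_header(username, password):
    """Client side: build the value of the Authorization header."""
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")

USERS = {"user": "password"}   # hypothetical account list, plain text for illustration only

def check(authorization):
    """Server side: return 200 if the credentials match a known user, else 401."""
    scheme, _, token = authorization.partition(" ")
    if scheme != "Basic":
        return 401
    try:
        decoded = base64.b64decode(token).decode("utf-8")
    except ValueError:             # not valid base64 or not valid text
        return 401
    username, _, password = decoded.partition(":")
    return 200 if USERS.get(username) == password else 401

header = basic_auth_header("user", "password")
print(header)                                        # Basic dXNlcjpwYXNzd29yZA==
print(check(header))                                 # 200
print(check(basic_auth_header("user", "wrong")))     # 401
```

Because base64 is an encoding rather than encryption, anyone who sees the header can decode it. That is why Basic Auth is normally used only over an encrypted connection.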

Though Basic Auth is a valid authentication scheme, the fact that it uses the same username and password to access the API and manage the account is not ideal. That is like a hotel handing a guest the keys to the whole building rather than to a room.

Similarly with APIs, there may be times when the client should have different permissions than the account owner. Take, for example, a business owner who hires a contractor to write a program that uses an API on their behalf. Trusting the contractor with the account credentials puts the owner at risk because an unscrupulous contractor could change the password, locking the business owner out of their own account. Clearly, it would be nice to have an alternative.

API key authentication 

API key authentication is a technique that overcomes the weakness of using shared credentials by requiring the API to be accessed with a unique key. In this scheme, the key is usually a long series of letters and numbers that is distinct from the account owner's login password. The owner gives the key to the client, very much like a hotel gives a guest a key to a single room.

When the client authenticates with the API key, the server knows to allow the client access to data, but now has the option to limit administrative functions, like changing passwords or deleting accounts. Sometimes, keys are used simply so the user does not have to give out their password. The flexibility is there with API key authentication to limit control as well as protect user passwords.

So, where does the API key go? Unlike Basic Auth, which is an established standard with strict rules, API keys were conceived at multiple companies in the early days of the web. As a result, API key authentication is a bit like the Wild West—everybody has their own way of doing it.

Over time, however, a few common approaches have emerged. One is to have the client put the key in the Authorization header in lieu of a username and password. Another is to add the key to the URL (http://example.com?api_key=my_secret_key). Less common is to bury the key somewhere in the request body next to the data. Wherever the key goes (assuming it's located where the server expects it to be based on the API documentation), the effect is the same—it lets the server authenticate the client.
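The three placements can be compared side by side. The sketch below builds (but does not send) one request for each, using Python's standard library. The `/orders` path and the `api_key` field name in the body are assumptions; a real API's documentation dictates the exact names and locations.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

API_KEY = "my_secret_key"
BASE_URL = "http://example.com/orders"   # hypothetical endpoint

# 1. In the Authorization header, in place of a username and password
in_header = Request(BASE_URL, headers={"Authorization": API_KEY})

# 2. In the URL, as a query parameter
in_url = Request(f"{BASE_URL}?{urlencode({'api_key': API_KEY})}")

# 3. In the request body, next to the data
payload = {"api_key": API_KEY, "crust": "original"}
in_body = Request(BASE_URL, method="POST", data=json.dumps(payload).encode("utf-8"))

print(in_header.get_header("Authorization"))
print(in_url.full_url)
print(in_body.data)
```

One trade-off worth knowing: URLs tend to end up in browser histories and server logs, so a key placed in the URL is easier to leak than one placed in a header.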

Recap

In this section, we learned how the client can prove its identity to the server, a process known as authentication. We looked at two techniques, or schemes, APIs use to authenticate.

The key terms we learned were:

Authentication: Process of the client proving its identity to the server
Authorization: Process of the client proving its access privileges
Credentials: Secret pieces of info used to prove the client's identity (username, password...)
Basic Auth: Scheme that uses an encoded username and password for credentials
API key auth: Scheme that uses a unique key for credentials
Authorization header: The HTTP header used to hold credentials

API authentication, part 2 (OAuth)

We mentioned most websites use a username and password for authentication credentials. We also discussed how reusing these credentials for API access isn't secure, so APIs often require a different set of credentials from the ones used to log in to a website. A common example is API keys. In this section, we look at another solution, open authorization (OAuth), which is becoming the most widely used authentication scheme on the web.

Authentication vs. authorization

You might see "authentication" used interchangeably with "authorization" (or may mix them up yourself). This is understandable, because they're very similar-sounding words with very closely related definitions.

In the context of APIs, authentication is the process of proving identity—namely, confirming that an agent attempting to access something is who they claim to be. Authorization is the related process of proving access privilege—in this case, confirming that an agent has approval to access what they're trying to access. One increasingly common example of authorization is OAuth 2.0, which (spoiler alert) we'll dive into below.

The problem with API authentication

If you've ever had to enter a product key for new software or to activate a warranty, you know typing a long sequence of random characters into a form field makes for a poor user experience. First, you have to find the required key. Sure, it was right in your inbox when you bought the software, but a year later, you're scrambling to find it. (What email was it sent from? Which email did I use to register?!) Once located, you have to enter the darned thing perfectly—making a typo or missing a single character will result in failure or might even get you locked out of your unregistered software.

Forcing users to work with API keys is a similarly poor experience. Typos are a common problem, and the process requires users to do part of the setup between the client and server manually. Users must obtain the key from the server, then give it to the client. For tools meant to automate work, surely there's a better solution.

How OAuth solves the problem

Enter: OAuth. Automating the key exchange is one of the main problems OAuth solves. It provides a standard way for the client to get a key from the server by walking the user through a simple set of steps. All users have to do is enter their credentials. Behind the scenes, the client and server are chattering back and forth to get the client a valid key. An increasingly popular (and convenient) example of this is using a Google login agent to sign in to a non-Google website, like Reddit (or Zapier).

There are currently two versions of OAuth, aptly named OAuth 1.0 and OAuth 2.0. Understanding the steps in each is necessary to be able to interact with APIs that use them for authentication. Since they share a common workflow, we'll walk through the steps of OAuth 2.0, then point out the ways in which OAuth 1.0 differs.

OAuth 2.0

To get started, we first need to know the cast of characters involved in an OAuth exchange:

The user: A person who wants to connect two websites they use
The client: The website that will be granted access to the user's data
The server: The website that has the user's data

Next, we need to give a quick disclaimer. One goal of OAuth 2.0 is to allow businesses to adapt the authentication process to their needs. Due to this extendable nature, APIs can have slightly different steps. The workflow shown below is a common one found among web-based apps. Mobile and desktop applications might use slight variations in this process.

With that, here are the steps of OAuth 2.0.

Step 1: User tells client to connect to server

Graphic showing the user connecting to the client

The user kicks off the process by letting the client know they want it to connect to the server. Usually, this is done by clicking a button.

Step 2: Client directs user to server

The client sends the user over to the server's website, along with a URL that the server will send the user back to once the user authenticates, called the callback URL.
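To make Step 2 concrete, here is a minimal sketch of how a client might build the link it sends the user to. The endpoint, client ID, and callback URL are hypothetical placeholders; the parameter names follow the common OAuth 2.0 authorization code convention, and a given API's documentation may differ.

```python
from urllib.parse import parse_qs, urlencode, urlparse

# Hypothetical endpoint; a real API's documentation gives the actual URL.
AUTHORIZE_ENDPOINT = "https://server.example.com/oauth/authorize"


def build_authorization_url(client_id, callback_url, state):
    """Return the URL the client sends the user to in Step 2."""
    params = {
        "response_type": "code",       # ask the server for an authorization code
        "client_id": client_id,        # identifies the client to the server
        "redirect_uri": callback_url,  # the callback URL the user is sent back to
        "state": state,                # opaque value the server echoes back
    }
    return f"{AUTHORIZE_ENDPOINT}?{urlencode(params)}"


url = build_authorization_url(
    client_id="pizza-tracker",
    callback_url="https://client.example.com/callback",
    state="xyz123",
)
query = parse_qs(urlparse(url).query)
print(query["redirect_uri"][0])
```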

Step 3: User logs in to server and grants client access

Graphic representing granting client access

With their normal username and password, the user authenticates with the server. The server is now certain that one of its own users is requesting that the client be given access to the user's account and related data.

Step 4: Server sends user back to client, along with code

Graphic representing sending the user back to the client

The server sends the user back to the client (to the callback URL from Step 2). Hidden in the response is a unique authorization code for the client.

Graphic representing sending the auth code back to the client

Step 5: Client exchanges code and secret key for access token

The client takes the authorization code it receives and makes another request to the server. This request includes the client's secret key. When the server sees a valid authorization code and a trusted client secret key, it is certain that the client is who it claims to be and that it is acting on behalf of a real user. The server responds with an access token.

A graphic showing the client exchanging a code and secret key for an access token
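The exchange in Step 5 can be sketched as a form-encoded request body. Nothing is sent over the network here; the code only assembles the pieces. The endpoint, code, and credentials are made-up values, and the field names follow the common OAuth 2.0 convention rather than any particular API.

```python
from urllib.parse import parse_qs, urlencode

# Hypothetical endpoint; a real API's documentation gives the actual URL.
TOKEN_ENDPOINT = "https://server.example.com/oauth/token"


def build_token_request(code, client_id, client_secret, callback_url):
    """Return (url, headers, body) for the Step 5 code-for-token exchange."""
    body = urlencode({
        "grant_type": "authorization_code",  # we are redeeming a code
        "code": code,                        # authorization code from Step 4
        "redirect_uri": callback_url,        # same callback URL as in Step 2
        "client_id": client_id,
        "client_secret": client_secret,      # the client's secret key
    })
    headers = {"Content-Type": "application/x-www-form-urlencoded"}
    return TOKEN_ENDPOINT, headers, body


url, headers, body = build_token_request(
    code="abc123",
    client_id="pizza-tracker",
    client_secret="keep-this-private",
    callback_url="https://client.example.com/callback",
)
fields = parse_qs(body)
print(fields["grant_type"][0])
```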

Step 6: Client fetches data from server

A graphic showing the client fetching data from the server

At this point, the client is free to access the server on the user's behalf. The access token from Step 5 is essentially another password into the user's account on the server. The client includes the access token with every request so it can authenticate directly with the server.
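As a rough illustration of "including the access token with every request," the sketch below attaches a token to a request object without sending it. It assumes the server expects the token in an Authorization header using the Bearer scheme, which many OAuth 2.0 APIs do; the URL and token value are invented.

```python
import urllib.request


def authorized_request(url, access_token):
    """Build (but do not send) a request that carries the access token."""
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {access_token}"},
    )


# Hypothetical resource URL and token, for illustration only.
req = authorized_request(
    "https://server.example.com/api/contacts",
    "example-access-token",
)
print(req.get_header("Authorization"))
```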

Client refreshes token (optional)

A feature introduced in OAuth 2.0 is the option to have access tokens expire. This strengthens the security of users' accounts: the faster a token expires, the less time a stolen token can be used maliciously, similar to how a credit card expires after a certain time. The lifespan of a token is set by the server. APIs in the wild use anything from hours to months. Once the lifespan is reached, the client must ask the server for a new token.
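One way a client might keep track of a token's lifespan is sketched below. The AccessToken class and its field names are our own invention for illustration; real servers report the lifespan in their own response format.

```python
import time
from dataclasses import dataclass
from typing import Optional


@dataclass
class AccessToken:
    value: str
    issued_at: float   # seconds since the epoch when the token was issued
    expires_in: float  # lifespan in seconds, chosen by the server

    def is_expired(self, now: Optional[float] = None) -> bool:
        """True once the lifespan has been reached."""
        current = time.time() if now is None else now
        return current >= self.issued_at + self.expires_in


token = AccessToken(value="example-token", issued_at=1_000.0, expires_in=3_600.0)
print(token.is_expired(now=2_000.0))  # False: still inside the one-hour lifespan
print(token.is_expired(now=5_000.0))  # True: the client must ask for a new token
```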

How OAuth 1.0 is different

There are several key differences between the two versions of OAuth. One we already touched on: in OAuth 1.0, access tokens do not expire.

Another distinction is that OAuth 1.0 includes an extra step. Between Steps 1 and 2 above, OAuth 1.0 requires the client to ask the server for a request token. This token acts like the authorization code in OAuth 2.0 and is what gets exchanged for the access token.

A third difference is that OAuth 1.0 requires requests to be digitally signed. We'll skip the details of how signing works (you can find code libraries to do this for you), but it is worth knowing why it is in one version and not the other. Request signing is a way to protect data from being tampered with while it moves between the client and the server. Signatures allow the server to verify the authenticity of the requests.
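To show why a signature detects tampering, here is a simplified sketch using an HMAC. It is not the exact OAuth 1.0 signing procedure (which builds a specific base string from the request); the message format, the secret, and the choice of SHA-256 are assumptions made for illustration.

```python
import hashlib
import hmac


def sign(message, shared_secret):
    """Return a hex signature of the message, keyed by the shared secret."""
    return hmac.new(
        shared_secret.encode("utf-8"),
        message.encode("utf-8"),
        hashlib.sha256,
    ).hexdigest()


def verify(message, signature, shared_secret):
    """Recompute the signature and compare it in constant time."""
    return hmac.compare_digest(sign(message, shared_secret), signature)


secret = "hypothetical-shared-secret"
original = "POST /orders crust=thin"
signature = sign(original, secret)

print(verify(original, signature, secret))                    # True
print(verify("POST /orders crust=thick", signature, secret))  # False
```

Because only the client and server know the secret, a request altered in transit no longer matches its signature.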

Today, however, most API traffic happens over a channel that is already secure (HTTPS). Recognizing this, OAuth 2.0 eliminates signatures in an effort to make version two easier to use. The trade-off is that OAuth 2.0 relies on other measures to secure the data in transit.

Authorization

An element of OAuth 2.0 that deserves special attention is the concept of limiting access, known formally as authorization. Back in Step 3, when the user clicks the button to allow the client access, buried in the fine print are the exact permissions the client is asking for. Those permissions, called scope, are another important feature of OAuth 2.0. They provide a way for the client to request limited access to the user's data, thereby making it easier for the user to trust the client.

What makes scope powerful is that it involves client-based restrictions. Unlike an API key, where limits placed on the key affect every client equally, OAuth scope allows one client to have permission X and another to have permissions X and Y. That means one website might be able to view your contacts while another site can view and edit them.
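The per-client nature of scope can be sketched with a small lookup table. The client names and scope strings below are hypothetical; each API defines its own scope names.

```python
# Hypothetical record of what each client was granted during the OAuth flow.
GRANTED_SCOPES = {
    "site-a": {"contacts:read"},
    "site-b": {"contacts:read", "contacts:write"},
}


def is_allowed(client, required_scope):
    """Check whether a client was granted the scope an action requires."""
    return required_scope in GRANTED_SCOPES.get(client, set())


print(is_allowed("site-a", "contacts:write"))  # False: site-a can only view
print(is_allowed("site-b", "contacts:write"))  # True: site-b can view and edit
```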

Recap

In this section, we learned the flow of the OAuth authentication process. We compared the two versions, pointing out the major differences between them.

The key terms we learned were:

OAuth: An authentication scheme that automates the key exchange between client and server
Access token: A secret code that the client obtains upon successfully completing the OAuth process
Scope: Permissions that determine what access the client has to the user's data

API design

This section marks a turning point in our adventure with APIs. We are finished covering fundamentals and are now ready to see how the previous concepts combine to form an API. In this section, we discuss the components of an API by designing one.

REST vs. SOAP

When discussing APIs, you might hear talk of "soap" and "rest" and wonder whether the software developers are doing work or planning a spa day. The truth is that these are the names of the two most common architectures for web-based APIs. SOAP (formerly an acronym) is an XML-based design that has standardized structures for requests and responses.

SOAP once stood for Simple Object Access Protocol. It was originally used for a very specific type of API access. As developers found ways to apply it to more situations, the name no longer fit, so in SOAP version 1.2, the acronym was dropped.

What is API design?

API design refers to the way developers and architects craft the rules and protocols governing the way systems interact with each other. To do this, they have to define the various methods, data formats, authentication mechanisms, and endpoints that let separate software entities do all the things we've discussed in the last five sections.

A well-designed API has a few key benefits:

Gives developers a clean, intuitive interface for accessing resources
Enables efficient, reliable, and predictable integration with low risk of coding errors
Fosters interoperability between a diverse range of systems, software, and applications

So, what exactly goes into well-designed APIs? Like just about all complex processes, it starts with organization.

Think for a moment about your Google Drive. Are you one of those people who dumps everything into a single folder or one of those people who meticulously arranges their photos, docs, and spreadsheets into a logical hierarchy of clearly labeled folders?

Companies give similar thought to organization when building their APIs. As we mentioned early on, the purpose of an API is to make it easy for computers to work with the company's data. With ease of use in mind, one company may decide to have a single URL for all the data and make it searchable (sort of like having one folder for all your photos). Another may decide to give each piece of data its own URL, organized in a hierarchy (like having folders and subfolders sorting photos by date, location, or event). Each company chooses the best way to structure its API for its particular situation, guided by existing industry best practices.

SOAP provides a very structured architecture. The structure provides system reliability and standard extensions for adding functionality to the protocol, and it makes it possible for tools to generate code, saving development time.

REST, which stands for Representational State Transfer, is a more open approach, providing lots of conventions but leaving many decisions to the person designing the API.

REST and SOAP don't necessarily compete, however, since they tend to apply to different use cases. You're likely to see REST in scenarios where simplicity, efficiency, and scalability are prioritized, as well as in public APIs, like AI modules that third-party developers can deploy on their own websites and apps. Meanwhile, SOAP, with its more rigid rules and standards, may be preferred for enterprise-level applications where standardized messaging, security, and transactions are essential, such as transferring sensitive customer data between separate financial institutions.

Throughout this guide, you may have noticed we've had an inclination for REST APIs. The preference is largely due to REST's incredible rate of adoption. This is not to say that SOAP is evil; it has its strong points. However, the focus of our discussion will stay on REST as this will likely be the kind of API you encounter. In the remaining sections, we will walk through the components that make up a REST API.

An infographic comparing API design styles: REST vs. SOAP

How a REST API works

Earlier on, we talked a little bit about resources. Recall that resources are the nouns of APIs (customers and pizzas). These are the things we want the world to be able to interact with through our API.

To get a feel for how a company would design an API, let's try our hand at it with our pizza parlor. We'll start by adding the ability to order a pizza.

For the client to be able to talk pizzas with us, we need to do several things:

Decide what resource(s) need to be available.
Assign URLs to those resources.
Decide what actions the client should be allowed to perform on those resources.
Figure out what pieces of data are required for each action and what format they should be in.

Step 1: Picking resources

Picking resources can be a difficult first task. One way to approach the problem is to step through what a typical interaction involves. For our pizza parlor, we probably have a menu. On that menu are pizzas. When a customer wants us to make one of the pizzas for them, they place an order. In this context, menu, pizza, customer, and order all sound like good candidates for resources. Let's start with order.

Step 2: Assigning URLs

The next step is assigning URLs to the resource. There are lots of possibilities, but luckily REST conventions give some guidance. In a typical REST API, a resource will have two URL patterns assigned to it. The first is the plural of the resource name, like /orders. The second is the plural of the resource name plus a unique identifier to specify a single resource, like /orders/<order_id>, where <order_id> is the unique identifier for an order. These two URL patterns make up the first endpoints that our API will support. These are called endpoints simply because they go at the end of the URL, as in http://example.com/<endpoint_goes_here>.

Step 3: Deciding client actions

Now that we've picked our resource and assigned it URLs, we need to decide what actions the client can perform. Following REST conventions, we say that the plural endpoint (/orders) is for listing existing orders and creating new ones. The plural with a unique identifier endpoint (/orders/<order_id>) is for retrieving, updating, or canceling a specific order. The client tells the server which action to perform by passing the appropriate HTTP verb (GET, POST, PUT, or DELETE) in the request.

HTTP verb	Endpoint	Action
GET	/orders	List existing orders
POST	/orders	Place a new order
GET	/orders/1	Get details for order #1
GET	/orders/2	Get details for order #2
PUT	/orders/1	Update order #1
DELETE	/orders/1	Cancel order #1
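The table above can be read as a routing table: a verb plus a URL pattern picks an action. Here is a minimal sketch of that idea. The action names (list_orders and so on) are labels we made up, and a real server framework would handle this matching for you.

```python
import re

# Each row mirrors a line of the table: HTTP verb, URL pattern, action name.
ROUTES = [
    ("GET", r"^/orders$", "list_orders"),
    ("POST", r"^/orders$", "create_order"),
    ("GET", r"^/orders/(\d+)$", "get_order"),
    ("PUT", r"^/orders/(\d+)$", "update_order"),
    ("DELETE", r"^/orders/(\d+)$", "cancel_order"),
]


def resolve(method, path):
    """Return (action, order_id) for a request, or None if nothing matches."""
    for route_method, pattern, action in ROUTES:
        match = re.match(pattern, path)
        if route_method == method and match:
            order_id = int(match.group(1)) if match.groups() else None
            return action, order_id
    return None


print(resolve("GET", "/orders"))       # ('list_orders', None)
print(resolve("GET", "/orders/2"))     # ('get_order', 2)
print(resolve("DELETE", "/orders/1"))  # ('cancel_order', 1)
```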

Step 4: Identifying data to exchange

With the actions for our order endpoints fleshed out, the final step is to decide what data needs to be exchanged between the client and the server. Borrowing from our pizza parlor example, we can say that an order needs a crust and toppings. We also need to select a data format that the client and server can use to pass this information back and forth. XML and JSON are both good choices, but for readability's sake, we'll go with JSON.

At this point, you should pat yourself on the back; we have designed a functional API! Here is what an interaction between the client and server might look like using this API:

A graphic example of what an interaction between the client and server might look like using this API
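For readers who prefer code to pictures, the same kind of interaction can be sketched with an in-memory stand-in for the server. The handle function, the "id" field, and the status codes it returns are our assumptions for illustration; only the crust and toppings fields come from the design above.

```python
import json

orders = {}  # in-memory stand-in for the server's order storage


def handle(method, path, body=None):
    """Return (status_code, json_body) for a small subset of the order endpoints."""
    if method == "POST" and path == "/orders":
        order_id = len(orders) + 1
        orders[order_id] = json.loads(body)
        return 201, json.dumps({"id": order_id, **orders[order_id]})
    if method == "GET" and path.startswith("/orders/"):
        order_id = path.rsplit("/", 1)[1]
        if order_id.isdigit() and int(order_id) in orders:
            return 200, json.dumps({"id": int(order_id), **orders[int(order_id)]})
    return 404, json.dumps({"error": "not found"})


# The client places an order, then asks for its details.
new_order = json.dumps({"crust": "thin", "toppings": ["cheese"]})
status, reply = handle("POST", "/orders", new_order)
print(status, reply)
status, reply = handle("GET", "/orders/1")
print(status, reply)
```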

Linking resources together

Our pizza parlor API is looking sharp. Orders are coming in like never before. Business is so good, in fact, we decide we want to start tracking orders by customer to gauge loyalty. An easy way to do this is to add a new customer resource.

Just like with orders, our customer resource needs some endpoints. Following convention, /customers and /customers/<id> fit nicely. We'll skip the details, but let's say we decide which actions make sense for each endpoint and what data represents a customer. Assuming we do all of that, we come to an interesting question: how do we associate orders with customers?

REST practitioners are split on how to solve the problem of associating resources. Some say that the hierarchy should continue to grow, giving endpoints like /customers/5/orders for all of customer #5's orders and /customers/5/orders/3 for customer #5's third order. Others argue for keeping things flat by including associated details in the data for a resource. Under this paradigm, creating an order requires a customer_id field to be sent with the order details. Both solutions are used by REST APIs in the wild, so it is worth knowing about each.
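The two approaches can be compared side by side in a short sketch. The order records and both helper functions are hypothetical; the point is only that the customer can be identified either by the URL or by a field in the data.

```python
# Hypothetical order records; customer_id links each order to a customer.
ORDERS = [
    {"id": 1, "customer_id": 5, "crust": "thin"},
    {"id": 2, "customer_id": 7, "crust": "thick"},
    {"id": 3, "customer_id": 5, "crust": "thin"},
]


def nested_lookup(path):
    """Hierarchical style: the customer is part of the URL (/customers/5/orders)."""
    _, customer_id, _ = path.strip("/").split("/")
    return [order for order in ORDERS if order["customer_id"] == int(customer_id)]


def flat_lookup(customer_id):
    """Flat style: the customer is a field carried in each order's data."""
    return [order for order in ORDERS if order["customer_id"] == customer_id]


print([order["id"] for order in nested_lookup("/customers/5/orders")])  # [1, 3]
print([order["id"] for order in flat_lookup(5)])                        # [1, 3]
```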

Searching data

As data in a system grows, endpoints that list all records become impractical. Imagine if our pizza parlor had three million completed orders and you wanted to find out how many had pepperoni as a topping. Sending a GET request to /orders and receiving all three million orders would not be very helpful. Thankfully, REST has a nifty way of searching through data.

URLs have another component that we have not mentioned yet: the query string. Query means search and string means text. The query string is a bit of text that goes onto the end of a URL to pass things along to the API. For example, everything after the question mark is the query string in http://example.com/orders?key=value.

REST APIs use the query string to define the details of a search. These details are called query parameters. The API dictates what parameters it will accept, and the exact names of those parameters need to be used. Our pizza parlor API could allow the client to search for orders by topping by using this URL: http://example.com/orders?topping=pepperoni. The client can include multiple query parameters by listing one after another, separating them by an ampersand ("&"). For example: http://example.com/orders?topping=pepperoni&crust=thin.
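Building and reading a query string by hand is error-prone, so most languages ship helpers for it. The sketch below uses Python's standard urllib.parse module with the example parameters from above.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Build the URL from a dictionary of query parameters.
params = {"topping": "pepperoni", "crust": "thin"}
url = "http://example.com/orders?" + urlencode(params)
print(url)  # http://example.com/orders?topping=pepperoni&crust=thin

# Going the other way: pull the parameters back out of a URL.
parsed = parse_qs(urlsplit(url).query)
print(parsed["topping"][0], parsed["crust"][0])  # pepperoni thin
```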

Another use of the query string is to limit the amount of data returned in each request. Often, APIs will split results into sets (say, 100 or 500 records) and return one set at a time. This process of splitting up the data is known as pagination (an analogy to breaking up words into pages for books). To allow the client to page through all the data, the API will support query parameters that allow the client to specify which page of data it wants. In our pizza parlor API, we can support paging by allowing the client to specify two parameters: page and size. If the client makes a request like GET /orders?page=2&size=200, we know they want the second page of results, with 200 results per page, so orders 201-400.
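The arithmetic behind page and size is worth seeing once. This is a minimal sketch that assumes pages are numbered starting at 1, as in the example above; some APIs start at 0 or use other parameter names.

```python
def page_bounds(page, size):
    """Return the first and last record numbers (1-based) on a page."""
    first = (page - 1) * size + 1
    last = page * size
    return first, last


def paginate(records, page, size):
    """Return the slice of records that belongs to the requested page."""
    start = (page - 1) * size
    return records[start:start + size]


print(page_bounds(page=2, size=200))  # (201, 400)

order_numbers = list(range(1, 1001))  # pretend we have orders 1 through 1000
second_page = paginate(order_numbers, page=2, size=200)
print(second_page[0], second_page[-1])  # 201 400
```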

Recap

In this section, we learned how to design a REST API. We showed the basic functions an API supports and how to organize the data so that it can be easily consumed by a computer.

The key terms we learned were:

SOAP: API architecture known for standardized message formats
REST: API architecture that centers around manipulating resources
Resource: API term for a business noun like customer or order
Endpoint: A URL that makes up part of an API. In REST, each resource gets its own endpoints
Query string: A portion of the URL that is used to pass data to the server
Query parameters: Key-value pairs found in the query string (topping=cheese)
Pagination: Process of splitting up results into manageable chunks

Real-time API communication

We learned about designing APIs by building our own. At this point, we have a lot of hard-earned knowledge, and it's time for it to start paying off. We are ready to see how we can put APIs to work for us. In this section, we learn four ways to achieve real-time communication through APIs.

What is a real-time API?

Real-time APIs are APIs that enable near-instantaneous communication between systems, allowing users to get virtually immediate responses to their actions. Where a REST API, as we discussed in the last section, uses the request-response model of HTTP, a real-time API keeps communication open between clients and servers. Since this guide is meant to be a primer on APIs in general, we won't get too nitty-gritty with the details of how this type of API works right now.

Obviously, real-time communication can be a huge advantage, as it can continually update users with the latest data without requiring them to request it. It also makes APIs more scalable for growing enterprises while making systems more efficient and improving data accuracy and timeliness.

An infographic showing types of real-time APIs

Real-time API examples

Real-time APIs have a huge range of use cases—many of which you probably benefit from without even realizing it. As technology integrates even further into our daily lives, they're likely to grow even more ubiquitous.

Here are just a few common real-time API examples.

Internet of Things (IoT) devices: That Bluetooth thermostat in your home? Real-time APIs allow you to monitor the temperature and change it with the tap of a finger.
Push notifications: Every time you get a Google Calendar notification on your phone that you're about to start a meeting you forgot about, you're using real-time APIs.
Live chat: Your favorite chat apps rely on real-time APIs to send communications from one user to another the instant they hit the send button.
Geolocation: In order to track your location as you drive, GPS apps need real-time APIs.
Live data feeds: Those Wall Street brokers you see in the movies need real-time APIs to watch stock prices rise and dip on that revolving digital stock ticker.
AI applications: APIs facilitate real-time communication between applications and AI models, allowing these applications to analyze human-written text, process it, and deliver responses based on data from existing datasets.

Real-time API integrations

Let's remind ourselves why APIs are useful. We said that APIs make it easy to share data between two systems (websites, desktops, smartphones). Straightforward sharing allows us to link systems together to form an integration. People like integrations because they make life easier. With an integration, you can do something in one system and the other will automatically update.

For our purposes, we will split integrations into two broad categories. The first we call "client-driven," where a person interacts with the client and wants the server's data to update. The other we call "server-driven," where a person does something on the server and needs the client to be aware of the change.

Client-driven and server-driven are our terms, so don't be surprised if you use one in front of a developer and get only a blank stare in return. Mention polling or webhooks if you want instant credibility.

The reason for dividing integrations in this manner comes down to one simple fact: the client is the only one who can initiate communication. Remember, the client makes requests and the server just responds. A consequence of this limitation is that changes are easy to send from the client to the server, but hard to send in the reverse direction.

Client-driven integration

To demonstrate why client-driven integrations are easy, let's turn to our trusty pizza parlor and its API for ordering pizzas. Say we release a smartphone app that uses the API. In this scenario, the pizza parlor API is the server and the smartphone app is the client. A customer uses the app to choose a pizza and then hits a button to place the order. As soon as the button is pressed, the app knows it needs to make a request to the pizza parlor API.

A graphic showing client-driven interaction for APIs

More generally speaking, when a person interacts with the client, the client knows exactly when data changes, so it can call the API immediately to let the server know. There's no delay (since it's real time) and the process is efficient because only one request is made for each action a person takes.

Server-driven integration

Once the pizza order is placed, the customer might want to know when the pizza is ready. How do we use the API to provide them with updates? Well, that is a bit harder. The customer has nothing to do with making the pizza. They are waiting on the pizza parlor to prepare the pizza and update the order status. In other words, the data is changing on the server and the client needs to know about it. Yet, if the server can't make requests, we appear to be stuck.

Solving this type of problem is where we utilize the second category of integrations. There are a number of solutions software developers use to get around the client-only requests limitation. Let's take a look at each.

Polling

When the client is the only one who can make requests, the simplest solution to keep it up-to-date with the server is for the client to simply ask the server for updates. This can be accomplished by repeatedly requesting the same resource, a technique known as polling.

With our pizza parlor, polling for the status of an order might look like the following.

A graphic example of polling for the status of an order

In this approach, the more frequently the client makes requests (polls), the closer the client gets to real-time communication. If the client polls every hour, at worst, there could be a one-hour delay between a change happening on the server and the client becoming aware of it. Poll every minute, instead, and the client and server effectively stay in sync.

Of course, there is one big flaw with this solution. It is terribly inefficient. Most of the requests the client makes are wasted because nothing has changed. Worse, to get updates sooner, the polling interval has to be shortened, causing the client to make more requests and become even more inefficient. This solution does not scale well.
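A polling loop, and its wasted requests, can be sketched with a fake server. FakePizzaServer is an invented stand-in that reports "baking" until a set number of requests have arrived; a real client would make HTTP requests instead.

```python
import time


class FakePizzaServer:
    """Stand-in for the pizza parlor API; the status changes after a few requests."""

    def __init__(self, ready_after):
        self.requests_seen = 0
        self.ready_after = ready_after

    def get_order_status(self):
        self.requests_seen += 1
        return "ready" if self.requests_seen >= self.ready_after else "baking"


def poll_until_ready(server, interval_seconds, max_attempts):
    """Ask for the status repeatedly; return the attempt on which it was ready."""
    for attempt in range(1, max_attempts + 1):
        if server.get_order_status() == "ready":
            return attempt
        time.sleep(interval_seconds)  # wait before asking again
    return None


server = FakePizzaServer(ready_after=4)
attempts = poll_until_ready(server, interval_seconds=0.01, max_attempts=10)
print(attempts)  # 4: the first three requests learned nothing new
```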

Long polling

If requests were free, then nobody would care about efficiency and everyone could just use polling. Unfortunately, handling requests comes at a cost. For an API to handle more requests, it needs to utilize more servers, which costs more money. Scale this cumbersome situation up to Google- or Facebook-sized proportions, and you're paying a lot for inefficiency. Hence, lots of effort has been put into optimizing the way the client can receive updates from the server.

One optimization, which builds off of polling, is called long polling. Long polling uses the same idea of the client repeatedly asking the server for updates, but with a twist: the server does not respond immediately. Instead, the server waits until something changes, then responds with the update.

Let's revisit the polling example from above, but this time with a server that uses the long polling trick.

A graphic showing waiting for the long polling trick

This technique is pretty clever. It obeys the rule of the client making the initial request while leveraging the fact that there is no rule against the server being slow to respond. As long as both the client and the server agree that the server will hold on to the client's request, and the client is able to keep its connection to the server open, it will work.
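The "hold the request until something changes" idea can be simulated in a single program with a thread-safe event. LongPollServer is a hypothetical stand-in, and the timer plays the role of the kitchen finishing the pizza; a real implementation would hold an open HTTP connection instead.

```python
import threading


class LongPollServer:
    """Stand-in server that holds a request until the order status changes."""

    def __init__(self):
        self._status = "baking"
        self._changed = threading.Event()

    def wait_for_update(self, timeout):
        """Block until the status changes or the timeout passes."""
        if self._changed.wait(timeout):
            return self._status
        return None  # nothing changed; the client would send a fresh request

    def set_status(self, status):
        self._status = status
        self._changed.set()  # releases any request that is being held


server = LongPollServer()
# Simulate the kitchen finishing the pizza shortly after the client asks.
threading.Timer(0.05, server.set_status, args=["ready"]).start()
result = server.wait_for_update(timeout=2.0)
print(result)
```

The client makes a single request and gets its answer as soon as the status changes, rather than asking repeatedly.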

As resourceful as long polling is, it too has some drawbacks. We'll skip the technical details, but there are concerns like how many requests the server can hold onto at a time or how to recover if the client or server loses its connection. For now, we'll say that for some scenarios, neither form of polling is sufficient.

Webhooks

With polling ruled out, some innovative software developers thought, "if all our trouble is because the client is the only one making requests, why not remove that rule?" So they did. The result was webhooks, a technique where the client both makes requests and listens for them, allowing the server to easily push updates to it.

If this sounds like cheating because now we have the server making requests to the client, don't worry. What makes webhooks work is that the client becomes a server too. From a technical perspective, it's sometimes very easy to extend the client's functionality to also listen for requests, enabling two-way communication.

Let's look at the basics of webhooks. In their simplest form, webhooks require the client to provide a callback URL where it can receive events, and the server to have a place for a person to enter that callback URL. Then, whenever something changes on the server, the server can send a request to the client's callback URL to let the client know about the update.

For our pizza parlor, the flow might look a little something like the following.

This solution is excellent. Changes happening on the server are sent instantly to the client, so you have true real-time communication. Also, webhooks are efficient since there's only one request per update.
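A bare-bones simulation of that flow is sketched below. There is no real network here: the LISTENERS dictionary stands in for it by mapping a callback URL to the client function that would receive the request. All names and the URL are hypothetical.

```python
received = []  # events the client has been told about


def client_callback(event):
    """The client's listener: runs when the server calls the callback URL."""
    received.append(event)


# Stand-in for the network: maps a callback URL to the client's listener.
LISTENERS = {"https://client.example.com/hooks/orders": client_callback}


class PizzaServer:
    def __init__(self, callback_url):
        self.callback_url = callback_url  # entered by a person during setup

    def update_order(self, order_id, status):
        event = {"order_id": order_id, "status": status}
        LISTENERS[self.callback_url](event)  # one request per update


server = PizzaServer("https://client.example.com/hooks/orders")
server.update_order(1, "baking")
server.update_order(1, "ready")
print(received[-1])
```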

Subscription webhooks

Building on the idea of webhooks, there have been a variety of solutions that aim to make the setup process dynamic and not require a person to manually enter a callback URL on the server. You might hear names like HTTP Subscriptions Specification, Restful Webhooks, REST Hooks, and PubSubHubbub. What all of these solutions try to do is define a subscription process, where the client can tell the server what events it is interested in and what callback URL to send updates to.

Each solution has a slightly different take on the problem, but the general flow looks like the following.

A graphic representing subscription webhooks
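In code, the subscription step might look something like this sketch. It does not follow any one of the specifications named above; the class, the event names, and the callback URL are invented, and deliveries are recorded in a list instead of being sent over HTTP.

```python
from collections import defaultdict


class SubscriptionServer:
    """Stand-in server where clients register callback URLs per event type."""

    def __init__(self):
        self._subscriptions = defaultdict(list)
        self.deliveries = []  # (callback_url, event) records instead of real HTTP

    def subscribe(self, event_type, callback_url):
        """The client calls this itself; no person has to enter the URL."""
        self._subscriptions[event_type].append(callback_url)

    def publish(self, event_type, payload):
        """Notify every callback URL subscribed to this event type."""
        for callback_url in self._subscriptions[event_type]:
            self.deliveries.append((callback_url, {"type": event_type, **payload}))


server = SubscriptionServer()
server.subscribe("order.ready", "https://client.example.com/hooks/orders")
server.publish("order.ready", {"order_id": 1})
server.publish("order.cancelled", {"order_id": 2})  # nobody subscribed to this
print(server.deliveries)
```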

Subscription-based webhooks hold a lot of promise. They are efficient, real-time, and easy for people to use. Similar to REST's explosive adoption, a tide is rising behind the movement, and it's becoming more common for APIs to support some form of webhooks.

Still, there will likely be a place for polling and long polling for the foreseeable future. Not all clients can also act as servers. Smartphones are a great example where technical constraints rule out webhooks as a possibility. As technology progresses, new ideas will emerge for how to make real-time communication easier between all kinds of devices.

Recap

In this section, we grouped integrations into two broad categories: client-driven and server-driven. We saw how APIs can be used to provide real-time updates between two systems, as well as some of the challenges.

The key terms we learned were:

Polling: Repeatedly requesting a resource at a short interval
Long polling: Polling, but with a delayed response; improves efficiency
Webhooks: When the client gives the server a callback URL, so the server can post updates in real time
Subscription webhooks: Informal name for solutions that make setting up webhooks automatic

API implementation

You made it. You now know everything there is to know about APIs...at an introductory level, at least. So, with all this acquired knowledge, how can you put it to good use? In this section, we explore how to turn knowledge into working software by outlining three foundational components of any API implementation.

API documentation

As we have seen throughout this guide, an API interaction involves two sides. When we are talking at the code level, though, what we are really saying is that we need two programs that implement the API. A program implements an API when it follows the API's rules. In our pizza parlor example, a client that can make requests to the /orders endpoint using the correct headers and data format would be a client that implements the pizza parlor's API.

The server program is the responsibility of the company publishing the API. We looked at the process behind designing the API. After planning, the next step is for the company to implement their side by writing software that follows the design. The last step is to put the resulting program on a server.

Along with the server software, the company publishes documentation for the API. The documentation is one or more documents—typically webpages or PDFs—that explain how to use the API. It includes information like what authentication scheme to use, what endpoints are available, and how the data is formatted. It may also include example responses, code snippets, and an interactive console to play with the available endpoints. Documentation is important because it acts as a guide for building clients. It's where someone interested in using the API goes to learn how the API works.

With documentation in hand, there are a number of ways you can begin to use an API as a client. Let's examine three of those now.

Testing an API using HTTP clients

An easy way to start using an API is with an HTTP client, a generic program that lets you quickly build HTTP requests to test with. You specify the URL, headers, and body, and the program sends it to the server properly formatted. These types of programs come in all sorts of flavors, including web apps, desktop apps, web browser extensions, and more.
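Under the hood, an HTTP client assembles the same three pieces you type into its form. As a rough sketch, here is that assembly done with Python's standard library for our pizza parlor example. The request is built but never sent, and the order details are made up.

```python
import json
import urllib.request

# The three things you specify in an HTTP client: URL, headers, and body.
body = json.dumps({"crust": "thin", "toppings": ["pepperoni"]}).encode("utf-8")
request = urllib.request.Request(
    "http://example.com/orders",
    data=body,
    headers={"Content-Type": "application/json"},
    method="POST",
)

print(request.get_method(), request.full_url)  # POST http://example.com/orders
print(request.data.decode("utf-8"))
```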

The nice thing about generic HTTP clients is that you do not have to know how to program to use one. With the skills you've attained through this guide, you now have the ability to read a company's API documentation and figure out the request you need to make to get the data you want. This small learning curve makes generic clients great for exploration and quick one-off tasks.

There are a couple of downsides to this approach, however. First, you usually can't save your work. After you close the program, the requests you made are forgotten and you have to rebuild them the next time you need them. Another disadvantage is that you typically can't do much with the data you get back, other than look at it. At best, you have the ability to save the data into a file, after which it's up to you to do something interesting with it.

Writing API client code

To really harness the power of an API, you will eventually need custom software. This is where programming comes in. Since coding is a discipline unto itself, we won't attempt to cover everything about software development, but we can give you some guidance for what writing an API client involves.

The first requirement is to gain some familiarity with a programming language. There are a bunch out there, each with its strengths and weaknesses. For simplicity's sake, it is probably better to stick to an interpreted language (JavaScript, Python, PHP, Ruby, or similar) instead of a compiled language (C or C++).

If you aren't sure which language to choose, a great way to narrow down the selection can be to find an API you want to implement and see if the company provides a client library. A library is code that the API owner publishes that already implements the client side of their API. Sometimes the library will be individually available for download or it will be bundled in an SDK (software development kit). Using a library saves you time because instead of reading the API documentation and forming raw HTTP requests, you can simply copy and paste a few lines of code and already have a working client.

After you settle on a language, you need to decide where the code will run. If you are automating your own tasks, running the software from your work computer might be acceptable. More frequently, you will want to run the code on a computer better suited for acting as a web server. There are quite a few solutions available, including running your code on shared hosting environments, cloud services (like Amazon Web Services), or even on your own physical servers at a data center.

A third important decision is to determine what you'll do with the data. Saving results into a file is easy enough, but if you want to store the data in a database or send it to another application, things become more complex. Pulling data out of a database to send to an API can also be challenging.
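
As a rough illustration of the two destinations mentioned above, the following Python sketch takes a JSON response body and turns it into CSV text (ready to write to a file) and into rows in a SQLite database. The sample data, the field names, and the `customers` table are invented for this example; a real API response would have its own shape.

```python
import csv
import io
import json
import sqlite3

# A made-up response body standing in for what an API might return.
SAMPLE_RESPONSE = (
    '[{"id": 1, "name": "Ada", "plan": "pro"},'
    ' {"id": 2, "name": "Grace", "plan": "free"}]'
)


def to_csv(records: list[dict]) -> str:
    """Render records as CSV text, ready to write to a file."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=["id", "name", "plan"], lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


def to_database(records: list[dict], connection: sqlite3.Connection) -> int:
    """Store records in a table and return the resulting row count."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS customers "
        "(id INTEGER PRIMARY KEY, name TEXT, plan TEXT)"
    )
    # INSERT OR REPLACE keeps a rerun from failing on duplicate ids.
    connection.executemany(
        "INSERT OR REPLACE INTO customers (id, name, plan) "
        "VALUES (:id, :name, :plan)",
        records,
    )
    connection.commit()
    return connection.execute("SELECT COUNT(*) FROM customers").fetchone()[0]


records = json.loads(SAMPLE_RESPONSE)
csv_text = to_csv(records)
connection = sqlite3.connect(":memory:")  # a file path would persist the data
row_count = to_database(records, connection)
print(csv_text)
print(row_count)
```

The file route needs only a fixed list of columns, while the database route also requires deciding on a table layout and what to do when the same record arrives twice. That extra decision-making is part of the added complexity.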

At this point, we can pause and remind you to not be too intimidated by all this new information. You should not expect to know everything about implementing APIs on your first attempt. Take solace in knowing that there are people who can help (open source communities, developers for hire, and potential project collaborators) and lots of resources available online to facilitate learning.

Once you master the basics, there are plenty more topics to learn about in the rich realm of software development. For now, if you succeed at learning a programming language and getting a library up and running, you should celebrate. You will be well on your way to making the most of APIs!

An infographic describing how to write API client code

API AI

While AI may not be completely automating all phases of API generation on its own (yet, at least), it can be an incredibly useful tool for API developers. Here are a few ways AI is being used in APIs today.

Generating code: AI-powered models can translate human language descriptions of API functionalities into corresponding code—as in, just describe what you want the API to do, and AI can spit out the code to make it happen.
API design: Developers can use machine learning algorithms to analyze model APIs, identify patterns, and suggest improvements, helping them design sounder APIs faster.
API integration: Machine learning-informed algorithms can learn from documentation to suggest operations, parameters, and instructions for executing specific use cases.
Troubleshooting: When problems arise, AI can assist in identifying issues and generating code for processes like exception handling and status code management.
Security: AI can generate code snippets or configurations for security measures like authentication, authorization, and encryption within the API.
Template-based coding: Just define code templates, and AI can fill in the details based on context and patterns from training data.
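
For a sense of what the exception handling and status code management in the troubleshooting item can look like, here is a minimal Python sketch, whether such code is written by hand or generated with AI assistance. The grouping of status codes and the wording of each outcome are assumptions chosen for illustration; a real client would follow the API's own documentation.

```python
import urllib.error
import urllib.request
from typing import Optional


def describe_status(status: int) -> str:
    """Map an HTTP status code to a rough next step for the client."""
    if 200 <= status < 300:
        return "success"
    if status in (401, 403):
        return "check credentials"
    if status == 404:
        return "not found"
    if status == 429 or 500 <= status < 600:
        return "retry later"
    return "unexpected"


def fetch(url: str) -> tuple[str, Optional[bytes]]:
    """Request a URL and report the outcome instead of crashing."""
    try:
        with urllib.request.urlopen(url, timeout=10) as response:
            return describe_status(response.status), response.read()
    except urllib.error.HTTPError as error:
        # urlopen raises HTTPError for 4xx and 5xx responses.
        return describe_status(error.code), None
    except urllib.error.URLError:
        # No HTTP response at all: DNS failure, refused connection, etc.
        return "network problem", None


# Classify a few codes without touching the network.
for code in (200, 401, 404, 429, 503):
    print(code, describe_status(code))
```

`HTTPError` is caught before `URLError` because it is a subclass of it; reversing the order would hide the status code.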

Give Zapier a try

If coding is beyond your current skill set or time constraints, Zapier empowers you to easily interact with APIs. Zapier's Developer Platform offers a way for you to implement an API that you then interact with as an app on Zapier. By hitting a few buttons and filling out a few forms, you can implement nearly any API you want. Once you get started, you can even use webhooks to automate data transfer between apps.

What makes using the Developer Platform easy is that we've done a lot of the programming for you (and have even compiled all the documentation you need). Zapier has code in place to make requests—all you have to do is fill in the details. Think of using the platform a bit like using a generic HTTP client; you tell us a bit about the endpoints, and we'll do the rest.

The additional benefit is that once you have Zapier talking with an API, you have lots of options for what to do with the data you get back. You can even use the Zapier Platform to build your own Zapier integration (with or without code) securely and flexibly. Also, if you get stuck, you can reach out to the friendly support team, where you have API experts ready to help you out.

Ideas for using APIs at work

Think about ways you might be able to use an API in your working life. To get the juices flowing, here are a few ideas:

You need some quick stats from a SaaS (software as a service) application you use. Firing up an HTTP client to make a few requests could be a fast way to get the information you need.
You have a labor-intensive task that needs to get done and there isn't time to have a developer friend lend a hand. Grabbing a client library and creating a quick program could be a big timesaver.
You really want to move data between two internal apps on a continual basis, but you don't have the resources to build a client for each app from scratch, nor a good place to run that code. Using the Zapier Developer Platform could be a low-cost way to get the applications connected.

Recap

In this section, we discussed how an API design becomes working software. We talked about ways that you can begin using APIs.

The key terms we learned were:

Implement: Writing software that obeys the rules of an API
Documentation: Webpages, PDFs, and other documents that explain the rules of an API
Library: Code released by an API publisher that implements the client portion of their API

This article was originally published in April 2014. The most recent update, with contributions from Bryce Emley, was in January 2024.
