What is an application binary interface ABI

Question

I never clearly understood what an ABI is  Please don t point me to a Wikipedia article  If I could understand it  I wouldn t be here posting such a lengthy post   This is my mindset about different interfaces   A TV remote is an interface between the user and the TV  It is an existing entity  but useless  doesn t provide any functionality  by itself  All the functionality for each of those buttons on the remote is implemented in the television set      Interface  It is an  existing entity  layer between the   functionality and consumer of that functionality  An interface by itself   doesn t do anything  It just invokes the functionality lying behind       Now depending on who the user is there are different type of interfaces       Command Line Interface  CLI  commands are the existing entities    the consumer is the user and functionality lies behind       functionality  my software functionality which solves some   purpose to which we are describing this interface       existing entities  commands      consumer  user      Graphical User Interface GUI  window  buttons  etc  are the existing   entities  and again the consumer is the user and functionality lies behind       functionality  my software functionality which solves some problem to which we are describing this interface       existing entities  window  buttons etc        consumer  user      Application Programming Interface API  functions  or to be   more correct  interfaces  in interfaced based programming  are the   existing entities  consumer here is another program not a user  and again   functionality lies behind this layer       functionality  my software functionality which solves some   problem to which we are describing this interface       existing entities  functions  Interfaces  array of functions        consumer  another program application       Application Binary Interface  ABI  Here is where my problem starts       functionality           existing entities           consumer         I ve written software in different languages and provided different kinds of interfaces  CLI  GUI  and API   but I m not sure if I have ever provided any ABI    Wikipedia says      ABIs cover details such as         data type  size  and alignment    the calling convention  which controls how functions  arguments are   passed and return values retrieved    the system call numbers and how an application should make system calls   to the operating system          Other ABIs standardize details such as         the C   name mangling    exception propagation  and   calling convention between compilers on the same platform  but do   not require cross-platform compatibility        Who needs these details  Please don t say the OS  I know assembly programming  I know how linking  amp  loading works  I know exactly what happens inside  Why did C   name mangling come in  I thought we are talking at the binary level  Why do languages come in    Anyway  I ve downloaded the  PDF  System V Application Binary Interface Edition 4 1  1997-03-18  to see what exactly it contains  Well  most of it didn t make any sense    Why does it contain two chapters  4th  amp  5th  to describe the ELF file format  In fact  these are the only two significant chapters of that specification  The rest of the chapters are  processor specific   Anyway  I though that it is a completely different topic  Please don t say that ELF file format specifications are the ABI  It doesn t qualify to be an interface according to the definition  I know  since we are talking at such a low level it must be very specific  But I m not sure how is it  instruction set architecture  ISA   specific  Where can I find Microsoft Windows  ABI    So  these are the major queries that are bugging me

User · Answer

If you know assembly and how things work at the OS-level, you are conforming to a certain ABI. The ABI govern things like how parameters are passed, where return values are placed. For many platforms there is only one ABI to choose from, and in those cases the ABI is just "how things work".

However, the ABI also govern things like how classes/objects are laid out in C++. This is necessary if you want to be able to pass object references across module boundaries or if you want to mix code compiled with different compilers.

Also, if you have an 64-bit OS which can execute 32-bit binaries, you will have different ABIs for 32- and 64-bit code.

In general, any code you link into the same executable must conform to the same ABI. If you want to communicate between code using different ABIs, you must use some form of RPC or serialization protocols.

I think you are trying too hard to squeeze in different types of interfaces into a fixed set of characteristics. For example, an interface doesn't necessarily have to be split into consumers and producers. An interface is just a convention by which two entities interact.

ABIs can be (partially) ISA-agnostic. Some aspects (such as calling conventions) depend on the ISA, while other aspects (such as C++ class layout) do not.

A well defined ABI is very important for people writing compilers. Without a well defined ABI, it would be impossible to generate interoperable code.

EDIT: Some notes to clarify:

"Binary" in ABI does not exclude the use of strings or text. If you want to link a DLL exporting a C++ class, somewhere in it the methods and type signatures must be encoded. That's where C++ name-mangling comes in.
The reason why you never provided an ABI is that the vast majority of programmers will never do it. ABIs are provided by the same people designing the platform (i.e. operating system), and very few programmers will ever have the privilege to design a widely-used ABI.

User · Answer

ABI - Application Binary Interface is about a machine code communication in runtime between two binary parts like - application  library  OS    ABI describes how objects are saved in memory  how functions are called calling convention   mangling    A good example of API and ABI is iOS ecosystem with Swift language   Application layer - When you create an application using different languages  For example you can create application using Swift and Objective-C Mixing Swift and Objective-C   Application - OS layer - runtime - Swift runtime and standard libraries are parts of OS and they should not be included into each bundle e g  app  framework   It is the same as like Objective-C uses  Library layer - Module Stability case - compile time - you will be able to import a framework which was built with another version of Swift s compiler  It means that it is safety to create a closed-source pre-build  binary which will be consumed by a different version of compiler   swiftinterface is used with  swiftmodule  and you will not get Module compiled with   cannot be imported by the   compiler   Library layer - Library Evolution case  Compile time - if a dependency was changed  a client has not to be recompiled  Runtime - a system library or a dynamic framework can be hot-swapped by a new one      API vs ABI   Swift Module and Library stability

User · Answer

One easy way to understand  ABI  is to compare it to  API    You are already familiar with the concept of an API   If you want to use the features of  say  some library or your OS  you will program against an API   The API consists of data types structures  constants  functions  etc that you can use in your code to access the functionality of that external component   An ABI is very similar   Think of it as the compiled version of an API  or as an API on the machine-language level    When you write source code  you access the library through an API   Once the code is compiled  your application accesses the binary data in the library through the ABI   The ABI defines the structures and methods that your compiled application will use to access the external library  just like the API did   only on a lower level   Your API defines the order in which you pass arguments to a function   Your ABI defines the mechanics of how these arguments are passed  registers  stack  etc     Your API defines which functions are part of your library   Your ABI defines how your code is stored inside the library file  so that any program using your library can locate the desired function and execute it   ABIs are important when it comes to applications that use external libraries   Libraries are full of code and other resources  but your program has to know how to locate what it needs inside the library file   Your ABI defines how the contents of a library are stored inside the file  and your program uses the ABI to search through the file and find what it needs   If everything in your system conforms to the same ABI  then any program is able to work with any library file  no matter who created them   Linux and Windows use different ABIs  so a Windows program won t know how to access a library compiled for Linux   Sometimes  ABI changes are unavoidable   When this happens  any programs that use that library will not work unless they are re-compiled to use the new version of the library   If the ABI changes but the API does not  then the old and new library versions are sometimes called  source compatible    This implies that while a program compiled for one library version will not work with the other  source code written for one will work for the other if re-compiled   For this reason  developers tend to try to keep their ABI stable  to minimize disruption    Keeping an ABI stable means not changing function interfaces  return type and number  types  and order of arguments   definitions of data types or data structures  defined constants  etc   New functions and data types can be added  but existing ones must stay the same   If  for instance  your library uses 32-bit integers to indicate the offset of a function and you switch to 64-bit integers  then already-compiled code that uses that library will not be accessing that field  or any following it  correctly   Accessing data structure members gets converted into memory addresses and offsets during compilation and if the data structure changes  then these offsets will not point to what the code is expecting them to point to and the results are unpredictable at best   An ABI isn t necessarily something you will explicitly provide unless you are doing very low-level systems design work   It isn t language-specific either  since  for example  a C application and a Pascal application can use the same ABI after they are compiled   Edit  Regarding your question about the chapters regarding the ELF file format in the SysV ABI docs   The reason this information is included is because the ELF format defines the interface between operating system and application   When you tell the OS to run a program  it expects the program to be formatted in a certain way and  for example  expects the first section of the binary to be an ELF header containing certain information at specific memory offsets   This is how the application communicates important information about itself to the operating system   If you build a program in a non-ELF binary format  such as a out or PE   then an OS that expects ELF-formatted applications will not be able to interpret the binary file or run the application   This is one big reason why Windows apps cannot be run directly on a Linux machine  or vice versa  without being either re-compiled or run inside some type of emulation layer that can translate from one binary format to another   IIRC  Windows currently uses the Portable Executable  or  PE  format   There are links in the  external links  section of that Wikipedia page with more information about the PE format   Also  regarding your note about C   name mangling   When locating a function in a library file  the function is typically looked up by name   C   allows you to overload function names  so name alone is not sufficient to identify a function   C   compilers have their own ways of dealing with this internally  called name mangling   An ABI can define a standard way of encoding the name of a function so that programs built with a different language or compiler can locate what they need   When you use extern  c  in a C   program  you re instructing the compiler to use a standardized way of recording names that s understandable by other software

User · Answer

Functionality  A set of contracts which affect the compiler  assembly writers  the linker  and the operating system  The contracts specify how functions are laid out  where parameters are passed  how parameters are passed  how function returns work  These are generally specific to a  processor architecture  operating system  tuple   Existing entities  parameter layout  function semantics  register allocation  For instance  the ARM architectures has numerous ABIs  APCS  EABI  GNU-EABI  never mind a bunch of historical cases  - using the a mixed ABI will result in your code simply not working when calling across boundaries   Consumer  The compiler  assembly writers  operating system  CPU specific architecture   Who needs these details  The compiler  assembly writers  linkers which do code generation  or alignment requirements   operating system  interrupt handling  syscall interface   If you did assembly programming  you were conforming to an ABI   C   name mangling is a special case - its a linker and dynamic linker centered issue - if name mangling is not standardized  then dynamic linking will not work  Henceforth  the C   ABI is called just that  the C   ABI  It is not a linker level issue  but instead a code generation issue  Once you have a C   binary  it is not possible to make it compatible with another C   ABI  name mangling  exception handling  without recompiling from source   ELF is a file format for the use of a loader and dynamic linker  ELF is a container format for binary code and data  and as such specifies the ABI of a piece of code  I would not consider ELF to be an ABI in the strict sense  as PE executables are not an ABI   All ABIs are instruction set specific  An ARM ABI will not make sense on an MSP430 or x86 64 processor   Windows has several ABIs - for instance  fastcall and stdcall are two common use ABIs  The syscall ABI is different again

User · Answer

In order to call code in shared libraries  or call code between compilation units  the object file needs to contain labels for the calls  C   mangles the names of method labels in order to enforce data hiding and allow for overloaded methods  That is why you cannot mix files from different C   compilers unless they explicitly support the same ABI

User · Answer

The best way to differentiate between ABI and API is to know why and what is it used for   For x86-64 there is generally one ABI  and for x86 32-bit there is another set    http   www x86-64 org documentation abi pdf  https   developer apple com library mac documentation DeveloperTools Conceptual LowLevelABI 140-x86-64 Function Calling Conventions x86 64 html  http   people freebsd org  obrien amd64-elf-abi pdf  Linux   FreeBSD   MacOSX follow it with some slight variations    And Windows x64 have its own ABI   http   eli thegreenplace net 2011 09 06 stack-frame-layout-on-x86-64   Knowing the ABI and assuming other compiler follows it as well  then the binaries theoretically know how to call each other  libraries API in particular  and pass parameters over the stack or by registers etc    Or what registers will be changed upon calling the functions etc    Essentially these knowledge will help software to integrate with one another   Knowing the order of the registers   stack layout I can easily piece together different software written in assemblies together without much problem   But API are different   It is a high level functions names  with argument defined  such that if different software pieces build using these API  MAY be able to call into one another    But an additional requirement of SAME ABI must be adhered to      For example  Windows used to be POSIX API compliant   https   en wikipedia org wiki Windows Services for UNIX    https   en wikipedia org wiki POSIX  And Linux is POSIX compliant as well    But the binaries cannot be just moved over and run immediately    But because they used the same NAMES in the POSIX compliant API  you can take the same software in C  recompile it in the different OS  and immediately get it running      API are meant to ease integration of software - pre-compilation stage    So after compilation the software can look totally different - if the ABI are different   ABI are meant to define exact integration of software at the binary   assembly level

User · Answer

I was also trying to understand ABI and JesperE   s answer was very helpful  From a very simple perspective  we may try to understand ABI by considering binary compatibility  KDE wiki defines a library as binary compatible    if a program linked dynamically to a former version of the library continues running with newer versions of the library without the need to recompile     For more on dynamic linking  refer Static linking vs dynamic linking Now  let   s try to look at just the most basic aspects needed for a library to be binary compatibility  assuming there are no source code changes to the library    Same backward compatible instruction set architecture  processor instructions  register file structure  stack organization  memory access types  along with sizes  layout  and alignment of basic data types the processor can directly access  Same calling conventions Same name mangling convention  this might be needed if say a Fortran program needs to call some C   library function    Sure  there are many other details but this is mostly what the ABI also covers  More specifically to answer your question  from the above  we can deduce   ABI functionality  binary compatibility existing entities  existing program libraries OS consumer  libraries  OS  Hope this helps

User · Answer

Let me at least answer a part of your question  With an example of how the Linux ABI affects the systemcalls  and why that is usefull    A systemcall is a way for a userspace program to ask the kernelspace for something  It works by putting the numeric code for the call and the argument in a certain register and triggering an interrupt  Than a switch occurs to kernelspace and the kernel looks up the numeric code and the argument  handles the request  puts the result back into a register and triggers a switch back to userspace  This is needed for example when the application wants to allocate memory or open a file  syscalls  brk  and  open     Now the syscalls have short names  brk   etc  and corresponding opcodes  these are defined in a system specific header file  As long as these opcodes stay the same you can run the same compiled userland programs with different updated kernels without having to recompile  So you have an interface used by precompiled binarys  hence ABI

User · Answer

In short and in philosophy  only things of a kind can get along well  and the ABI could be seen as the kind of which software stuff work together

User · Answer

Application binary interface  ABI   Functionality    Translation from the programmer s model to the underlying system s domain data type  size  alignment  the calling convention  which controls how functions  arguments are passed and return values retrieved  the system call numbers and how an application should make system calls to the operating system  the high-level language compilers  name mangling scheme  exception propagation  and calling convention between compilers on the same platform  but do not require cross-platform compatibility      Existing entities    Logical blocks that directly participate in program s execution  ALU  general purpose registers  registers for memory  I O mapping of I O  etc      consumer    Language processors linker  assembler      These are needed by whoever has to ensure that build tool-chains work as a whole  If you write one module in assembly language  another in Python  and instead of your own boot-loader want to use an operating system  then your  application  modules are working across  binary  boundaries and require agreement of such  interface    C   name mangling because object files from different high-level languages might be required to be linked in your application  Consider using GCC standard library making system calls to Windows built with Visual C     ELF is one possible expectation of the linker from an object file for interpretation  though JVM might have some other idea   For a Windows RT Store app  try searching for ARM ABI if you really wish to make some build tool-chain work together

User · Answer

Summary There are various interpretation and strong opinions of the exact layer that define an ABI  application binary interface   In my view an ABI is a subjective convention of what is considered a given platform for a specific API  The ABI is the  quot rest quot  of conventions that  quot will not change quot  for a specific API or that will be addressed by the runtime environment  executors  tools  linkers  compilers  jvm  and OS  Defining an Interface  ABI  API If you want to use a library like joda-time you must declare a dependency on joda-time- lt major gt   lt minor gt   lt patch gt  jar  The library follows best practices and use Semantic Versioning  This defines the API compatibility at three levels   Patch - You don t need to change at all your code  The library just fixes some bugs  Minor - You don t need to change your code since the additions Major - The interface  API  is changed and you might need to change your code   In order for you to use a new major release of the same library a lot of other conventions are still to be respected   The binary language used for the libraries  in Java cases the JVM target version that defines the Java bytecode  Calling conventions JVM conventions Linking conventions Runtime conventions All these are defined and managed by the tools we use   Examples Java case study For example  Java standardized all these conventions  not in a tool  but in a formal JVM specification  The specification allowed other vendors to provide a different set of tools that can output compatible libraries  Java provides two other interesting case studies for ABI  Scala versions and Dalvik virtual machine  Dalvik virtual machine broke the ABI The Dalvik VM needs a different type of bytecode than the Java bytecode  The Dalvik libraries are obtained by converting the Java bytecode  with same API  for Dalvik  In this way you can get two versions of the same API  defined by the original joda-time-1 7 2 jar  We could call it joda-time-1 7 2 jar and joda-time-1 7 2-dalvik jar  They use a different ABI one is for the stack-oriented standard Java vms  Oracle s one  IBM s one  open Java or any other  and the second ABI is the one around Dalvik  Scala successive releases are incompatible Scala doesn t have binary compatibility between minor Scala versions  2 X   For this reason the same API  quot io reactivex quot      quot rxscala quot     quot 0 26 5 quot  has three versions  in the future more   for Scala 2 10  2 11 and 2 12  What is changed  I don t know for now  but the binaries are not compatible  Probably the latest versions adds things that make the libraries unusable on the old virtual machines  probably things related to linking naming parameter conventions  Java successive releases are incompatible Java has problems with the major releases of the JVM too  4 5 6 7 8 9  They offer only backward compatibility  Jvm9 knows how to run code compiled targeted  javac s -target option  for all other versions  while JVM  4 doesn t know how to run code targeted for JVM  5  All these while you have one joda-library  This incompatibility flies bellow the radar thanks to different solutions   Semantic versioning  when libraries target higher JVM they usually change the major version  Use JVM 4 as the ABI  and you re safe  Java 9 adds a specification on how you can include bytecode for specific targeted JVM in the same library   Why did I start with the API definition  API and ABI are just conventions on how you define compatibility  The lower layers are generic in respect of a plethora of high level semantics  That s why it s easy to make some conventions  The first kind of conventions are about memory alignment  byte encoding  calling conventions  big and little endian encodings  etc  On top of them you get the executable conventions like others described  linking conventions  intermediate byte code like the one used by Java or LLVM IR used by GCC  Third you get conventions on how to find libraries  how to load them  see Java classloaders   As you go higher and higher in concepts you have new conventions that you consider as a given  That s why they didn t made it to the semantic versioning  They are implicit or collapsed in the major version  We could amend semantic versioning with  lt major gt - lt minor gt - lt patch gt - lt platform ABI gt   This is what is actually happening already  platform is already a rpm  dll  jar  JVM bytecode   war jvm web server   apk  2 11  specific Scala version  and so on  When you say APK you already talk about a specific ABI part of your API  API can be ported to different ABI The top level of an abstraction  the sources written against the highest API can be recompiled ported to any other lower level abstraction  Let s say I have some sources for rxscala  If the Scala tools are changed I can recompile them to that  If the JVM changes I could have automatic conversions from the old machine to the new one without bothering with the high level concepts  While porting might be difficult will help any other client  If a new operating system is created using a totally different assembler code a translator can be created  APIs ported across languages There are APIs that are ported in multiple languages like reactive streams  In general they define mappings to specific languages platforms  I would argue that the API is the master specification formally defined in human language or even a specific programming language  All the other  quot mappings quot  are ABI in a sense  else more API than the usual ABI  The same is happening with the REST interfaces

User · Answer

The ABI needs to be consistent between caller and callee to be certain that the call succeeds  Stack use  register use  end-of-routine stack pop  All these are the most important parts of the ABI

User · Answer

Linux shared library minimal runnable ABI example  In the context of shared libraries  the most important implication of  having a stable ABI  is that you don t need to recompile your programs after the library changes   So for example    if you are selling a shared library  you save your users the annoyance of recompiling everything that depends on your library for every new release if you are selling closed source program that depends on a shared library present in the user s distribution  you could release and test less prebuilts if you are certain that ABI is stable across certain versions of the target OS   This is specially important in the case of the C standard library  which many many programs in your system link to    Now I want to provide a minimal concrete runnable example of this   main c   include  lt assert h gt   include  lt stdlib h gt    include  mylib h   int main void        mylib mystruct  myobject   mylib init 1       assert myobject- gt old field    1       free myobject       return EXIT SUCCESS      mylib c   include  lt stdlib h gt    include  mylib h   mylib mystruct  mylib init int old field        mylib mystruct  myobject      myobject   malloc sizeof mylib mystruct        myobject- gt old field   old field      return myobject      mylib h   ifndef MYLIB H  define MYLIB H  typedef struct       int old field    mylib mystruct   mylib mystruct  mylib init int old field     endif   Compiles and runs fine with   cc  gcc -pedantic-errors -std c89 -Wall -Wextra   cc -fPIC -c -o mylib o mylib c  cc -L   -shared -o libmylib so mylib o  cc -L   -o main out main c -lmylib LD LIBRARY PATH     main out   Now  suppose that for v2 of the library  we want to add a new field to mylib mystruct called new field   If we added the field before old field as in   typedef struct       int new field      int old field    mylib mystruct    and rebuilt the library but not main out  then the assert fails   This is because the line   myobject- gt old field    1   had generated assembly that is trying to access the very first int of the struct  which is now new field instead of the expected old field   Therefore this change broke the ABI   If  however  we add new field after old field   typedef struct       int old field      int new field    mylib mystruct    then the old generated assembly still accesses the first int of the struct  and the program still works  because we kept the ABI stable   Here is a fully automated version of this example on GitHub   Another way to keep this ABI stable would have been to treat mylib mystruct as an opaque struct  and only access its fields through method helpers  This makes it easier to keep the ABI stable  but would incur a performance overhead as we d do more function calls   API vs ABI  In the previous example  it is interesting to note that adding the new field before old field  only broke the ABI  but not the API   What this means  is that if we had recompiled our main c program against the library  it would have worked regardless   We would also have broken the API however if we had changed for example the function signature   mylib mystruct  mylib init int old field  int new field     since in that case  main c would stop compiling altogether   Semantic API vs Programming API  We can also classify API changes in a third type  semantic changes   The semantic API  is usually a natural language description of what the API is supposed to do  usually included in the API documentation   It is therefore possible to break the semantic API without breaking the program build itself   For example  if we had modified  myobject- gt old field   old field    to   myobject- gt old field   old field   1    then this would have broken neither programming API  nor ABI  but main c the semantic API would break   There are two ways to programmatically check the contract API    test a bunch of corner cases  Easy to do  but you might always miss one  formal verification  Harder to do  but produces mathematical proof of correctness  essentially unifying documentation and tests into a  human    machine verifiable manner  As long as there isn t a bug in your formal description of course  -   This concept is closely related to the formalization of Mathematics itself  https   math stackexchange com questions 53969 what-does-formal-mean 3297537 3297537   List of everything that breaks C   C   shared library ABIs  TODO  find   create the ultimate list    https   github com lvc abi-compliance-checker automated tool to check it https   community kde org Policies Binary Compatibility Issues With C 2B 2B KDE C   ABI guidelines https   plan99 net  mike writing-shared-libraries html   Java minimal runnable example  What is binary compatibility in Java   Tested in Ubuntu 18 10  GCC 8 2 0

User · Answer

An application binary interface  ABI  is similar to an API  but the function is not accessible to the caller at source code level  Only a binary representation is accessible available   ABIs may be defined at the processor-architecture level or at the OS level  The ABIs are standards to be followed by the code-generator phase of the compiler  The standard is fixed either by the OS or by the processor   Functionality  Define the mechanism standard to make function calls independent of the implementation language or a specific compiler linker toolchain  Provide the mechanism which allows JNI  or a Python-C interface  etc   Existing entities  Functions in machine code form   Consumer  Another function  including one in another language  compiled by another compiler  or linked by another linker

User · Answer

You actually don t need an ABI at all if--   Your program doesn t have functions  and-- Your program is a single executable that is running alone  i e  an embedded system  where it s literally the only thing running and it doesn t need to talk to anything else    An oversimplified summary      API   Here are all the functions you may call        ABI   This is how to call a function     The ABI is set of rules that compilers and linkers adhere to in order to compile your program so that will work properly   ABIs cover multiple topics    Arguably the biggest and most important part of an ABI is the procedure call standard sometimes known as the  calling convention    Calling conventions standardize how  functions  are translated to assembly code  ABIs also dictate the how the names of exposed functions in libraries should be represented so that other code can call those libraries and know what arguments should be passed   This is called  name mangling   ABIs also dictate what type of data types can be used  how they must be aligned  and other low-level details    Taking a deeper look at calling convention  which I consider to be the core of an ABI   The machine itself has no concept of  functions    When you write a function in a high-level language like c  the compiler generates a line of assembly code like  MyFunction1    This is a label  which will eventually get resolved into an address by the assembler   This label marks the  start  of your  function  in the assembly code   In high-level code  when you  call  that function  what you re really doing is causing the CPU to jump to the address of that label and continue executing there   In preparation for the jump  the compiler must do a bunch of important stuff   The calling convention is like a checklist that the compiler follows to do all this stuff    First  the compiler inserts a little bit of assembly code to save the current address  so that when your  function  is done  the CPU can jump back to the right place and continue executing  Next  the compiler generates assembly code to pass the arguments    Some calling conventions dictate that arguments should be put on the stack  in a particular order of course   Other conventions dictate that the arguments should be put in particular registers  depending on their data types of course   Still other conventions dictate that a specific combination of stack and registers should be used   Of course  if there was anything important in those registers before  those values are now overwritten and lost forever  so some calling conventions may dictate that the compiler should save some of those registers prior to putting the arguments in them  Now the compiler inserts a jump instruction telling the CPU to go to that label it made previously   MyFunction1     At this point  you can consider the CPU to be  in  your  function   At the end of the function  the compiler puts some assembly code that will make the CPU write the return value in the correct place   The calling convention will dictate whether the return value should be put into a particular register  depending on its type   or on the stack  Now it s time for clean-up   The calling convention will dictate where the compiler places the cleanup assembly code    Some conventions say that the caller must clean up the stack   This means that after the  function  is done and the CPU jumps back to where it was before  the very next code to be executed should be some very specific cleanup code  Other conventions say that the some particular parts of the cleanup code should be at the end of the  function  before the jump back     There are many different ABIs   calling conventions   Some main ones are    For the x86 or x86-64 CPU  32-bit environment     CDECL STDCALL FASTCALL VECTORCALL THISCALL  For the x86-64 CPU  64-bit environment     SYSTEMV MSNATIVE VECTORCALL  For the ARM CPU  32-bit    AAPCS  For the ARM CPU  64-bit    AAPCS64    Here is a great page that actually shows the differences in the assembly generated when compiling for different ABIs   Another thing to mention is that an ABI isn t only relevant inside your program s executable module   It s also used by the linker to make sure your program calls library functions correctly   You have multiple shared libraries running on your computer  and as long as your compiler knows what ABI they each use  it can call functions from them properly without blowing up the stack   Your compiler understanding how to call library functions is extremely important   On a hosted platform  that is  one where an OS loads programs   your program can t even blink without making a kernel call

User · Answer

The term ABI is used to refer to two distinct but related concepts   When talking about compilers it refers to the rules used to translate from source-level constructs to binary constructs  How big are the data types  how does the stack work  how do I pass parameters to functions  which registers should be saved by the caller vs the callee   When talking about libraries it refers to the binary interface presented by a compiled library  This interface is the result of a number of factors including the source code of the library  the rules used by the compiler and in some cases definitions picked up from other libraries    Changes to a library can break the ABI without breaking the API  Consider for example a library with an interface like   void initfoo FOO   foo  int usefoo FOO   foo  int bar  void cleanupfoo FOO   foo    and the application programmer writes code like  int dostuffwithfoo int bar      FOO foo    initfoo  amp foo     int result   usefoo  amp foo bar    cleanupfoo  amp foo     return result      The application programmer doesn t care about the size or layout of FOO  but the application binary ends up with a hardcoded size of foo  If the library programmer adds an extra field to foo and someone uses the new library binary with the old application binary then the library may make out of bounds memory accesses   OTOH if the library author had designed their API like   FOO   newfoo void  int usefoo FOO   foo  int bar  void deletefoo  FOO   foo  int bar     and the application programmer writes code like  int dostuffwithfoo int bar      FOO   foo    foo   newfoo      int result   usefoo foo bar    deletefoo foo     return result      Then the application binary does not need to know anything about the structure of FOO  that can all be hidden inside the library  The price you pay for that though is that heap operations are involved

[compiler-construction] What is an application binary interface (ABI)?

Examples related to compiler-construction

Examples related to abi