PDR: Laboratory 8: x86 Assembly Language, part 1 (64 bit)

Go up to the Labs table of contents page

Objective

This lab is one of two labs meant to familiarize you with the process of writing, assembling, and linking assembly language code. The purposes of the in-lab and post-lab activities are to investigate how various C++ language features are implemented at the assembly level.

There are both 32 bit (md) and 64 bit (md) versions of this lab. This is the 64 bit version.

Background

The Intel x86 assembly language is currently one of the most popular assembly languages and runs on many architectures from the x86 line through the Pentium 4. It is a CISC instruction set that has been extended multiple times (e.g. MMX) into a larger instruction set. In 2004 it was extended to allow for a 64 bit memory space.

Tutorial

The tutorial for this lab is the following two readings on x86 and the C Calling convention.

Procedure

Pre-lab

Understand how to compile x86 assembly code
Review the sample vecsum.s program
Write a program in x86, mathlib.s, to compute the product of two numbers and power of two numbers
Create a C++ program, mathfun.cpp, to call the functions in mathlib.s
Ensure your code compiles on a x64 Linux machine
Files to download vecsum.s (src), main.cpp (src), Makefile (src)
Files to submit mathlib.s, mathfun.cpp, Makefile

In-lab

Write an x86 function to perform merge sort
Files to download: mergeSort.s (src), testMergeSort.cpp (src)
Files to submit: mergeSort.s, testMergeSort.cpp, Makefile

Post-lab

Write an x86 function to perform a linear search
Files to download: testLinearSearch.cpp (src)
Files to submit: testLinearSearch.cpp, linearSearch.s

Platform Architectures

There are a few differences when writing x86 assembly code, depending on whether you are using Linux or macOS. Specifically, the object file format for nasm will differ, as well as the subroutine name itself. The instructions in this lab will assume the use of Linux – if you are using macOS, use the table below for the correct instructions.

Platform	nasm flag	x86 subroutine name
64-bit Linux	-f elf64	`vecsum`
64 bit macOS	-f macho64	`_vecsum`

Some additional notes:

The subroutine name must be changed everywhere it appears in the assembly file, including in global
On Linux, you may need to install the g++-multilib command for compilation to succeed

IMPORTANT: When you submit your code, it MUST be in 64-bit Linux format.

Pre-lab

Compiling Assembly With C++

For this part, you will need to download three files: vecsum.s (src), main.cpp (src), and Makefile (src).

To compile a program written partly in x86 assembly and partly in C++, we have to build the program in parts. We build the C++ file as we have in the past:

clang++ -Wall -g -c main.cpp

Note that we used the -c flag, which tells the compiler to compile but not link the program. Linking it will create the final executable – but as there is not a vecsum() function defined (yet), the compiler will report an error stating that it does not know what the vecsum() function is. We also added a few more flags (-Wall -g) to print all warnings and compile debugging symbols into the program.

Next, we need to compile the assembly file. To do this, we enter the following:

nasm -f elf64 -g vecsum.s

This invokes nasm, which is the assembler that we are using for this course. -f elf64 tells nasm to output the object file using the ELF 64-bit format, which is specific to Linux (and would need to be changed to -f macho64 for macOS as detailed above). We also tell it to include debugging symbols via -g. The assembly file name is specified by the vecsum.s at the end of the command line.

Finally, we have to link the two files into the final executable. We do this as before:

clang++ -Wall -g vecsum.o main.o

This tells clang++ to link both of the .o files created above into an executable, which it called a.out. There isn’t any compiling done at this stage – this just links the two object files into the final executable.

Vecsum

Examine the vecsum subroutine in vecsum.s (src). Use the slides and readings to help understand what is happening in vecsum.s. Make sure you understand the prologue and epilogue implementation, as well as the instructions used in the subroutine.

Compile and run the vecsum program:

Use the tutorial as a guide, but see the instructions above.
If you forget the lldb commands described below, see the LLDB command summary, which has a summary of all of these commands.
You can find the assembly and C++ source code in the repository (vecsum.s (src), main.cpp (src), Makefile (src)). For the C++ code compilation (i.e. main.cpp) and the final link, use the -g flag, which allows the program to work well with the lldb debugger.
Use the debugger to step through the assembly code, view the register contents, and view the computer’s memory.
Set a breakpoint at the line in the main.cpp where the vecsum() function is called (probably line 38).
Normally, you would use the step function to step into the next instruction. However, since no debugging information was included with the assembler (a shortcoming of nasm), we can’t use step – it will just move to the next C++ instruction (the cout). Instead, we will use stepi, which will step exactly one assembly instruction, which is what we want.
To display the assembly code that is currently being executed, enter disassemble. This is just like list, but it displays the assembly code instead of the C++ code.
Note that this prints things in a different assembly format. To set the format to the style we are used to (and the style we are programming in with nasm), enter settings set target.x86-disassembly-flavor intel. Now enter disassemble again – the format should look more familiar. You only have to enter that set command once (unless you exit and re-enter lldb).
To see the vecsum function, enter disassemble --name vecsum. Note that this only lists the first third (or so) of the routine – up until the start label. To see the rest of the code, enter disassemble --name start, disassemble --name done, etc.
To show the contents of the registers, use the register read command.

Pre-lab program: mathlib.s

You will need to write two routines in assembly, one that computes the product of two numbers, and one that computes the power of two numbers.

The first subroutine will compute the product of the two integer parameters passed in. The restrictions are that it can only use addition, and thus cannot use a multiplication operation. We will assume that both of the parameters are positive integers. It must compute this iteratively, not recursively. The resulting product is then returned to the calling routine. This subroutine should be called product. We will assume that values will not be provided to the subroutine that will cause an overflow, nor will negative (or zero) parameters be passed in.

The second subroutine will compute the power of the two integer parameters passed in. We will assume that the first parameter is the base, and the second parameter is the exponent. Again, both are integers. The restrictions on this routine are that it can only use the multiplication routine described above – it cannot use imul or call any exponentiation routine. Furthermore, it must be defined recursively, not iteratively. This routine should be called power.

You can assume that the numbers passed into both routines will be non-negative. Furthermore, as described above, no values will be used on your program that could cause an integer overflow.

Both of these routines should be in a file called mathlib.s, and must use the proper C-style calling convention. You must also provide a mathfun.cpp file, which calls both of your subroutines – see the main.cpp file provided as a template. The program should take in ONLY two integers (we’ll call them x and y). It should then print out the output of calling product(x,y) and power(x,y). Thus, if the input is 3 and 4, it would print out 12 and 81.

Input is to be read via standard input (i.e., cin), not through command-line parameters.

In order for routines in an assembly file to be callable from C/C++, you need to declare them with global, like is done in vecsum.s. To have multiple routines in a single assembly file (as is needed for mathlib.s), you should have multiple global lines, one for each subroutine that you plan on calling from your C/C++ code.

Sample Execution Run

Below is a sample execution run to show you the input and output format we are looking for.

Enter integer 1: 3
Enter integer 2: 2
Product: 6
Power: 9

Once you have completed the pre-lab, submit mathlib.s, mathfun.cpp, and your Makefile

In-lab

Come to lab with a functioning version of the pre-lab, and be prepared to demonstrate that you understand how to build and run the pre-lab programs. If you are unsure about any part of the pre-lab, talk to a TA. You should be able to explain and write recursive functions in assembly for the exams in this course, so make sure that you understand how to implement the pre-lab program.

Before starting the in-lab, make sure you read and understand the book chapters on the C calling convention. For the in-lab, you will be implementing merge sort in x86 assembly. We have provided the helper function merge in mergeSort.s. Note: merge makes use of two caller-saved registers, r10 and r11. Remember to save and restore these registers before and after calling merge.

Download mergeSort.s (src), as well as testMergeSort.cpp (src), which you will use to test your code. Make sure you do not alter testMergeSort.cpp, as you must include the original file in your submission.

Your task for the in-lab is to implement the mergeSort function in mergeSort.s. This function takes in three parameters. The first parameter is a pointer to an int array. The second parameter is an integer corresponding to the left index in the array. The third parameter is an integer corresponding to the right index in the array. The return type of this function is void, and it modifies the original array. You may assume the size of the array is nonzero. When testing your function using testMergeSort.cpp, input will be read via standard input, not through command-line parameters. After entering the array size, you will be prompted to enter each element one by one. This test file will call your mergeSort function on the array, and print the result. Make sure you test your function on arrays of various sizes.

Sample Execution Run

Below is a sample execution run to show you the input and output format we are looking for.

Enter the array size: 5
Enter value 0: -7
Enter value 1: 2
Enter value 2: -39
Enter value 3: 12
Enter value 4: 8
Unsorted array: -7 2 -39 12 8
Sorted array: -39 -7 2 8 12

The following resource explains the merge sort algorithm. This is what you need to implement in x86 assembly: www.hackerearth.com/practice/algorithms/sorting/merge-sort/tutorial/

Once you have completed the in-lab, submit mergeSort.s, testMergeSort.cpp, and your Makefile.

Post-lab

Download testLinearSearch.cpp (src), which you will use to test your code. Make sure you do not alter testLinearSearch.cpp, as you must include the original file in your submission.

Your task for the post-lab is to implement the linearSearch function in assembly. This function will scan through an array from left to right iteratively until it finds the target element or reaches the end of the array. The function takes in three parameters. The first parameter is a pointer to an int array. The second parameter is an integer representing the size of the array. The third parameter is an integer representing the target element to find in the array. The return type of this fuction is int, and will be the index into the array that the target was found, or -1 if it wasn’t. Just like with the testMergeSort program in the in-lab, testLinearSearch will take input from std and pass it on to your linearSearch routine. Make sure you test your function on various sized arrays, both sorted and unsorted.

Sample Execution Run

Below is a sample execution run to show you the input and output format we are looking for.

Enter the array size: 5
Enter value 0: -7
Enter value 1: 2
Enter value 2: -39
Enter value 3: 12
Enter value 4: 8
Enter target to search for: 2

Found 2 at index 1

Once you have completed the in-lab, submit linearSearch.s, testLinearSearch.cpp, and your Makefile

Hints

Accessing Array Elements in Assembly

Recall that C++ arrays are stored contiguously in memory. This means that if you know the start address of the array &a, and the size of the elements that are being stored, you can find the address of an element at any index i with &a[i] = &a + <size_of_elements>*i (if the array is one-dimensional). Use this fact along with the memory addressing slide from lecture to access array elements in assembly.

PDR: Laboratory 8: x86 Assembly Language, part 1 (64 bit)

Objective

Background

Tutorial

Recommended Readings

Procedure

Pre-lab

In-lab

Post-lab

Platform Architectures

Pre-lab

Compiling Assembly With C++

Vecsum

Pre-lab program: mathlib.s

Sample Execution Run

In-lab

Sample Execution Run

Post-lab

Sample Execution Run

Hints

Accessing Array Elements in Assembly