TweetFollow Us on Twitter

May 94 Cornfield
Volume Number:10
Issue Number:5
Column Tag:From The Corn Field

Thoughts From The Cornfield

Provocative, perhaps inflammatory, but just say no to assembly language on PowerPC

By Steve Kiene, MindVision Software, Lincoln, Nebraska

About the author

Steve, author of things like Stacker for Macintosh, cares about performance, code size, performance, portability, and performance as much as anyone we know (well, there’s always Mike Scanlin, too). Steve’s recently worked through a number of issues about porting to the PowerPC for performance, and along the way surprised himself with his conclusions. He’s curious about your reaction, so please let us know if they surprise you, too. I can just see our assistant Al holding up a placard with this in big letters: editorial@xplain.com

Writing code in assembly language instead of a high-level language to get performance is fast becoming an historic anachronism. The fierce competition in the 90’s leads to time-to-market battles that cannot be won by the company that insists on writing large chunks of their product in assembly language.

I’ve seen plenty of code whose authors have spent an inordinate amount of time tweaking assembly language instructions to get the most speed out of the code when the real problem was a slow algorithm. It’s the old problem of not seeing the forest for the trees. Careful examination of the algorithm offers more potential for improved performance than coding a bad algorithm in tightly-tuned assembly language. This generally holds true even when the improved algorithm is coded in C.

I took some code a friend had written; he had spent weeks hand-tuning assembly code. I re-examined the algorithms and found a better way to do it. I coded it up in C and the new C code ran fifty times faster on the 68K than the assembly solution had before. Now, it not only performs, it’s portable and more maintainable. After simply recompiling the code for Power Macintosh, its speed doubled. To convert the assembly code would have taken at least a couple of weeks for someone proficient not only in writing PowerPC assembly code, but also good at scheduling the assembly instructions to keep the chip as busy as possible.

Now, with all that said, there are reasons for writing PowerPC assembly language. So, if you have to write part of your program in assembly, make sure it’s the right part and be completely sure you cannot increase the speed by improving the algorithm. There’s little sense in writing assembly language for code that only amounted to 4% of the execution time, but it’s not that hard to find programs that do just that. What was gained by writing the code in assembly language rather than a high-level language?

The release of the Power Macintosh machines has sent many 68K assembly language programmers scrambling to learn the new architecture and its assembly language so that they can continue to performance-program the Macintosh. However, as they are finding out, programming in PowerPC assembly language is much harder than on the 68K Macintosh.

I’ve seen several examples of PowerPC assembly language code that at first glance looks fast but after careful examination the code turns out to run slower than expected. RISCprocessors require understanding the architecture of both the CPU and the memory bus to get good performance, and it’s simply difficult to keep all of the rules and constraints in your head while trying to be creative and write code. Compilers, on the other hand, just don’t care how many rules they have to remember.

Reasons to avoid assembly language

(1) Assembly language code is not easily ported to different instruction set architectures. There are tools which will port 68K assembly to PowerPC assembly, but you run the risk that the architectures are so different a port doesn’t get you the full potential of the new architecture.

(2) Code can be written in a shorter amount of time in a high level language than it can be in assembly. People want to argue this, claiming that bit manipulation routines are too hard to do in C, but it’s just not true. I suspect that if they knew C as well as they knew assembler there would be little or no argument.

(3) It is far easier to make mistakes in assembly than it is in a high level language. High level languages offer abstraction and structure which makes many common assembly language problems simply non-existent.

(4) Code written in assembly is harder to maintain both for the original programmer as well as a different programmer. Because of the fine-grain control you get with assembly language, it is not always easy to follow the flow of the code.

(5) The development tools available for writing assembly language are not advancing at the same rate as those for high level languages. In fact, there are many situations where the tools are getting worse. Apple’s PowerPC Assembler for MPW is not nearly as sophisticated as their 68K Assembler.

Reasons to use assembly language

(1) Highly time-critical code, such as software which interfaces with a piece of hardware which has very specific timing dependencies. Not very common.

(2) Code where space is at a minimum, such as embedded controllers. Generally not applicable to the Macintosh.

(3) Code that is proven to be an unacceptable bottleneck in a specific task.

(4) Places where parameters are passed in specific locations that are not easily accessible to a high level language. [Between the PowerPCruntime architecture, and the protocol conversion that Mixed Mode does for you, this problem essentially goes away on the Power Macintosh - Ed stb]

In all of these instances, there is a need for assembly only in specific places in the code. There is no need to code large parts in assembly.

How to speed up your code - the old way

The most common way to speed up existing code is to find the parts of the program that are slow and rewrite them in assembly. In the past that may have been a good way to gain more speed. Today, that model is not only outdated, it can backfire. I’ve seen people rewrite their code in PowerPC assembly language only to see it run SLOWER. Do not assume you know more about the processor architecture than the compiler. Unless you understand the instruction scheduling of the processor entirely, you probably can’t out-do a good compiler.

How to speed up your code - the new way

Determine which parts of your program are used the most. If a particular feature takes several minutes to run but is only used once a month, maybe it’s not as important as features which takes ten seconds but are used every five minutes. Watch your customers’ usage patterns. Ask them which parts of the program are annoyingly slow. Ask them why they think those parts are slow. Remember, slowness is subjective. What is slow to a power user may seem perfectly fine to a novice user. Who uses your product, the novice or the power user?

Once you have identified the areas of your software that seem slow, you may want to back up the results with scientific data. Run performance analysis tools to see exactly where in the code things are slow when you perform the tasks that users said were slow. THINK C and CodeWarrior have performance monitoring tools included that work well. MPW has its own performance tools which are adequate. If you are writing code that is not easily interfaced to these tools, I recommend you look at the source code provided for the performance tools in THINK C. It is very easy to adapt this code to monitor the performance of any piece of code.

One thing to remember is that the performance of your software may differ greatly when comparing Power Macintosh to the 68K Macintosh. Performance may also vary quite a bit between specific Macintosh models. Machines with a 32 bit data bus will perform memory intensive operations much faster than machines with a 16 bit data bus.

Now that you have figured out which parts of your program are slow, it is time to decide how to make them faster. The first thing to do is to examine the underlying algorithms of the code. Is there anything fundamental that you can do to speed things up? For example, if you are performing a text search, how do you search through the text? Do you use Munger? Perhaps something like a Boyer-Moore algorithm would be much faster. Remember, the key is to work smarter. Brute force is not the answer - it’s a matter of brains over brawn.

Sometimes simply a small change to your existing algorithm will make things much faster. I sped up a search algorithm I wrote years ago by a factor of three by simply adding two lines of code. Look at your algorithm and examine how it operates with common data that goes through it. Perhaps certain shortcuts can be taken when the most common data runs through it.

If you don’t have many books on fundamental computer algorithms, now is the time to stock up. I am a firm believer that you cannot have enough books on algorithms. At the end of this article I have listed several books that will help broaden and round out your algorithm skills. I highly recommend all of them.

Once you have analyzed the specific parts of your program that are bottlenecks, it is time to look at the architecture of your program as a whole. If your program is rather large you may want to look at it as several modules working together.

Is the underlying architecture of your program going to be a bottleneck? Are there time consuming tasks that can be done in the background at idle time rather than being done while the user must wait? Are you doing network communication synchronously when you could do it asynchronously and give the user their machine back? Are there tasks that need to be performed but don’t need to give immediate feedback to the user? These kinds of tasks are good candidates for idle time processing, additional user feedback, modeless dialog boxes, asynchronous programming, and other methods of helping the user feel as if they are not waiting on your program, or at least aren’t prevented from doing something else while you get your thing done. If you keep the user occupied or help them feel productive while your program is working, they’ll be more patient with whatever performance you have.

How to write the code in Assembly Language

If, after careful examination, you have determined a bottleneck in your program, analyzed the algorithms as best you can, rewritten them to be as efficient as possible, and still it is not fast enough, perhaps it is time to code a small part in assembly. The best place to start is to disassemble compiler- generated code for the routine you want to code in assembly. Look at the code. What is inefficient about it? Are registers constantly being reloaded? Are the registers being used efficiently? Are the instructions scheduled for maximum pipelining? Very often you can take the disassembled code, make a few minor modifications to it and see a very nice speed increase.

Perform accurate timing tests on the code you are optimizing. Unless you completely understand the PowerPC Architecture Manual and the PowerPC 601 User’s Guide, more often than not you will make PowerPC code slower than a good compiler. The bottom line is that it must run faster, not look faster.

Maintain an exact high-level equivalent of the assembly code, and keep it right there in the same file. This way if you port your code to a different architecture, you’ve got what you need to get up and running quickly. In many cases the bottleneck on one machine will not be a bottleneck on another.

In Conclusion

This article has discussed some alternate methods of speeding up your program execution that are in many ways better than traditional methods used by many programmers. The goal is to maximize your gain and minimize your effort. By working smarter rather than harder, you can have a faster program in less time.

Recommended Books

[1] Thomas H. Cormen, Charles E. Leiserson, and Ronald L. Rivest. Introduction to Algorithms. MIT Press, 1990.

[2] Alfred V. Aho, John E. Hopcroft, and Jeffrey D. Ullman. The Design and Analysis of Computer Algorithms. Addison-Wesley, 1974.

[3] Saumyendra Sengupta and Paul Edwards. Data Structures in ANSI C. Academic Press, 1991.

[4] Donald Knuth. The Art of Computer Programming, Volumes 1-3. Addison-Wesley, 1973

[5] Daniel H. Greene and Donald E. Knuth. Mathematics for the Analysis of Algorithms., Third Edition Birkhäuser, 1990.

[6] P. D. Eastman. Go, Dog, Go! Random House, 1961.

[7] Manoochehr Azmoodeh. Abstract Data Types and Algorithms, Second Edition. Macmillan, 1990.

 

Community Search:
MacTech Search:

Software Updates via MacUpdate

Tor Browser 12.5.5 - Anonymize Web brows...
Using Tor Browser you can protect yourself against tracking, surveillance, and censorship. Tor was originally designed, implemented, and deployed as a third-generation onion-routing project of the U.... Read more
Malwarebytes 4.21.9.5141 - Adware remova...
Malwarebytes (was AdwareMedic) helps you get your Mac experience back. Malwarebytes scans for and removes code that degrades system performance or attacks your system. Making your Mac once again your... Read more
TinkerTool 9.5 - Expanded preference set...
TinkerTool is an application that gives you access to additional preference settings Apple has built into Mac OS X. This allows to activate hidden features in the operating system and in some of the... Read more
Paragon NTFS 15.11.839 - Provides full r...
Paragon NTFS breaks down the barriers between Windows and macOS. Paragon NTFS effectively solves the communication problems between the Mac system and NTFS. Write, edit, copy, move, delete files on... Read more
Apple Safari 17 - Apple's Web brows...
Apple Safari is Apple's web browser that comes bundled with the most recent macOS. Safari is faster and more energy efficient than other browsers, so sites are more responsive and your notebook... Read more
Firefox 118.0 - Fast, safe Web browser.
Firefox offers a fast, safe Web browsing experience. Browse quickly, securely, and effortlessly. With its industry-leading features, Firefox is the choice of Web development professionals and casual... Read more
ClamXAV 3.6.1 - Virus checker based on C...
ClamXAV is a popular virus checker for OS X. Time to take control ClamXAV keeps threats at bay and puts you firmly in charge of your Mac’s security. Scan a specific file or your entire hard drive.... Read more
SuperDuper! 3.8 - Advanced disk cloning/...
SuperDuper! is an advanced, yet easy to use disk copying program. It can, of course, make a straight copy, or "clone" - useful when you want to move all your data from one machine to another, or do a... Read more
Alfred 5.1.3 - Quick launcher for apps a...
Alfred is an award-winning productivity application for OS X. Alfred saves you time when you search for files online or on your Mac. Be more productive with hotkeys, keywords, and file actions at... Read more
Sketch 98.3 - Design app for UX/UI for i...
Sketch is an innovative and fresh look at vector drawing. Its intentionally minimalist design is based upon a drawing space of unlimited size and layers, free of palettes, panels, menus, windows, and... Read more

Latest Forum Discussions

See All

Listener Emails and the iPhone 15! – The...
In this week’s episode of The TouchArcade Show we finally get to a backlog of emails that have been hanging out in our inbox for, oh, about a month or so. We love getting emails as they always lead to interesting discussion about a variety of topics... | Read more »
TouchArcade Game of the Week: ‘Cypher 00...
This doesn’t happen too often, but occasionally there will be an Apple Arcade game that I adore so much I just have to pick it as the Game of the Week. Well, here we are, and Cypher 007 is one of those games. The big key point here is that Cypher... | Read more »
SwitchArcade Round-Up: ‘EA Sports FC 24’...
Hello gentle readers, and welcome to the SwitchArcade Round-Up for September 29th, 2023. In today’s article, we’ve got a ton of news to go over. Just a lot going on today, I suppose. After that, there are quite a few new releases to look at... | Read more »
‘Storyteller’ Mobile Review – Perfect fo...
I first played Daniel Benmergui’s Storyteller (Free) through its Nintendo Switch and Steam releases. Read my original review of it here. Since then, a lot of friends who played the game enjoyed it, but thought it was overpriced given the short... | Read more »
An Interview with the Legendary Yu Suzuk...
One of the cool things about my job is that every once in a while, I get to talk to the people behind the games. It’s always a pleasure. Well, today we have a really special one for you, dear friends. Mr. Yu Suzuki of Ys Net, the force behind such... | Read more »
New ‘Marvel Snap’ Update Has Balance Adj...
As we wait for the information on the new season to drop, we shall have to content ourselves with looking at the latest update to Marvel Snap (Free). It’s just a balance update, but it makes some very big changes that combined with the arrival of... | Read more »
‘Honkai Star Rail’ Version 1.4 Update Re...
At Sony’s recently-aired presentation, HoYoverse announced the Honkai Star Rail (Free) PS5 release date. Most people speculated that the next major update would arrive alongside the PS5 release. | Read more »
‘Omniheroes’ Major Update “Tide’s Cadenc...
What secrets do the depths of the sea hold? Omniheroes is revealing the mysteries of the deep with its latest “Tide’s Cadence" update, where you can look forward to scoring a free Valkyrie and limited skin among other login rewards like the 2nd... | Read more »
Recruit yourself some run-and-gun royalt...
It is always nice to see the return of a series that has lost a bit of its global staying power, and thanks to Lilith Games' latest collaboration, Warpath will be playing host the the run-and-gun legend that is Metal Slug 3. [Read more] | Read more »
‘The Elder Scrolls: Castles’ Is Availabl...
Back when Fallout Shelter (Free) released on mobile, and eventually hit consoles and PC, I didn’t think it would lead to something similar for The Elder Scrolls, but here we are. The Elder Scrolls: Castles is a new simulation game from Bethesda... | Read more »

Price Scanner via MacPrices.net

Clearance M1 Max Mac Studio available today a...
Apple has clearance M1 Max Mac Studios available in their Certified Refurbished store for $270 off original MSRP. Each Mac Studio comes with Apple’s one-year warranty, and shipping is free: – Mac... Read more
Apple continues to offer 24-inch iMacs for up...
Apple has a full range of 24-inch M1 iMacs available today in their Certified Refurbished store. Models are available starting at only $1099 and range up to $260 off original MSRP. Each iMac is in... Read more
Final weekend for Apple’s 2023 Back to School...
This is the final weekend for Apple’s Back to School Promotion 2023. It remains active until Monday, October 2nd. Education customers receive a free $150 Apple Gift Card with the purchase of a new... Read more
Apple drops prices on refurbished 13-inch M2...
Apple has dropped prices on standard-configuration 13″ M2 MacBook Pros, Certified Refurbished, to as low as $1099 and ranging up to $230 off MSRP. These are the cheapest 13″ M2 MacBook Pros for sale... Read more
14-inch M2 Max MacBook Pro on sale for $300 o...
B&H Photo has the Space Gray 14″ 30-Core GPU M2 Max MacBook Pro in stock and on sale today for $2799 including free 1-2 day shipping. Their price is $300 off Apple’s MSRP, and it’s the lowest... Read more
Apple is now selling Certified Refurbished M2...
Apple has added a full line of standard-configuration M2 Max and M2 Ultra Mac Studios available in their Certified Refurbished section starting at only $1699 and ranging up to $600 off MSRP. Each Mac... Read more
New sale: 13-inch M2 MacBook Airs starting at...
B&H Photo has 13″ MacBook Airs with M2 CPUs in stock today and on sale for $200 off Apple’s MSRP with prices available starting at only $899. Free 1-2 day delivery is available to most US... Read more
Apple has all 15-inch M2 MacBook Airs in stoc...
Apple has Certified Refurbished 15″ M2 MacBook Airs in stock today starting at only $1099 and ranging up to $230 off MSRP. These are the cheapest M2-powered 15″ MacBook Airs for sale today at Apple.... Read more
In stock: Clearance M1 Ultra Mac Studios for...
Apple has clearance M1 Ultra Mac Studios available in their Certified Refurbished store for $540 off original MSRP. Each Mac Studio comes with Apple’s one-year warranty, and shipping is free: – Mac... Read more
Back on sale: Apple’s M2 Mac minis for $100 o...
B&H Photo has Apple’s M2-powered Mac minis back in stock and on sale today for $100 off MSRP. Free 1-2 day shipping is available for most US addresses: – Mac mini M2/256GB SSD: $499, save $100 –... Read more

Jobs Board

Licensed Dental Hygienist - *Apple* River -...
Park Dental Apple River in Somerset, WI is seeking a compassionate, professional Dental Hygienist to join our team-oriented practice. COMPETITIVE PAY AND SIGN-ON Read more
Sublease Associate Optometrist- *Apple* Val...
Sublease Associate Optometrist- Apple Valley, CA- Target Optical Date: Sep 30, 2023 Brand: Target Optical Location: Apple Valley, CA, US, 92307 **Requisition Read more
*Apple* / Mac Administrator - JAMF - Amentum...
Amentum is seeking an ** Apple / Mac Administrator - JAMF** to provide support with the Apple Ecosystem to include hardware and software to join our team and Read more
Child Care Teacher - Glenda Drive/ *Apple* V...
Child Care Teacher - Glenda Drive/ Apple ValleyTeacher Share by Email Share on LinkedIn Share on Twitter Read more
Cashier - *Apple* Blossom Mall - JCPenney (...
Cashier - Apple Blossom Mall Location:Winchester, VA, United States (https://jobs.jcp.com/jobs/location/191170/winchester-va-united-states) - Apple Blossom Mall Read more
All contents are Copyright 1984-2011 by Xplain Corporation. All rights reserved. Theme designed by Icreon.