TweetFollow Us on Twitter

Jul 94 Dialogue Box
Volume Number:10
Issue Number:7
Column Tag:Dialogue Box

Dialogue Box

By Scott T Boyd, Editor

In The Cornfield == In The Weeds?

You raised many fine points about the need (or lack thereof) to write assembly language for Power PC. I would have appreciated more detail to illustrate the points you made. For instance, I would like to know what the two lines of code that sped up the search algorithm by a factor of three did, in general. Also, a simple example of Power PC code you saw rewritten by someone else where the rewrite ran slower. Or, an example where a Power PC rewrite looks faster but did not run faster. Some of your points raised big question marks for me.

The article as a whole appeared directed toward the C programmer, but very little of how to write C that is compiled to good Power PC code was addressed.

A cursory examination of the PowerPC instruction set shows that it is not a C machine; there are instructions that do not match well to C, as well as C constructs that do not map well to PowerPC instructions. For instance, the expression: unsigned char c = f; where f is a float compiles to over 20 instructions on an RS6000 using the xlc compiler with the O3 optimization (the highest level), regardless of the rounding convention chosen. This is in part because there are no instructions that move data between the general purpose registers and the floating point registers, and in part because both the unsigned char data type and the float data type, while supported on the Power PC in theory, are not natural data types; at least the compiler does not think so.

Another example true for both 68K and Power PC is that while the processors make overflow detection easy, the C language does not provide any natural method to write code that detects arithmetic overflow in integer arithmetic. Thus, while 68K and Power PC both have instructions to support multiple word precision arithmetic easily, writing it portably is not so easy (although it is possible; QuickDraw GX supplied PowerPC with C implementations of 64-bit-wide numbers thanks to the magic of Apple engineer Rob Johnson).

You mention that bit manipulation is too hard to do in C, etc., claiming that if the arguer knew C well, there would be little or no argument. Not always so; for instance, finding the first set bit in a register is a single instruction for both PowerPC and 68K, but I think you’ll have a hard time generating a C construct to generate that instruction. Similarly, I challenge you to generate C constructs for the 68K instructions BFINS or BFEXT, or the more complicated cases of the PowerPC instruction rlwimi; for instance, some compilers will not generate a simple bit rotate no matter how clever your C is (MPW C will, but only because of requests on my part).

Sometimes the best way around these sorts of problems is to use a standard C library, since the compiler author is likely to make sure that at least the standard library calls generate the correct instructions. For instance, the fabs() function will generate the corresponding instruction on the xlc compiler. But, I tried to get the compiler to generate the fnabs instruction (negative absolute value of a double) using various permutations of

-(a > 0 ? a : -a) 

by moving the negation inside the expression and reversing the conditional, and not only did it never generate the correct instruction, each time it generated a different set of instructions for each of the identical permutations!

To suggest that the compiler is always smarter than the programmer is a bit naive. While I do not doubt that this can be the case, I suggest that a thorough understanding of the instruction set and the experimental knowledge gained from attempting to write C constructs to generate those instructions can go a long way towards high end program tuning. For instance, I had no trouble getting both MetroWerks and xlc to generate a single instruction out of the line: d = -(d * d + d); where d is a double. At first, reading your article seemed to support the argument that I hear that “all engineers should write C as well as they can without regard to the assembly that is produced, because the compiler will always be smarter than they are, and because they want to be portable, etc”. It is fine to drive a car without understanding how the engine works, but a little less savory to drive one knowing that the designer did not understand the same. In order to become expert at computer programming, you have to understand how computers work, including dirty assembly. It is difficult to gain that understanding without ever having written a line of it.

I am not suggesting writing the next version of MacWorks in assembly; far from it. I just finished working on QuickDraw GX, and the native PowerPC version has nearly no assembly at all; the 68K version has maybe 1%, and all assembly has portable C equivalents. But as I consider graphics algorithms for the next version, I immediately consider what assembly best implements the algorithms, and how that defines the high level representation for those algorithms. I rely on my experience writing very large projects completely in assembly, and expect those working with me to have a similar depth of knowledge.

Further on in the article, I get the feeling that you do expect the reader to understand assembly. But rather than heading the paragraph “How to Write the Code in Assembly Language”, I suggest “How to Verify the Algorithm Written in a High Level Language.” Disassemble it. Understand what you are asking the computer to do. In the PowerPC case, important concepts include leaf node routines, how floats and integers work, register conventions, C switch statements (not cheap because of the architecture), to mention a few.

Finally, I disagree that PowerPC assembly language is more difficult than 68K. I suggest it is just different. 68K code can slow down or speed up by a factor of 2 depending on how it is loaded in the cache, and another factor of 2 if the data is misaligned; you won’t detect either easily by looking at a code listing. While the scheduling and pipe-lining issues in PowerPC complicate writing good assembly, there are not a lot of rules to learn; the simplified memory model and uniformity of instruction latency makes it actually much easier than the cycle counting I used to do for 68K.

Well, I thought I was done, but I disagree with one more thing: your assessment that THINK C has adequate performance tools. By this I assume that you mean the profiler option. First, it is not easy to adopt this to monitor the performance of any piece of code (as you suggest) since the compiler must generate special callouts in the body of the code for the profiler to work at all. Secondly, it works unmodified using Ticks, not the Microsecond timer; coarse by any measure. Third, when adapted to use the Microsecond timer, the software must be calibrated to eliminate the time used by the profiler code itself. Without doing this, the software can’t tell the difference between one function callout and two. Fourth, even with the corrections, I have found that I need to either remove all network connections or turn off interrupts on my IIfx (slow enough for microseconds to be useful) to get accurate timings. And you can’t just jam a register on a PowerPC to turn off interrupts. Lastly, I have to rewrite the printouts to get useful info; the THINK standard ones just aren’t very good. Not a big deal, but work nonetheless.

I hope I wasn’t being too big of an pain in the backside with everything written above; I liked the article. And, if I am wrong about any of the points in my rebuttal, please do not hesitate to correct my shortcomings.

- Cary Clark, Apple Computer, Inc.

More 601 Assembly Feedback

I would like to comment on the prevailing “propaganda” regarding assembly language and the PowerPC. Every time I hear it, I feel that my intelligence is being insulted.

It has been expressed (primarily by Apple, but also by Metrowerks) that trying to use assembly language on the PowerPC is a bad idea. They give a number of reasons including the difficulty of porting, the difficulty of optimizing, the need to adapt to different PowerPC implementations and the quality of existing compilers. I am told that I couldn’t match the speed of compiled C/C++ code even if I tried.

In fact I agree with their reasons and have often generated optimal code simply by breaking long expressions into many pieces, using extra variables to hold intermediate values and reordering the statements to schedule well on the PowerPC.

However, there is a important point that everyone seems to be missing! The PowerPC architecture defines an instruction set that is significantly larger than what the compilers actually use. For example, compile:

x = (x << 31) + (x >> 1)// Rotate right one bit

and you will probably get at least three instructions, when one (a right longword rotate by one bit) would suffice on both the 680x0 and the PowerPC. No compiler I have ever seen (and I’ve seen quite a few) generates rotate instructions.

There are numerous other examples, including absolute value, multiply long high word, add/subtract with extend, and count leading zeros. All are useful in specialized compute-intensive operations. In addition, certain “no-op” instructions are necessary on the PowerPC 601 to keep the pipeline running at full speed.

Of course, none of them correspond to built-in operations of the C, C++, or Pascal language -- and that’s the real reason we need assembler. In fact, it’s one of the reasons that inline assembler is part of the developing ANSI C++ specification.

I would appreciate it if a more generous attitude were taken towards assembly language in the future.

- Robert P. Munafo, Malden, MA

Sometimes our mailbox gets to be a bit interactive. In a followup letter, Robert added

Thank you for your reply. I feel a lot better after hearing what you had to say about the use of assembler at Apple. [Ibasically said that most everyone was writing system software in C, with a handful of exceptions, like Mixed Mode, and the emulator - Ed stb]

I was trying not to respond exclusively to Steve’s article (“Thoughts from the Cornfield”, MacTech vol. 10, No. 5) but to each of the other times Assembler has been discouraged and the general “C++ is now sufficient for everything” propaganda. For example:

• Same issue, p. 68, column 1, last ¶

• Vol. 10, No. 3 Page 82 column 1, ¶ 3

• The place in the CodeWarrior manual where they explain why inline assembler for PowerPC is not yet supported.

I do appreciate the section on “How to write the code in assembly language” inasmuch as it points out the advantages of using the compiler’s output as a first step, and benchmarking your efforts to see if they really speed things up, etc.

I think that even if you are not going to write any assembler, you still need to be rather familiar with the operation of the PowerPC chip in order to optimize your C or C++ code.

It would be very interesting indeed if compilers could be improved a bit as a result of this discussion! Even GCC and G++, (the GNU compilers) which are widely regarded as pretty much the best thing going, do not generate rotate instructions. However, it is quite easy to generate rotate instructions if you have already implemented a peephole optimizer.

- Robert Munafo

You Can Beat Mpw’s Code Generator

You must have been up late for you to have thought “people like Symantec’s quick turnaround tools and MPW’s code generation.” It’s OK, ’cuz we know it was probably a simple case of getting the object files swapped - and, alas, it was very late. <grin> I think MPW’s code generation is brain-dead; Think’s code is significantly more intelligent. Is it not smaller, too?

Thanks for mentioning your visit to the Software Developer’s Conference. Going behind enemy lines is good for our side. Maybe as a result you will broaden the horizons of Mac tool developers; causing the number and the quality of Mac development tools to grow. Miracles are possible...

- dk smith, mtn. view, ca

I must have been asleep when the balance changed. Besides, I never said that MPW’s code generation wasn’t brain-dead - Ed stb

But You Can’t Beat A Good Algorithm

Your article on performance and the misconceptions about the use of assembly (Thoughts from the Cornfield, May 1994) was excellent. The importance of a good algorithm cannot be overlooked. The thought of thousands of lines of 68K code being ported to run native on the PowerPC architecture is scary - there's a lot of Toolbox/OS code in this category. Let’s hope those porting 68K code to C don’t blindly port the assembly language program's structure, interfaces, and algorithms. This could waste the PowerPC’s speed and nullify our hardware performance advantages over Pentium.

- dk smith, mtn. view, ca

Free the SDKs

This is an open letter to decision makers at Apple in which I request that the policy of charging extra for crucial SDKs be discontinued.

Why are SDKs important? Software Development Kits (SDKs) contain critical information that enables developers to support specific components of the Macintosh OS and User Interface.

As a small developer, Ifind it difficult enough to budget for necessities like E.T.O., the Developer Program, and WWDC, but the added pain of buying an SDK for every feature Iwant to add to my application is burdensome in terms both of time and money.

Apple is fond of comparing itself to Microsoft. So let’s make a rough comparison of the yearly cost of the Apple and Microsoft developer programs, and see how Apple’s policy of charging for SDKs dramatically alters the cost equation.

The comparisons assume I want:

1. Access to all the SDKs I’ll need for a platform.

2. Programming tools to use the SDKs.

3. System software to test with.

4. Programmer’s documentation.

APPLE

Associates Program (& dev. CDs) $350 Yearly

E.T.O (development tools) $400 Yearly

(initial $1295) ---

$750 Base

Add a few SDKs (a sampling for reference only):

QuickTime SDK $195

Easy Open SDK $150

AppleScript SDK $199

---

“Some SDKs” $544

---

Add it all up to get $1294 Total

MICROSOFT

Developer Network Level 2 $495 yearly

Visual C++ $99 update

initial $599 ---

$594 Total

I’ll point out areas in which the above comparison favors Apple:

1. As SDKs are updated, Apple charges update fees in the ~$100 range per SDK (reference AppleScript SDK and QuickTime SDK). So though I’ve listed Apple’s initial SDK charge, there’s a recurring cost component as well. Microsoft ships updates to SDKs on each quarterly Developer Network CD. Apple forces developers to purchase these updates (once they realize something is not up-to-date).

2. We’ll also ignore the fact that Microsoft ships the Windows programming documentation with Visual C++. Apple developers need to spend untold extra $$$ hundreds $$$ for $Inside $Mac.

3. The items bought above from Apple do not guarantee access to all SDKs and all copies of system software that we need from Apple. The list above contains only a few. I’m omitting some little things...like AOCE :-). The items from Microsoft include ALL WINDOWS SDKs and OS versions. Including NT. And soon Chicago. It’s coming...

So what am I bitching about, and why?

My major gripe is that Apple’s policy of charging for SDKs adds substantial cost and time overhead to Macintosh development that hinders support of new features.

Not only do I need to spend the money to purchase the additional SDKs, but I need to take the time to order from APDA and wait for the material before I can start to implement new features. This extra pain makes it less likely that I (or others) will add support for new system SW features.

What’s the point, Apple?

Do you want us to support new System 7.x features like scripting, QuickTime, Easy Open?

Are you trying to fund development by charging developers for SDKs?

Wouldn’t you rather do everything you can to encourage developers to continue to support the Mac and to add Macintosh-specific features to cross-platform programs in order to maintain the shrinking differentiation between the platforms so that you can maintain or improve market share?

I’ve heard folks from Apple argue that the charge for SDKs is simply to cover the cost of their delivery. Surely you’re joking, Mr. Spindler!!! Apple claimed that the E.T.O. (and/or, at one time or another, The Developer CD Series) was the one-stop source for developer tools and information for Macintosh development. Okay, we paid for it. But where are those SDKs? Surely this is the place?

The recent re-shuffle of the Developer CD series into Tools, Systems Software, and Reference Library was to make more room on the CDs. So where are those SDKs? Surely you can fit on the Easy Open SDK ($150 for a floppy and 100 pages of docs - isn’t that just pure greed?). And surely there’s room for AppleScript docs, headers, and samples?

Go ahead, make my day - Here’s what I want:

Put all SDKs, System versions, System Extensions, and DocViewer copies of associated documentation on the Developer CD Series. If I want paper documentation, I’ll pay extra. Or print it.

So why should Icare?

I’ve supported Apple for a long time. I’ve been writing commercial Macintosh Software since 1984. In 1984 Apple did everything it could to encourage development on the Mac. 1994 is like 1984 in that this is a hard sell. 1994 is not like 1984 in that Apple is doing less to support developers. Let’s make 1994 more like 1984. I want the Mac to succeed, and to trample Windows. But on a recent project (Houdini) I took a trip to the dark side. And discovered some truths: Microsoft does far better than Apple in providing access to its system technologies.. Go ahead, make my day. Make it easier for me to make the Macintosh shine.

- James Berry, jberry@teleport.com

Apple Responds!

As Ike Nassi, Apple’s Vice President of Development Products, said at the WWDC in May, Apple is committed to improving the way in which we currently distribute SDKs to developers. In fact, we are - right now - finalizing a new plan that will allow us to deliver a complete set of system software SDKs to developers on a regular basis for a very attractive price. We expect to be able to present the details of this plan within the next month or so. We’re confident that our new approach to SDK distribution will address most of the concerns of Mr. Berry and other developers with whom we’ve discussed similar issues in the past few months.

- Gary Little, Product Manager,

Macintosh Development Tools

Apple Computer, Inc.

 

Community Search:
MacTech Search:

Software Updates via MacUpdate

Latest Forum Discussions

See All

Top Mobile Game Discounts
Every day, we pick out a curated list of the best mobile discounts on the App Store and post them here. This list won't be comprehensive, but it every game on it is recommended. Feel free to check out the coverage we did on them in the links... | Read more »
Price of Glory unleashes its 1.4 Alpha u...
As much as we all probably dislike Maths as a subject, we do have to hand it to geometry for giving us the good old Hexgrid, home of some of the best strategy games. One such example, Price of Glory, has dropped its 1.4 Alpha update, stocked full... | Read more »
The SLC 2025 kicks off this month to cro...
Ever since the Solo Leveling: Arise Championship 2025 was announced, I have been looking forward to it. The promotional clip they released a month or two back showed crowds going absolutely nuts for the previous competitions, so imagine the... | Read more »
Dive into some early Magicpunk fun as Cr...
Excellent news for fans of steampunk and magic; the Precursor Test for Magicpunk MMORPG Crystal of Atlan opens today. This rather fancy way of saying beta test will remain open until March 5th and is available for PC - boo - and Android devices -... | Read more »
Prepare to get your mind melted as Evang...
If you are a fan of sci-fi shooters and incredibly weird, mind-bending anime series, then you are in for a treat, as Goddess of Victory: Nikke is gearing up for its second collaboration with Evangelion. We were also treated to an upcoming... | Read more »
Square Enix gives with one hand and slap...
We have something of a mixed bag coming over from Square Enix HQ today. Two of their mobile games are revelling in life with new events keeping them alive, whilst another has been thrown onto the ever-growing discard pile Square is building. I... | Read more »
Let the world burn as you have some fest...
It is time to leave the world burning once again as you take a much-needed break from that whole “hero” lark and enjoy some celebrations in Genshin Impact. Version 5.4, Moonlight Amidst Dreams, will see you in Inazuma to attend the Mikawa Flower... | Read more »
Full Moon Over the Abyssal Sea lands on...
Aether Gazer has announced its latest major update, and it is one of the loveliest event names I have ever heard. Full Moon Over the Abyssal Sea is an amazing name, and it comes loaded with two side stories, a new S-grade Modifier, and some fancy... | Read more »
Open your own eatery for all the forest...
Very important question; when you read the title Zoo Restaurant, do you also immediately think of running a restaurant in which you cook Zoo animals as the course? I will just assume yes. Anyway, come June 23rd we will all be able to start up our... | Read more »
Crystal of Atlan opens registration for...
Nuverse was prominently featured in the last month for all the wrong reasons with the USA TikTok debacle, but now it is putting all that behind it and preparing for the Crystal of Atlan beta test. Taking place between February 18th and March 5th,... | Read more »

Price Scanner via MacPrices.net

AT&T is offering a 65% discount on the ne...
AT&T is offering the new iPhone 16e for up to 65% off their monthly finance fee with 36-months of service. No trade-in is required. Discount is applied via monthly bill credits over the 36 month... Read more
Use this code to get a free iPhone 13 at Visi...
For a limited time, use code SWEETDEAL to get a free 128GB iPhone 13 Visible, Verizon’s low-cost wireless cell service, Visible. Deal is valid when you purchase the Visible+ annual plan. Free... Read more
M4 Mac minis on sale for $50-$80 off MSRP at...
B&H Photo has M4 Mac minis in stock and on sale right now for $50 to $80 off Apple’s MSRP, each including free 1-2 day shipping to most US addresses: – M4 Mac mini (16GB/256GB): $549, $50 off... Read more
Buy an iPhone 16 at Boost Mobile and get one...
Boost Mobile, an MVNO using AT&T and T-Mobile’s networks, is offering one year of free Unlimited service with the purchase of any iPhone 16. Purchase the iPhone at standard MSRP, and then choose... Read more
Get an iPhone 15 for only $299 at Boost Mobil...
Boost Mobile, an MVNO using AT&T and T-Mobile’s networks, is offering the 128GB iPhone 15 for $299.99 including service with their Unlimited Premium plan (50GB of premium data, $60/month), or $20... Read more
Unreal Mobile is offering $100 off any new iP...
Unreal Mobile, an MVNO using AT&T and T-Mobile’s networks, is offering a $100 discount on any new iPhone with service. This includes new iPhone 16 models as well as iPhone 15, 14, 13, and SE... Read more
Apple drops prices on clearance iPhone 14 mod...
With today’s introduction of the new iPhone 16e, Apple has discontinued the iPhone 14, 14 Pro, and SE. In response, Apple has dropped prices on unlocked, Certified Refurbished, iPhone 14 models to a... Read more
B&H has 16-inch M4 Max MacBook Pros on sa...
B&H Photo is offering a $360-$410 discount on new 16-inch MacBook Pros with M4 Max CPUs right now. B&H offers free 1-2 day shipping to most US addresses: – 16″ M4 Max MacBook Pro (36GB/1TB/... Read more
Amazon is offering a $100 discount on the M4...
Amazon has the M4 Pro Mac mini discounted $100 off MSRP right now. Shipping is free. Their price is the lowest currently available for this popular mini: – Mac mini M4 Pro (24GB/512GB): $1299, $100... Read more
B&H continues to offer $150-$220 discount...
B&H Photo has 14-inch M4 MacBook Pros on sale for $150-$220 off MSRP. B&H offers free 1-2 day shipping to most US addresses: – 14″ M4 MacBook Pro (16GB/512GB): $1449, $150 off MSRP – 14″ M4... Read more

Jobs Board

All contents are Copyright 1984-2011 by Xplain Corporation. All rights reserved. Theme designed by Icreon.