Nov 95 Challenge
Volume Number: | | 11
|
Issue Number: | | 11
|
Column Tag: | | Programmers Challenge
|
Programmers Challenge
By Bob Boonstra, Westford, Massachusetts
Note: Source code files accompanying article are located on MacTech CD-ROM or source code disks.
Enclosing Bounds
The Challenge this month is based on a suggestion by Mike Scanlin, who remains a fan of the column. (Were still waiting for Mikes first Challenge entry, however.) The problem is to write a routine that will return a rectangle enclosing all non-white pixels in a selected area of an image. This code might be useful in a drawing or painting program, where the user would be allowed to select a subset of the image by clicking and dragging, and the software would select all of the elements of the image contained within that selection. The prototype of the code you will write is:
void EnclosingBounds(
PixMapHandle pm, /* handle to PixMap containing image */
Rect selection, /* subset of image to enclose */
Rect *enclosingRect /* enclosing rect return value */
);
Your code should examine all of the pixels within the selection rectangle of the PixMap and return the smallest rectangle containing all of the non-white pixels. Pixels outside the selection rectangle should be ignored. The bounds rectangle of the PixMap will be no larger than 2048 pixels in each dimension, the baseAddr pointer will be longword aligned, and rowBytes will be a multiple of 4. You should deal with pixelSize values of 1, 8, or 32, with values of 8 and 32 being weighted most heavily in measuring performance. For PixMaps with indexed pixels (cmpCount==1), the color table will contain white as the first table entry (as all good color tables are supposed to). For PixMaps with direct pixels, the unused (alpha) bits of each pixel will be zero.
You may use either the Metrowerks or the Symantec compilers for this native PowerPC Challenge. If you have any questions, or would like some test data for your code, please send me e-mail at one of the Programmers Challenge addresses, or directly to boonstra@ultranet.com.
Two Months Ago Winner
Congratulations to Eric Lengyel (Blacksburg, VA) for submitting the fastest and smallest entry to the Reversible Scrambling Algorithm Challenge. Despite an unfortunate delay in publication of the magazine that left participants with less time than usual to complete the Challenge, three of the four entries I received by the extended deadline worked correctly, at least in part.
You might recall that the Challenge was to write code that would raise a large integer message to a power and compute the remainder modulo another large integer. The name of the Challenge comes from the fact that this technique is reversible, given properly chosen integers. Eric is a graduate student in Mathematics at Virginia Tech, and he took advantage of a highly optimized multiple precision integer arithmetic library that he had written as part of a number theory project involving the factorization of very large numbers.
Each of the working entries converted the BigNum representation provided in the problem into one that right-justified numbers into a fixed-length data structure. While this imposes a restriction on the maximum size integer that the code can handle, this assumption was permitted by the problem statement. In Erics code, the restriction is controlled by a single #define statement.
Eric uses a binary exponentiation algorithm to raise the message to the specified power, and takes advantage of facts from number theory that allow the remainder to be computed at each step of the exponentiation. The time to perform the exponentiation is therefore proportional to the logarithm of the exponent. Erics multiplication and division routines use the 68020s capability to compute the 64-bit product of two longwords and to divide a 64-bit dividend by a longword. The multiplication, division, exponentiation, and compare routines in Erics code are general purpose and could be used in any 68K application that needs large integers.
Honorable mention goes to Ernst Munter, who submitted an entry in pure C that was actually the fastest code for the short modulus test cases. Unfortunately, his entry did not produce correct results for the longer moduli.
Here are the times and code sizes for the entries that worked correctly (or partially correctly). Execution time is presented for two specific test cases, with modulus lengths of 22 and 88 bytes, respectively, as well as the total time for all of the test cases I ran. Cases that produced incorrect results are indicated with an asterisk. Numbers in parens after a persons name indicate that persons cumulative point total for all previous Challenges, not including this one.
Name time1 time2 Total time code data
(22) (88)
bytes bytes
Eric Lengyel 47 463 2083 1190 0
Xan Gregg (51) 35 967 3175 1558 0
Ernst Munter (C entry) (90) 17 * * 4266 11788
Top 20 Contestants of All Time
Here are the Top 20 Contestants for the Programmers Challenges to date. The numbers below include points awarded for this months entrants. (Note: ties are listed alphabetically by last name - there are more than 20 people listed this month because of ties.)
Rank Name Points
1. [Name deleted] 176
2. Munter, Ernst 90
3. Karsh, Bill 78
4. Stenger, Allen 65
5. Gregg, Xan 61
6. Larsson, Gustav 60
7. Riha, Stepan 51
8. Goebel, James 49
9. Nepsund, Ronald 47
10. Cutts, Kevin 46
11. Mallett, Jeff 44
12. Kasparian, Raffi 42
13. Vineyard, Jeremy 42
14. Darrah, Dave 31
15. Landry, Larry 29
16. Elwertowski, Tom 24
17. Lee, Johnny 22
18. Noll, Robert 22
19. Anderson, Troy 20
20. Beith, Gary 20
21. Burgoyne, Nick 20
22. Galway, Will 20
23. Israelson, Steve 20
24. Landweber, Greg 20
25. Lengyel, Eric 20
26. Pinkerton, Tom 20
There are three ways to earn points: (1) scoring in the top 5 of any Challenge, (2) being the first person to find a bug in a published winning solution or, (3) being the first person to suggest a Challenge that I use. The points you can win are:
1st place 20 points
2nd place 10 points
3rd place 7 points
4th place 4 points
5th place 2 points
finding bug 2 points
suggesting Challenge 2 points
Here is Erics winning solution:
PowerAndRemainder.c
Copyright © 1995 Eric Lengyel
/*
I call my fixed length numbers BigFixed and translate from BigNums to BigFixeds in the PowerAndRemainder
routine. These are the assembly language routines which are the guts of my program:
(1) PowerMod - raises a number to a power and reduces it by a modulus. It uses a fast binary exponentiation
algorithm, reducing by the modulus at each step.
(2) Multiply - multiplies 2 BigNums together.
(3) MultQ - mutliplies a BigNum by a long int.
(4) Divide - divides one BigNum by another and supplies the quotient and remainder.
(5) Compare - determines the ordering of 2 BigNums.
Some of the loops have been expanded to make more efficient use of the instruction cache.
*/
#define NumSize 72
typedef struct BigNum
{
short numDig;
unsigned char *dig;
} BigNum;
typedef struct BigFixed
{
unsigned char dig[NumSize*4];
} BigFixed;
/* We need 72 longs because the division routine needs the most significant longword to be zero and the
speed optimization requires that NumSize be a multiple of four. */
void PowerAndRemainder(BigNum *msg, BigNum *exp, BigNum *n,
BigNum *res);
void PowerMod(BigFixed *msg, BigFixed *exp, BigFixed *n,
BigFixed *res);
void Multiply(BigFixed *src1, BigFixed *src2, BigFixed *dst);
void MultQ(BigFixed *src1, long src2, BigFixed *dst);
void Divide(BigFixed *end, BigFixed *sor, BigFixed *dst);
short Compare(BigFixed *src1, BigFixed *src2);
PowerAndRemainder
void PowerAndRemainder(BigNum *msg, BigNum *exp, BigNum *n,
BigNum *res)
{
short a, b, numDigits;
BigFixed msg0, exp0, n0, res0;
for (a = 0; a < NumSize*4; a++)
{
b = NumSize*4 - msg->numDig;
if (a < b) msg0.dig[a] = 0;
else msg0.dig[a] = msg->dig[a - b];
b = NumSize*4 - exp->numDig;
if (a < b) exp0.dig[a] = 0;
else exp0.dig[a] = exp->dig[a - b];
b = NumSize*4 - n->numDig;
if (a < b) n0.dig[a] = 0;
else n0.dig[a] = n->dig[a - b];
}
PowerMod(&msg0, &exp0, &n0, &res0);
a = 0;
while (res0.dig[a] == 0) a++;
numDigits = res->numDig = NumSize*4 - a;
for (b = 0; b < numDigits; b++)
res->dig[b] = res0.dig[a++];
}
PowerMod
void PowerMod(BigFixed *msg, BigFixed *exp, BigFixed *n,
BigFixed *res)
{
BigFixed acc, scrap;
asm
{
LEA acc, A0 ; Start with one in
MOVEQ #NumSize/4-2, D0 ; accumulator
1} ; Test a bit in current
; longword of exponent
BEQ @1 ; If zero, skip multiply
PEA acc ; Multiply accumulator
PEA acc ; by base
MOVE.L msg, -(A7)
JSR Multiply
ADDA.W #12, A7
MOVE.L n, -(A7) ; Compare accumulator
PEA acc ; to modulus
JSR Compare
ADDQ.W #8, A7
TST.B D0
BMI @1 ; If its less, skip
; reduction
PEA scrap ; Reduce modulo n
MOVE.L n, -(A7)
PEA acc
JSR Divide
ADDA.W #12, A7
Multiply
/* Multiply src1 by src2 and put product in dst */
void Multiply(BigFixed *src1, BigFixed *src2, BigFixed *dst)
{
short topStop, botStop;
BigFixed acc, line;
asm
{
MOVEM.L D0-D7/A0-A4, -(A7)
LEA acc, A0 ; Clear accumulator
MOVEQ #NumSize/4-1, D0
D5 ; Do 64-bit multiply
ADD.L D2, D5 ; Add carry to low
; longword of product
CLR.L D2 ; Use D2 as dummy to
; extend carry
ADDX.L D2, D6 ; Add zero to high
; longword with carry
MOVE.L D6, D2 ; Anything in high
; longword gets carried
MOVE.L D5, 00(A2, D3.W*4) ; Store low longword in
; partial product
SUBQ.W #1, D3 ; Loop through all
CMP.W topStop, D3 ; longwords in top number
BGE @1
MOVEA.L A2, A0 ; Now add partial product
; to accumulator
MOVE.L D4, D0 ; Calculate correct
; position in product
LEA acc, A1 ; Get accumulators addr
ADDQ.W #1, D0
ADDA.W #NumSize * 4, A0
LSL.W #2, D0
ADDA.W D0, A1
MOVE.W D4, D1
MOVE.L -(A1), D0 ; Get longword of product
SUBQ #1, D1
ADD.L -(A0), D0 ; Add longword of
MOVE.L D0, (A1) ; partial product
TST.W D1 ; If no more longwords,
BMI @2 ; then branch
MultQ
/* Multiply src1 by src2 and put product in dst */
void MultQ(BigFixed *src1, long src2, BigFixed *dst)
{
BigFixed pro;
asm
{
MOVEM.L D0-D7/A0/A1, -(A7)
LEA pro, A0 ; Clear product
MOVEQ #NumSize/4-1, D0
D4 ; Do 64-bit multiply
; by bottom number
ADD.L D2, D4 ; Add carry
CLR.L D2 ; Use D2 as dummy to
; extend carry
ADDX.L D2, D5 ; Add zero with carry
MOVE.L D5, D2 ; High longword
; becomes carry
MOVE.L D4, 00(A1, D0.W*4) ; Put partial product
; into result
SUBQ.W #1, D0 ; Loop through all
CMP.W D1, D0 ; longwords in top #
BGE @1
Divide
/* Divide end (dividend) by sor (divisor) and put quotient in dst. Remainder will end
up in end */
void Divide(BigFixed *end, BigFixed *sor, BigFixed *dst)
{
long pq;
BigFixed quo, line;
asm
{
MOVEM.L D0-D7/A0-A4, -(A7)
LEA quo, A0 ; Clear quotient
MOVEQ #NumSize/4-1, D0
D4 ; Do 64-bit division
Compare
/* Compare src1 and src2. Returns 1 if src1 > src2, 0 if theyre equal, and -1 if src1 <
rc2. */
short Compare(BigFixed *src1, BigFixed *src2)
{
asm
{
MOVEM.L D1/D2/A0/A1, -(A7)
MOVEA.L src1, A0 ; Get src1s address
MOVEA.L src2, A1 ; Get src2s address
MOVEQ #1, D0 ; Start with +1
MOVE.L (A0)+, D2
CMP.L (A1)+, D2 ; Compare 1st longwords
BLT @1 ; If src1 less, branch
BNE @2 ; If !=, src1 must
MOVE.L (A0)+, D2 ; be greater
CMP.L (A1)+, D2 ; Cmp 3 more longwords
BCS @1 ; (Unsigned)
BNE @2
MOVE.L (A0)+, D2
CMP.L (A1)+, D2
BCS @1
BNE @2
MOVE.L (A0)+, D2
CMP.L (A1)+, D2
BCS @1
BNE @2
MOVEQ #NumSize/4-2, D1 ; Number of longwords
; remaining / 4