[comp.graphics] Ray Tracing News archive 2 of 7

cnsy@vax5.CIT.CORNELL.EDU (06/01/89)
 _ __                 ______                         _ __
' )  )                  /                           ' )  )
 /--' __.  __  ,     --/ __  __.  _. o ____  _,      /  / _  , , , _
/  \_(_/|_/ (_/_    (_/ / (_(_/|_(__<_/ / <_(_)_    /  (_</_(_(_/_/_)_
             /                               /|
            '                               |/

Ray Tracing News, e-mail edition, 2/15/88

concatenated by Eric Haines, hpfcla!hpfcrs!eye!erich@hplabs.HP.COM

So, now that the SIGGRAPH paper submission rush is over, the SIGGRAPH paper
review process begins.  Fortunately, it's generally easier to comment on someone
else's deathless prose than write it yourself.  It's also time to start
procrastinating on writing up SIGGRAPH tutorial notes.  So, all in all it's
not been too busy, except for all the "real" work we've all (hopefully) been
doing.


Dore'
-----

The only new news I've got is on the new product by Ardent, called Dore'
(rhymes with "moray" - there should be an up-accent over that "e" in Dore).
Ardent is the new name for Dana Computer Inc (i.e. the "single-user
supercomputer/supergraphics" people.  Their "Titan" minisupercomputer is due
out realsoonnow).  Dore' stands for "Dynamic Object-Rendering Environment".

The places I've seen articles so far is "Electronics", February 4, 1988, on
pages 69-70, and "Mini-Micro Systems", February 1988, pages 22-23.  The
first article offers more detail.  I don't really want to rehash either
article in full.  The salient points (to me) about Dore' are:

	(1) Toolkit approach.
	(2) Can render using vectors, hidden surface, or ray tracing.
	(3) Hierarchical, object oriented system.
	(4) Five object classes:
	    (a) primitives (including points, curves, polygons, meshes, cubic
		solids (?!), and NURBS (non-uniform rational B-splines),
	    (b) appearance attributes (material properties, inc. solid texture
		maps and environmental reflection maps),
	    (c) geometric attributes (modeling matrices),
	    (d) studio objects (camera, lights) (I like this term!),
	    (e) organizational objects (hierarchy, and evidentally the ability
		to define function calls inside the environment which call
		routines in the application program.  No idea how this works).
	(5) Quoted times: 0.1 second for vector, 10 seconds for hidden
	    surface, 100 seconds ray-traced (I assume on the Titan.  No
	    idea what kind of scene complexity or resolution).
	(6) Written in C.
	(7) "Open" system - source code sold in hopes of selling Dore' on other
	    systems.

The best part (for universities and research labs) is the price: $250 for
a source code license - not sure what the cost is for source code maintenance
(vs. $15000 for commercial users plus $5000/year after the first year).  Per
copy binary license is $200.

I am teaching the ray-tracing section of "A Consumer's and Developer's Guide
to Image Synthesis" at SIGGRAPH this year, so definitely want to know more.
I would also like more information just out of curiosity.  So, you university
people, please go out there and get one - seems like a real bargain.  The
contact info for Ardent is:

	Ardent Computer Corp
	550 Del Rey Ave
	Sunnyvale, CA  94086
	408-732-2806


That's all, folks,

Eric

   

 _ __                 ______                         _ __
' )  )                  /                           ' )  )
 /--' __.  __  ,     --/ __  __.  _. o ____  _,      /  / _  , , , _
/  \_(_/|_/ (_/_    (_/ / (_(_/|_(__<_/ / <_(_)_    /  (_</_(_(_/_/_)_
             /                               /|
            '                               |/

			"Light Makes Right"

			 March 1, 1988


I just receive Andrew's note that the hardcopy RTN is in the mail, which
inspired me to flush the buffer and send on the latest offerings.  Special
thanks to Jeff Goldsmith for submissions.

    - Eric

--------------------------------------------------

Mailing list updates
--------------------

First, an address change.  John Peterson is now with Apple, who writes:

    'I'm currently hanging out at Apple thinking about "3D graphics for the
    rest of us" and how to keep the jaggies away from personal computers.
    (But there is this Cray sitting about 50 feet away.  Hmmm...)'

#
# John Peterson - bicubic splines, texturing
# Apple Computer (graduated University of Utah, 1988)
alias	john_peterson	hpfcrs!hpfcla!jp%apple.apple.com@RELAY.CS.NET

I asked him for ray tracers at the University of Utah.  So, Tom Malley and
Rod Bogart (whose initials are 'RGB') are now subscribers.

From Tom:
    My thesis research was similar to what John Wallace described,
    being a two pass approach to radiosity to include specular reflection
    and transparency.  Form factors were all calculated via ray tracing,
    however.  I did some brief examination of different ray intersection
    methods along the way (Rubin-Whitted, Kay-Kajiya, and Glassner).

#
# Tom Malley - blending ray tracing and radiosity
# Evans & Sutherland (graduated University of Utah, 1988)
# (malley@cs.utah.edu, cs.utah.edu!esunix!tmalley)
alias	tom_malley	hpfcrs!hpfcla!hplabs!malley@cs.utah.edu


To quote John Peterson about Rod Bogart:
    Rod developed a really neat method for using ray tracing to integrate
    computer generated pictures with real world images (coming soon to a
    SIGGRAPH near you...).

#
# Rod Bogart - blending ray tracing and images
# University of Utah
alias	rod_bogart	hpfcrs!hpfcla!hplabs!bogart%gr@cs.utah.edu

-----------------------------------

Another Dore' Article

In case you have not been able to track down the two articles previously
mentioned about Dore', Ardent's new rendering system, there's now a third
(that I know of):  it's in "Computer Design", Feb 15, 1988, pages 30-31.
Pretty much like the other articles (i.e. cast from the same press release).

-----------------------------------

Responses to the "teapot in a football stadium" problem:


From: Andrew Glassner

 Just a quick response to your football stadium/teapot example.  When you
subdivide a node, look at its children.  If only one child is non-empty,
replace the original node with its non-null child.  Do this recursively 
until the subdivision criterion is satisfied.  I do this in my spacetime
ray tracer, and the results can be big.  The ray propagation can get just
a bit more complex, but there are clever rays to keep it simple (see
John Amanatide's article in Eurographics '87, plus I have a scheme that
I hope to write up soon...).

  Better yet, go with a hybrid space subdivision/bounding volume scheme,
such as the one described in my spacetime paper (poorly described in the
Intro to RT notes, but better described in the version slated for the
March issue of CG&A; I'd be happy to mail you a preprint).  I think this
hybrid scheme gives the best of both worlds, and you can use whatever
space subdivision and bounding volume techniques that you like in the
two distinct phases of the algorithm.  I use adaptive space subdivision
and Kay's bounding slabs, and that combination seems to work pretty well.

  And now I have to get back to moving into my office!

------------------------------------------

Comments on Jim Arvo's Efficiency Article


From: Eric Haines (with a few extra comments than my original letter to Jim)

	Your article on efficiency is fascinating.  I hope to read it more
carefully tonight and (eventually--we just came under a crunch of work)
comment on it.  Sounds like you've done a lot of serious thought and
speculation on the possibilities.  I agree with the philosophy of objects
each having their own private hierarchies, and having the ability to hook
these hierarchies up however you want.  We've done that on a small scale
with our tesselated spline surfaces:  automatic hierarchy a la Goldsmith &
Salmon (IEEE CG&A, May 1987) for everything, but then octrees for the spline
surfaces themselves.  A nice feature of Goldsmith is that you can weight the
cost of each primitive into the algorithm: multiply its area by some
intersection cost (which you'll probably have to figure out through
experimentation) to give it a weighting.  In this way a torus surface which has
the same size bounding volume as a quadrilateral can be given a higher
weighting factor.  A higher cost has the effect of making the hierarchical tree
less horizontal near the complicated object, i.e. there are more bounding
volumes overall, with a few complicated objects in each.  This is what you
want, since you'd rather spend a little extra time on intersecting bounding
volumes than wasting a lot of time intersecting the empty space around costly
objects.


Response from: Jim Arvo

    I'm glad you found my article interesting.  All your interesting
    mail finally motivated me to contribute to the discussion.  I
    thought I would toss out a pet idea of mine and see if it sparked
    any debate.  It turns out that Jeff Goldsmith also looked at
    simulated annealing for bounding box hierarchies.  One day one 
    of us will get some results.  Hopefully not negative results!

    With all the talk about octrees and such, it's clear that there 
    are a number of potential papers "waiting in the wings".  I've
    been thinking that by getting the right collaborations going,
    we (the ray tracing group) could easily "hand" IEEE several
    related papers, effectively defining a theme issue.  What do
    you think?


My reply:

	The efficiency article collection sounds possible.  Another idea which
someone (Mike Kaplan, maybe?  I forget) mentioned at last SIGGRAPH was "A
Characterization of Ten Ray Tracing Efficiency Algorithms".  If well done, this
would be a classic.  There are probably entirely new schemes still to be found,
and certainly trying to optimize and figure out good hybrid methods is an
area ripe for development.  But right now many of the structures and algorithms
are in place, and still have not been fully compared.  Timings are
unconvincing, and statistics are worthwhile but don't tell the whole story.
An in-depth comparison of the major algorithms and techniques to improve these
would be wonderful.  Someday, someday ... well, my hope is that a few of us
could do some writing along these lines, even if it's just brainstorming on
how to compare particular algorithms in a rigorous fashion  (e.g. How can we
simulate a scene mathematically?  OK, idealize each object as a box or sphere
for simplicity.  Now, how do we distribute the points to get realistic
clustering? Once we have a "scene generator" which could create various typical
distributions of objects in a scene, then we have to analyze how this generator
would interact with each algorithm, and be able to predict how each efficiency
scheme deals with the scenes generated.  Or there might be simpler ways to
isolate and analyze each factor which affects the efficiency of a scheme.
Anyway, whatever, but this stuff looks fun!).  Understanding the strengths of
the various techniques seems vital to being able to do any kind of "annealing"
process for optimization.

------------------------------------------------------------

Efficiency Tricks
-----------------

From Jeff Goldsmith:

    Here's a good hack for Ray Tracing News:
When using Tim Kay's heapsort on bounding volumes in order
to get the closest, don't bother to do that for illumination
rays.  I know it seems obvious, but I never thought to do it.

The obvious corollary to that idea has a little more reach to
it.  Since illumination rays form the bulk of the rays we 
trace, getting the nearest intersection is of limited value.
In addition, if CSG is used, more times occur when the nearest
intersection is of less value.  This seems to indicate that 
space tracing techniques are doing some amount of needless work.
Since it doesn't really cost that much, perhaps it is not a flaw,
but maybe space tracers should consider approaches that don't
worry about where along the path we are and optimize that problem
instead.

---------------------------------------------

More Book Recommendations
-------------------------

From: Jeff Goldsmith

    I agree completely with your comment about libraries.
Mine is a crucial resource for me.  Here are some of my
favorite books that are in my office:

	Geometry:
	
	    Computational Geometry for Design and Manufacture
	        Faux & Pratt
	 	--an early CAD text.  It has lots of good stuff
		on splines and 3D math.

	    Differential Geometry of Curves and Surfaces
		DoCarmo
		--A super text on classical differential geometry.
		(Not quite the same as analytic geometry.)

	    CRC Standard Math Tables
		--This has an awesome section on analytic geometry.
		Calculus, too.  Can't live without it.  It is not
		the same as the first part of the Chemistry and 
		Physics one.

	    Analytic Geometry
		Steen and Ballou
		--Once was the standard college text on the subject.
		That was a long time ago, but it is very easy to
		read and it covers the fundamentals.

	Computing:

	    Data Structures and Algorithms
		--Aho, Hopcroft and Ullman
		Read anything by these guys.

	    Data Structure Techniques
		--Standish
		More How-to than AHU's tome.

	    Numerical Analysis
		--Burden, Faires, and Reynolds
		I have the other two, as well.  This is the
		least complete of the three, but the algorithms
		inside are childishly easy to implement.  They
		always seem to work, too.  Best of all, for many
		cases, they have test data and solutions.

	    Software Tools
		--Kernighan and Plauger
		How to write command line interpreters, editors,
		macro expanders, the works.  Great reading.

	    Fundamentals of Computer Algorithms
		--Horowitz and Sahni
		Less technical than AHU, but pretty technical.
		Thicker.  It may very well answer the problem
		you can't figure out straight off.

	    The Art of Computer Programming
		--Knuth
		The "Encyclopedia"

	Physics:  (Seem awfully useful sometimes)

	    Gravitation
		--Misner, Thorne, and Wheeler
		The thickest book on my shelf.  It's a paperback, too.
		(It's bent three bookends permanently.  Cheap JPL ones.)
		Truly a tome on modern physics.
		
	    Modern Physics
		--Tippler
		Much easier to read than MTW.  Has lots of good appendices.

	    University Astronomy
		--Pasachoff and Kuttner
		I read this book for fun.  I wonder why I didn't read it
		while I was taking Kuttner's course?

	    The Feynman Lectures on Physics
		Awesome first course.  Most of my needs are problems in
		the text.

	Graphics, etc:

	    Raster Graphics Handbook
		--Conrac
		All about fundamentals of the craft.

	    Light and Color in Nature and Art
		--Williamson and Cummings
		Much easier to read than Hall's thesis, but less 
		technical as well. 

	Etc, Etc:

	    The Random House Dictionary of the English Language, 
	    College Edition
		The best collegiate sized dictionary around. 
		By far.

	    The Chicago Manual of Style
		Has most of the answers. Did you know that
		to recreate is to have fun, but to 
		re-create is computer graphics?

	    The Elements of Style
		The one that came before computers.

-----------------------------------------------

Bug for the Day						by Eric Haines
---------------

{This will be pretty unexciting for those who never intend to implement an
octree subdivision scheme.  For future implementers, I hope you find it of
use:  it took me quite a few hours to track this one down, so I think it is
worth going into.}

	This bug was one I had when implementing octree subdivision for ray
tracing.  The basic algorithm used was Glassner's:  once you intersect the
octree structure, move the intersection point in one half of the smallest
cube's dimension in the direction normal to the wall hit.  In other words,
find out what cube is the next cube by finding a point that should be well
inside of it, then translating this point into integer octree coordinates
and traversing the octree downwards until a leaf node is found.

	However, there are some subtle errors that can occur with moving to the
next octree cube.  My favorite is almost hitting the edge of a cube, moving
into the next cube, then getting caught moving to the cube diagonal to this
cube, i.e. moving from cube 1 to 2 to 3 ...

	X-->
	+---+---+
      ^ | 2 | $ |	Numbers are the order of cubes moved through.
      | +---#---+
      Y | 1 | 3 |
	+---+---+
	  ^________ray started here, and hit almost at the "#".
		   (ray is in +X, +Y direction)

This went into an infinite loop, going between 2 and 3 forever.  The reason
was that when I hit the boundary 1&2 I would add a Y increment (half minimum
box size) to the intersection point, then convert this to find that I was
now in box 2.  I would then shoot the ray again and it would hit the
wall at 2&$.  To this intersection point I would add an X increment.  However,
what would happen is that the Y intersection point would actually be ever so
slightly low - earlier when I hit the 1&2 wall adding the increment pushed us
into box 2.  But now when the Y intersection point was converted it would
put us in the 1+3 boxes, and X would then put us in box 3.  Basically, the
precision of the machine made the mapping between world space and octree
space be ever so slightly off.

	The infinite loop occurred when we shot the ray again at box 3.  It
would hit the 3/$ wall, get Y incremented, and because X was ever so slightly
less than what was truly needed to put the intersection point in the 3+$
boxes, we would go back to box 2, ad infinitum.  Another way to look at this
is that when we would intersect the ray against any of the walls near the
"#" point, the intersection point (due to roundoff) was always mapping to
box 1 if not incremented.  Incrementing in Y would move it to box 2, and in
X would move it to box 3, but then the next intersection test would yield
another point that would be in box 1.  Since we couldn't increment in
both directions at once, we could never get past 2 and 3 simultaneously.

	This bug occurs very rarely because of this: the intersection points
all have to be such that they are very near a corner, and the mapping of the
points must all land within box 1.  This problem occurred for me once in a
few million rays, which of course made it all that much more fun to search
for it.

	My solution was to check the distance of the intersections generated
each time: if the closest intersection was a smaller distance from the origin
than the closest distance for the previous cube move, then this intersection
point would not be used, but rather the next higher would be.  In this way
forward progress along the ray would always be made.

	By the way, I found that it was worthwhile to always use the original
ray origin for testing ray/cube intersections - doing this avoids any
cumulative precision errors which could occur by successively starting from
each new intersection point.  To simulate the origin starting within the cube
I would simply test only the 3 cube faces which faced away from the ray
direction (this was also faster to test).

	Anyway, hope this made sense - has anyone else run into this bug? Any
other solutions?

---------------------------------------------

A Pet Peeve (by Jeff Goldsmith)
-----------

Don't ever refer to pixels as rows and columns.  It makes it
hard to get the order (row,column)? (column,row)? right.  Refer
to pixels as (x,y) coordinates.  Not only is that the natural
system to do math on them, but it is much easier to visualize
in a debugging environment, as well as running the thing.  I
use the -x and -y npix switches on the tracer command line to
override any settings and have found them to be much easier to
deal with than the -r and -c that seem to be everywhere.  Note
that C's normal array order is (I think.  I always get these 
things wrong.) (y,x).

	[I agree: my problem now is that Y=0 is the bottom edge of the screen
	when dealing with the graphics package (HP's Starbase), and Y=0 is the
	top when directly accessing the frame buffer (HP's SRX). -- EAH]

---------------------------------------------

	Next "RT News" issue I'll include a write-up of Goldsmith/Salmon which
should hopefully make the algorithm clearer, plus some little additions I've
made.  I've found Goldsmith/Salmon to be a worthwhile, robust efficiency scheme
which hasn't received much attention.  It embodies an odd way of thinking
(I have to reread my notes about it when I want to change the code), as there
are a number of costs which must be taken into account and inherited.  It's
not immediately intuitive, but has a certain sense to it once all the pieces
are in place.  Hopefully I'll be able to shed some more light on it.

All for now,

Eric

 _ __                 ______                         _ __
' )  )                  /                           ' )  )
 /--' __.  __  ,     --/ __  __.  _. o ____  _,      /  / _  , , , _
/  \_(_/|_/ (_/_    (_/ / (_(_/|_(__<_/ / <_(_)_    /  (_</_(_(_/_/_)_
             /                               /|
            '                               |/

			"Light Makes Right"

			 March 8, 1988

-------------------------------------------------

Surface Acne
------------

From Eric Haines:

	A problem which just about every ray tracer has run into, and which
has rarely appeared in the literature (and even more rarely been solved in any
way) is what I call "surface acne".

	An easy way to explain this problem is with an example.  Say you are
looking at a double sided (i.e. no culling) cylinder primitive.  You shoot an
eye ray, hitting the outside.  Now you look at a light.  As it turns out, the
intersection point truly is bathed by the light, and so should see it.  What
actually may happen is that the shadow test ray hits the cylinder.  In images
this will show up as black dots or other anomalous shadings - "surface acne".
I've seen this left in some images to give an interesting textured effect, but
normally it's a real problem.

	How did this happen?  Well, theoretically it can't.  However, due to
precision error the following happens.  When you hit the cylinder and
calculated the intersection point in world space, the point computed was
actually ever so slightly inside the cylinder.  Now, when the shadow ray
is sent out, it is tested against the cylinder's surface, and an intersection
is found at some tiny distance from the origin.

	A common solution is to just assign an epsilon to each intersector and
cross your fingers.  In other words, what you really do is move the ray origin
ever so slightly along the shadow (or reflection or refraction) ray direction
and hope this was far enough that the new origin is 'outside' of the object
(in actuality, what you want is for the new origin to be on the same side of
the object as the parent ray, except for refraction rays, which want to start
on the opposite side).  This works fairly well for test systems, but is pretty
scary stuff for software used by anyone who didn't design it (e.g. some user
decides to input his molecular database in meters, causing all his data to be
much smaller in radius than my fudge factor.  When I add my fudge factor
distance to the ray, I find that my new ray origin is way outside the scene).

	Another solution is to not test the item intersected if it is not
self-shadowing.  For example, a polygon cannot cast a shadow on itself, so
should not be tested for intersection when a ray originates on its surface.
This works fine for some primitives, but falls apart when self-shadowing
objects (cylinders, tori, spline surfaces, etc) are used.

	I have also experimented with some root polishing techniques, which
help to solve some problems, but I'll leave it at this for now.  Has anyone
any better solutions for surface acne (ideally foolproof ones)?  I suspect
that the best solution is a combination of the above techniques, but hopefully
I'm missing some concept that might make this problem easy to solve.  Hope to
hear from you all on this!

-------------

Addenda from Jeff Goldsmith:

    Al [Barr] and I have used a technical term for "surface acne,"
too.  We called it "black dots" or more often "black shit."
(Zbuffers have similar problems.  The results are called
"zbuffer shit" or "zippers".  Mostly the cruder term is
used since the artifacts are not particularly desirable.)

--------------------------------------------------

Goldsmith/Salmon Hierarchy Building
-----------------------------------

Well, I was going to write up some info on the Goldsmith/Salmon hierarchy
building algorithm, but the RT News buffer was filled almost immediately and 
I haven't done it yet.  However, there was this from Jeff Goldsmith, about
his earlier paper (IEEE CG&A, May 1987):

       If you are going to spend some time and effort on automatic
    tree generation stuff (Note: paper 2 is almost done--mostly
    talks about parallelism and hypercubes, but some stuff on trees
    as well--mostly work heuristics that include primitives
    and so on) I'd like to hear some thinking about the evaluation
    function.  Firstly, it's optimized for primary rays.  That turns
    out to be an unfortunate choice, since most rays are secondary
    rays.  We've come up with a second order correction that is 
    good for evaluating trees, but turns the generation algorithm
    into O(n log^2 n).  We've not played around with it enough to
    tell whether it works.  If you have some thoughts/solutions,
    that would be nice.  Another finding on the same vein that is
    much more important is: the mean (see next note) seems to be 
    reasonably close, but sigma is very high for the predictions
    vs. actual tries.  This wasn't important (actually, wasn't
    detected) on a sequential machine, but became crucial on a
    parallel machine.  Some of the variation is due to our assumption/
    attempt at view direction independence.  (Clearly, stuff in back
    is not checked for intersection much.)  I don't know whether
    that is all of it--we get bizarre plots of this data.  If you
    have any thoughts on how to make a better or more precise
    evaluation function, I'd really like to hear the reasoning and
    perhaps steal and use the results.  
    Oh, the promised note: The mean is only correct if the highest
    level bounding volume (root node) is contained completely within
    the view volume.  If it isn't, the actual results end up proportional
    to the predicted ones, but I haven't worked out the constant.  
    (It shows up on our graphs pretty clearly, though.)

	The second part of the algorithm is the builder.  I'm not
    convinced that it is a very good method at all, but it met the
    criteria I set up when trying to decompose trees--O(below n^2)
    and reasonably local (I was trying to use simulated annealing
    at the time.)  Some other features were environmental; some were
    because I couldn't think of a better way.  In no sense am I convinced
    that the incremental approach or the specific one chosen is best.
    I'd like to hear about that, too.

	The only part I really like about the whole thing is the 
    general approach of using heuristics to guess at some value
    (rated in flops eventually) and then trying to optimize that
    value.  Beyond that, I think there is a whole realm of computational
    techniques waiting to be used to approximately solve optimization
    problems.  I'm really interested in other work done in that 
    direction and especially results regarding graphics.

	Thanks for the good words; I seem to have been mentioned
    in most of the last issue.  I bet that has something to do with
    my having acquired a network terminal on my desk less than a 
    month ago (yay!).  

-----------------------------------------------------

Efficiency Tricks followup
--------------------------

These are comments generated by Jeff Goldsmith's note that Kay/Kajiya sorting is
not needed for shadow rays.

-----------

Comments from Masataka Ohta:

In the latest ray tracing news, you write:

>Efficiency Tricks

>Since illumination rays form the bulk of the rays we 
>trace.

If so, instead of space tracing, you should use ray coherence
at least for the illumination rays.

The ray coherent approaches are found in CG&A vol. 6, no. 9 "The
Light Buffer: A Shadow-Testing Accelerator" and in my paper "ray
coherence theorem and constant time ray tracing algorithm" in
proceedings of CG International '87.

>In addition, if CSG is used, more times occur when the nearest
>intersection is of less value.  This seems to indicate that 
>space tracing techniques are doing some amount of needless work.

How about tracing illumination rays from light sources, instead
of from object surface? It will be faster for your CSG case,
if the surface point lies in the shadow, though if the surface
point is illuminated, there will be no speed improvement.

The problem is interesting to me because my research on coherent
ray tracer also suggests that it is much better to trace illumination
rays from the light source.

Do you have any other reasons to determine from where illumination rays
are fired?

----------------------------------------------------------------------

Jeff Goldsmith's reply:

Actually, I believe you, though I won't say with certainty 
that we know the best way to do shadow testing.  However,
I'm interested in fundamentally understanding the ray tracing
algorithm and determining what computation MUST be done, so
the realization that space tracing illumination rays still
seems meaningful.  In fact, it is my opinion that space tracing
is not the right way to go and "backwards" (classical) ray
tracing will eventually be closer to what will be used 30
years from now.  I won't even try to defend that position;
no one knows the answers.  What we are trying to do is
shed a little "light" on the subject.  Thanks for your
comments.

-----------------

From Eric Haines:

	I just got from Ohta the same note Ohta sent to you, plus your reply.
Your reply is so short that I've lost the sense of it.  So, if you don't mind,
a quick explanation would be useful.

> However,
> I'm interested in fundamentally understanding the ray tracing
> algorithm and determining what computation MUST be done, so
> the realization that space tracing illumination rays still
> seems meaningful.

What is "the realization that space tracing illumination rays"?  I'm missing
something here - which realization?

> In fact, it is my opinion that space tracing
> is not the right way to go and "backwards" (classical) ray
> tracing will eventually be closer to what will be used 30
> years from now.

Do you mean by "space tracing" Ohta's method?

Basically, it looks like I should reread Ohta's article, but I thought I'd
check first.

--------------

Further explanation from Jeff Goldsmith:

I think that a word got dropped from the sentence, either when I
typed it in or later.  (Who knows--I do that about as often as
computers do.)

I meant:  Since distance order is not needed for illumination
rays, space tracing methods in general (not Ohta's in particular)
do extra work.  It's not always clear that extra information costs
extra computation, but they usually go hand in hand.  (It was just
a rehash of the original message.)  Anyway, if extra computation is
being done, perhaps then there is an algorithm that does not do 
this computation, yet does all the others (or some others...)
that is of lower asymptotic time complexity.  

Basically, this all boils down to my response to various claims
that people have "constant time" ray tracers. It is just not 
true.  It can't be true if they are using a method that will yield
the first intersection along a path since we know that that 
computation cannot be done in less than O(n log n) without a
discretized distance measurement.  I don't think that space
tracers discretize distance in the sense of a bucket sort, but
I could be convinced, I suppose.  Anyway, that's what the ramblings
are all about.  If you have some insights, I'd like to start an
argument (sorry, discussion) on the net about the topic.  What
do you think?

------------------------------------------------------------

Extracts from USENET news
-------------------------

There was recently some interesting interchange about octree building on USENET.
Some people don't read or don't receive comp.graphics, so the rest of this
issue consists of these messages.

----------------

From Ruud Waij (who is not on the RT News e-mail mailing list):

In article <198@dutrun.UUCP> winffhp@dutrun.UUCP (ruud waij) writes:
My ray tracing program, which can display the 
primitives block, sphere cone and cylinder, 
uses spatial enumeration of the object space 
(subdivision in regularly located cubical cells 
(voxels)) to speed up computation.

The voxels each have a list of primitives.
If the surface of a primitive is inside a voxel,
this primitive will be put in the list of the voxel.

I am currently using bounding boxes around the 
primitives: if part of the bounding box is 
inside the voxel, the surface of the primitive 
is said to be inside the voxel.
This is a very easy method but also very s-l-o-w.

I am trying to find a better way of determining 
whether the surface of a primitive is in a voxel 
or not, but I am not very succesful.
Does anyone out there have any suggestions ?

---------------

Response from Paul Heckbert:
  
Yes, interesting problem!  Fitting a bounding box around the object and listing
that object in all voxels intersected by the bounding box will be inefficient as
it can list the object in many voxels not intersected by the object itself.
Imagine a long, thin cylinder at an angle to the voxel grid.

I've never implemented this, but I think it would solve your
problem for general quadrics:

    find zmin and zmax for the object.
    loop over z from zmin to zmax, stepping from grid plane to grid plane.
	find the conic curve of the intersection of the quadric with the plane.
	this will be a second degree equation in x and y (an ellipse,
	    parabola, hyperbola, or line).
	note that you'll have to deal with the end caps of your cylinders
	    and similar details.
	find ymin and ymax for the conic curve.
	loop over y from ymin to ymax,
	    stepping from grid line to grid line within the current z-plane
	    find the intersection points of the current y line with the conic.
	    this will be zero, one, or two points.
	    find xmin and xmax of these points.
	    loop over x from xmin to xmax.
		the voxel at (x, y, z) intersects the object

Perhaps others out there have actually implemented stuff like this and will
enlighten us with their experience.

-----------------

Response from Andrew Glassner:

  Ruud and I have discussed this in person, but I thought I'd respond
anyway - both to summarize our discussions and offer some comments
on the technique.

  The central question of the posting was how to assign the surfaces
of various objects to volume cells, in order to use some form spatial
subdivision to accelerate ray tracing.  Notice that there are at
least two assumptions underlying this method.  The first assumes that 
the interior of each object is homogeneous in all respects, and thus
uninteresting from a ray-tracing point of view.  As a counterexample,
if we have smoke swirling around inside a crystal ball, then this
"homogeneous-contents" assumption breaks down fast.  
 
  To compensate, we either must include the volume inside each object 
to each cell's object list (and support a more complex object description
encompassing both the surface and the contents), or include as new objects 
the stuff within the original.  

  The other assumption is that objects have hard edges; otherwise we have 
to revise our definition of "surface" in this context.  This can begin
to be a problem with implicit surfaces, though I haven't seen this really
discussed yet in print.  But so as long as we're using hard-edged objects 
with homogeneous interiors, the "surface in a cell" approach is still 
attractive.  From here on I'll assume that cells are rectangular boxes.

  So to which cells do we attach a particular surface?  Ruud's current 
technique (gathered from his posting) finds the bounding box of the surface 
and marks every cell that is even partly within the bounding volume.  Sure,
this marks a lot of cells that need not be marked.  One way to reduce the
marked cell count is to notice that if the object is convex, we can unmark 
any cell that is completely within the object; we test the 8 corners with 
an inside/outside test (fast and simple for quadrics; only slightly slower 
and harder for polyhedra).  If all 8 corners are "inside", unmark the cell.  
Of course, this assumes convex cells - like boxes.  Note that some quadrics
are not convex (e.g. hyperboloid of one sheet) so you must be at least
a little careful here.

  The opposite doesn't hold - just because all 8 corners are outside
does NOT mean a cell may be unmarked.  Consider the end of a cylinder
poking into one side of a box, like an ice-cream bar on a stick,
where the ice-cream bar itself is our cell.  The stick is within the 
ice cream, but all the corners of the ice cream bar are outside the stick.  
Since this box contains some of the stick's surface, the box should still 
be marked.  So our final cells have either some inside and some outside 
corners, or all outside corners.

  What do we lose by having lots of extra cells marked?  Probably not
much.  By storing the ray intersection parameter with each object after
an intersection has been computed, we don't ever need to actually
repeat an intersection.  If the ray id# that is being traced matches
the ray id# for which the object holds the intersection parameter, we
simply return the intersection value.  This requires getting access to
the object's description and then a comparison - probably the object
access is the most expensive step.  But most objects are locally
coherent (if you hit a cell containing object A, the next time you need
object A again will probably be pretty soon).  So "false positives" -
cells who claim to contain an object they really don't - aren't so bad,
since the pages containing an object will probably still be resident
when we need it again.

  We do need to protect ourselves, though, against a little gotcha that
I neglected to discuss in my '84 CG&A paper.  If you enter a cell and
find that you hit an object it claims to contain, you must check that
the intersection you computed actually resides within that cell.  It's
possible that the cell is a false positive, so the object itself isn't
even in the cell.  It's also possible that the object is something like
a boomerang, where it really is within the current cell but the actual
intersection is in another cell.  The loss comes in when the intersection
is actually in the next cell, but another surface in the next cell (but
not in this one) is actually in front.  Even worse, if you're doing CSG, 
that phony intersection can distort your entire precious CSG status tree!
The moral is not to be fooled just because you hit an object in a cell; 
check to be sure that the intersection itself is also within the cell.  

  How to find the bounding box of a quadric?  A really simple way is
to find the bounding box of the quadric in its canonical space, and
then transform the box into the object's position.  Fit a new bounding
box around the eight transformed corners of the original bounding box.
This will not make a very tight volume at all, (imagine a slanted,
tilted cylinder and its bounding box), but it's quick and dirty and 
I use it for getting code debugged and at least running.

  If you have a convex hull program, you can compute the hull for
concave polyhedra and use its bounding box; obviously you needn't
bother for convex polyhedra.  For parametric curved surfaces you can
try to find a polyhedral shell the is guaranteed to enclose the
surface; again you can find the shell's convex hull and then find
the extreme values along each co-ordinate.

  If your boxes don't have to be axis-aligned, then the problem changes
significantly.  Consider a sphere: an infinite number of equally-sized
boxes at different orientation will enclose the sphere minimally.  More
complicated shapes appear more formidable.  An O(n^3) algorithm for
non-aligned bounding boxes can be found in "Finding Minimal Enclosing
Boxes" by O'Rourke (International Journal of Computer and Information
Sciences, Vol 14, No 3, 1985, pp. 183-199).  

  Other approaches include traditional 3-d scan conversion, which I think
should be easily convertable into an adaptive octree environment.  Or you
can grab the bull by the horns and go for raw octree encoding, approximating
the surface with lots of little sugar cubes.  Then mark any cell in your
space subdivision tree that encloses (some or all of) any of these cubes.




 _ __                 ______                         _ __
' )  )                  /                           ' )  )
 /--' __.  __  ,     --/ __  __.  _. o ____  _,      /  / _  , , , _
/  \_(_/|_/ (_/_    (_/ / (_(_/|_(__<_/ / <_(_)_    /  (_</_(_(_/_/_)_
             /                               /|
            '                               |/

			"Light Makes Right"

			 March 26, 1988



Table of Contents:
	Intro, Eric Haines
	Mailing list changes and additions: Kuchkuda, Lorig, Rekola
	More on shadow testing, efficiency, etc., Jeff Goldsmith
	More comments on tight fitting octrees for quadrics, Jeff Goldsmith
	LINEAR-TIME VOXEL WALKING FOR OCTREES, Jim Arvo
	Efficiency Tricks, Eric Haines
	A Rendering Trick and a Puzzle, Eric Haines
	PECG correction, David Rogers

---------------------------------------------------------------

Well, NCGA was pretty uninspiring, as it rapidly becomes more and more PC
oriented. It was great to see people, though, and nice to escape the Ithaca
snow and rain.

As far as ray tracing goes, a few companies announced products.  The AT&T
Pixel Machine now has two rendering packages, PICLIB and RAYLIB (these may
be merged into one package someday - I would guess that separate development
efforts caused the split [any comments, Leonard?]).  With the addition of some
sort of VM capability, this machine becomes pretty astounding in its ray
tracing performance (in case you didn't get to SIGGRAPH last year, they had
a demo of moving by mouse a shiny ball on top of the mandrill texture map:
about a frame per second ray trace on a small part of the screen).  HP
announced its new graphics accelerator, the TurboSRX, and with it the eventual
availability of a ray tracing and (the first!) radiosity package as an extension
to their Starbase graphics library.  Ardent and their Dore' package were sadly
missing.  Apollo was also noticeable for their non-appearance.  Sun TAAC was
there, showing off some ray traced pictures but seemingly not planning to
release a ray tracer (the salesman claiming that whoever bought a TAAC would
simply write their own).  Stellar was there with their supercomputer
workstation - interesting, but no ray-tracing in sight.  Anyone else note
anything of ray-tracing (or other) interest?

-------------------------------------------------------------------------

Some mailing list changes and additions

Changed address:

# Roman Kuchkuda
# Megatek
alias	roman_kuchkuda \
	hpfcrs!hpfcla!hplabs!ucbvax!ucsd!megatek!kuchkuda@rutgers.edu


New people:

I saw Gray at NCGA and got his email address.  He worked on ray tracing at
RPI and is at Cray:

# Gray Lorig - volumetric data rendering, sci-vi, and parallel & vector
#	architectures.
# Cray Research, Inc.
# 1333 Northland Drive
# Mendota Heights, MN  55120
# (612)-681-3645
alias	gray_lorig	hpfcrs!hpfcla!hplabs!gray%rhea.CRAY.COM@uc.msc.umn.edu


By way of introduction, this from Erik Jansen:

    I visited the Helsinki University of Technology last week and found 
    there a lot of ray tracing activities going on. They are even reviving 
    their old EXCELL system (Markku Tamminen did a PhD work on a spatial 
    index based on an adaptive binary space subdivision in '81-'82, I met 
    him in '81 and '82 and we talked at these occasions about ray tracing 
    and spatial subdivision. In his PhD thesis (1982) there is a ray tracing 
    algorithm given for the EXCELL method (EXtendible CELL method).
    I decided to implement the algorithm for ray tracing polygon models.
    That implementation failed because I could only use our PDP-11 at that 
    time and I could have about ten cells in internal memory - too less
    for effective coherence. The program spend 95% of its time on swapping.
    So far the history).
    I told them about the RT-news and they are very interested to receive
    it. I will mail them (Charles Woodward, Panu Rekola, e.o.) your address,
    so that they can introduce themselves to the others.

#
# Panu Rekola - spline intersection, illumination models, textures
# Helsinki University of Technology
# Laboratory of Information Processing Science
# Room Y229A
# SF-02150 Espoo
# Finland
#	pre@kiuas.hut.fi	(or pre@hutcs.hut.fi)
alias	panu_rekola	hpfcrs!hpfcla!hpda!uunet!mcvax!hutcs!pre

Panu Rekola writes:

    I just received a message from Erik Jansen (Delft) in which he told
    me that you take care of a mailing list called "Ray Tracing News".
    (I already sent a message to Andrew Glassner on the same topic because
    Erik told me to contact him when he visited us some weeks ago.)
    Now, I would like to join the discussion; I promise to ask no stupid
    questions. I have previously worked here in a CAD project (where I wrote
    my MSc thesis on FEM elements) and since about a year I have been 
    responsible of our graphics. Even though my experience in the field is
    quite short I suppose I have learned a lot while all people want to see
    their models in color and with glass etc., visualization has been the
    bottleneck in our CAD projects.

    As an introduction to the critical members of the mailing list you
    could tell that I am a filter who read unstandard input from the models
    created by other people, manipulates the data with the normal graphics
    methods, and outpus images. The special features of our ray tracer are
    the EXCELL spatial directory (which has been optimized for ray tracing
    during the last few weeks), a new B-spline (and Bezier) algorithm,
    methods to display BR models with curved surfaces (even blend surfaces,
    although this part yet unfinished). The system will be semi-commercially
    used in a couple of companies soon (e.g. car and tableware industry).

------------------------------------------------------------------------

More on shadow testing, efficiency, etc. (followup to Ohta/Goldsmith
correspondence):

From Jeff Goldsmith:

    Sorry I haven't responded sooner, but movie-making has taken
    up all my time recently.  

    With respect to pre-sorting, etc.  It is important to note
    that the preprocessing time must be MUCH smaller than the 
    typical rendering time.  So far this has been true, and 
    even more so for animation.

    O(n) in my writing (explicitly in print) means linear in the
    number of objects in the scene.  Obviously, it is quite likely
    that the asymptotic time complexity (a.t.c.) of any ray tracing algorithm
    will be different for the number of rays.  Excluding ray coherence
    methods and hybrid z-buffer/ray tracing methods, the current
    a.t.c. is O(n) in the number of rays for ray tracing.  Actually, I
    think it is the same for these other methods because the hybrid
    methods just eliminate some of the rays from consideration and
    leave the rest to be just as hard and the coherence methods don't
    eliminate more than, say, 1/2 of the rays, I think.  In any event,
    for the a.t.c. to become sub-linear, there can be no such fraction,
    right?

    About space tracing: I think that I said that finding the 
    closest intersection is an O(log n) problem.  I agree, though,
    that that statement is not completely correct.  Bucket sort
    methods, for example, can reduce the a.t.c. below log n.  Also,
    global sort time (preprocessing) can distribute some of the
    computation across all rays, which can reduce the time complexity.
    What about the subdivide on the fly methods?  (e.g:
    Arvo and Kirk)  How do they fit in the scheme of things?
    I think your evaluation of the space tracing methods is correct,
    though the space complexity becomes important here, too.  Also,
    given a "full" space (like Fujimoto's demos,) the time complexity
    is smaller.  That leads to the question, "What if the time complexity
    of an algorithm depends on its input data?" Standard procedure is
    to evaluate "worst case" complexity, but we are probably interested
    in "average case" or more likely, "typical case."  Also, it would
    be worthwhile and interesting to understand which algorithms do
    better with which type of data.  We need to quantify this answer
    when trying to find good hybrid schemes.  (The next generation?)

    At SIGGRAPH '87 we had a round table and each answered the question,
    "what would you like to see happen to ray tracing in the next year."
    My choice was to see something proven about ray tracing.  It sounds
    like you are interested in that too.  Any takers?

--------------------------------------------------------------------

More comments on tight fitting octrees for quadrics
{followup to Ruud Waij's question last issue}

From Jeff Goldsmith:

	With respect to the conversation about octree testing, I've only
    done one try at that.  I just tested 9 points against the implicit
    representation of the surface.  (8 corners and the middle.)  I didn't
    use it for ray tracing (I even forget what for) but I suspect that 
    antialiasing will hide most errors generated that way.

	Jim Blinn came up with a clever way to do edges and minima/
    maxima of quadric surfaces using (surprise) homogeneous coordinates.
    I don't think there ever was a real paper out of it, but he published
    a tutorial paper in the Siggraph '84 tutorial notes on "The Mathematics
    of Computer Graphics."  That technique works for any quadric surface
    (cylinders aren't bounded, though) under any homogeneous transform
    (including perspective!)  He also talks about how to render these
    things using his method.  I tried it; it works great and is 
    incredibly fast.  I didn't implement many of his optimizations and
    can draw a texture mapped cylinder (no end caps) that fills the
    screen (512x512) on a VAX 780 in under a minute.

	As to how this applies to ray tracing, he gives a method for
    finding the silhouette of a quadric as well as minima and maxima.
    It allows for easy use of forward differencing, so should be fast
    enough to "render" quadrics into an octree.

	Bob Conley did a volume-oriented ray tracer for his thesis.
    I don't remember the details, but there'll be a long note about
    it that I'll pass on.  He mentions that his code can do index
    of refraction varying over position.  He uses a grid technique
    similar to Fujimoto's.   

---------------------------------------------------------------

From Jim Arvo:

    Just when you thought we had moved from octrees on to other things...
This just occurred to me yesterday. (Actually, that was several days ago.
This mail got bounced back to me twice now.  More e-mail gremlins I guess.)


LINEAR-TIME VOXEL WALKING FOR OCTREES
-------------------------------------

Here is a new way to attack the problem of "voxel walking" in octrees (at 
least I think it's new).  By voxel walking I mean identifying the successive
voxels along the path of a ray.  This is more for theoretical interest than
anything else, though the algorithm described below may actually be 
practical in some situations.  I make no claims about the practicality, 
however, and stick to theoretical time complexity for the most part.

For this discussion assume that we have recursively subdivided a cubical
volume of space into a collection of equal-sized voxels using a BSP tree
 -- i.e. each level imposes a single axis-orthogonal partitioning plane.  
The algorithm is much easier to describe using BSP trees, and from the point
of view of computational complexity, there is basically no difference 
between BSP trees and octrees.  Also, assuming that the subdivision has been
carried out to uniform depth throughout simplifies the discussion, but is by
no means a prerequisite.  This would defeat the whole purpose because we all
know how to efficiently walk the voxels along a ray in the context of 
uniform subdivision -- i.e. use a 3DDDA.

Assuming that the leaf nodes form an NxNxN array of voxels, any given ray 
will pierce at most O(N) voxels.  The actual bound is something like 3N,
but the point is that it's linear in N.  Now, suppose that we use a 
"re-traversal" technique to move from one voxel to the next along the ray.
That is, we create a point that is guaranteed to lie within the next voxel
and then traverse the hierarchy from the root node until we find the leaf
node, or voxel, containing this point.  This requires O( log N ) operations.
In real life this is quite insignificant, but here we are talking about the
actual time complexity.  Therefore, in the worst case situation of following
a ray through O( N ) voxels, the "re-traversal" scheme requires O( N log N )
operations just to do the "voxel walking."  Assuming that there is an upper
bound on the number of objects in any voxel (i.e. independent of N), this 
is also the worst case time complexity for intersecting a ray with the
environment.

In this note I propose a new "voxel walking" algorithm for octrees (call it
the "partitioning" algorithm for now) which has a worst case time complexity
of O( N ) under the conditions outlined above.  In the best case of finding
a hit "right away" (after O(1) voxels), both "re-traversal" and 
"partitioning" have a time complexity of O( log N ).  That is:

                    BEST CASE: O(1) voxels    WORST CASE: O(N) voxels
                    searched before a hit.    searched before a hit.
                  +---------------------------------------------------+
                  |                                                   |
  Re-traveral     |     O( log N )               O( N Log N )         |
                  |                                                   |
  Partitioning    |     O( log N )               O( N )               |
                  |                                                   |
                  +---------------------------------------------------+

The new algorithm proceeds by recursively partitioning the ray into little
line segments which intersect the leaf voxels.  The top-down nature of the 
recursive search ensures that partition nodes are only considered ONCE PER 
RAY.  In addition, the voxels will be visited in the correct order, thereby 
retaining the O( log N ) best case behavior.

Below is a pseudo code description of the "partitioning" algorithm.  It is
the routine for intersecting a ray with an environment which has been 
subdivided using a BSP tree.  Little things like checking to make sure
the intersection is within the appropriate interval have been omitted.
The input arguments to this routine are:

    Node : A BSP tree node which comes in two flavors -- a partition node 
           or a leaf node.  A partition node defines a plane and points to
           two child nodes which further partition the "positive" and 
           "negative" half-spaces.  A leaf node points to a list of 
           candidate objects.

    P    : The ray origin.  Actually, think of this as an endpoint of a 3D 
           line segment, since we are constraining the "ray" to be of finite
           length.

    D    : A unit vector indicating the ray direction.

    len  : The length of the "ray" -- or, more appropriately, the line 
           segment.  This is measured from the origin, P, along the 
           direction vector, D.

The function "Intersect" is initially passed the root node of the BSP tree,
the origin and direction of the ray, and a length, "len", indicating the 
maximum distance to intersections which are to be considered.  This starts
out being the distance to the far end of the original bounding cube.

============================================================================

FUNCTION Intersect( Node, P, D, len ) RETURNING "results of intersection"

    IF Node is NIL THEN RETURN( "no intersection" )

    IF Node is a leaf THEN BEGIN

        intersect ray (P,D) with objects in the candidate list

        RETURN( "the closest resulting intersection" )

        END IF

    dist := signed distance along ray (P,D) to plane defined by Node

    near := child of Node in half-space which contains P

    IF 0 < dist < len THEN BEGIN  /* the interval intersects the plane */

        hit_data := Intersect( near, P, D, dist )

        IF hit_data <> "no intersection" THEN RETURN( hit_data )

        Q := P + dist * D   /* 3D coords of point of intersection */

        far := child of Node in half-space which does NOT contain P

        RETURN( Intersect( far, Q, D, len - dist ) )

        END IF

    ELSE RETURN( Intersect( near, P, D, len ) ) 

    END

============================================================================

As the BSP tree is traversed, the line segments are chopped up by the
partitioning nodes.  The "shrinking" of the line segments is critical to 
ensure that only relevent branches of the tree will be traversed.

The actual encodings of the intersection data, the partitioning planes, and
the nodes of the tree are all irrelevant to this discussion.  These are 
"constant time" details.  Granted, they become exceedingly important when
considering whether the algorithm is really practial.  Let's save this
for later.

A naive (and incorrect) proof of the claim that the time complexity of this
algorithm is O(N) would go something like this:

    The voxel walking that we perform on behalf of a single ray is really
    just a search of a binary tree with voxels at the leaves.  Since each 
    node is only processed once, and since a binary tree with k leaves has
    k - 1 internal nodes, the total number of nodes which are processed in 
    the entire operation must be of the same order as the number of leaves.
    We know that there are O( N ) leaves.  Therefore, the time complexity 
    is O( N ).

But wait!  The tree that we search is not truly binary since many of the 
internal nodes have one NIL branch.  This happens when we discover that the 
entire current line segment is on one side of a partitioning plane and we
prune off the branch on the other side.  This is essential because there
are really N**3 leaves and we need to discard branches leading to all but 
O( N ) of them.  Thus, k leaves does not imply that there are only k - 1 
internal nodes.  The quention is, "Can there be more than O( k ) internal 
nodes?".

Suppose we were to pick N random voxels from the N**3 possible choices, then 
walk up the BSP tree marking all the nodes in the tree which eventually lead
to these N leaves.  Let's call this the subtree "generated" by the original
N voxels.  Clearly this is a tree and it's uniquely determined by the leaves.
A very simple argument shows that the generated subtree can have as many as 
2 * ( N - 1 ) * log N nodes.  This puts us right back where we started from,
with a time complexity of O( N log N ), even if we visit these nodes only 
once.  This makes sense, because the "re-traversal" method, which is also 
O( N log N ), treats the nodes as though they were unrelated.  That is, it 
does not take advantage of the fact that paths leading to neighboring 
voxels are likely to be almost identical, diverging only very near the 
leaves.  Therefore, if the "partitioning" scheme really does visit only 
O( N ) nodes, it does so because the voxels along a ray are far from random.
It must implicitly take advantage of the fact that the voxels are much more
likely to be brothers than distant cousins.

This is in fact the case.  To prove it I found that all I needed to assume 
about the voxels was connectedness -- provided I made some assumptions
about the "niceness" of the BSP tree.  To give a careful proof of this is
very tedious, so I'll just outline the strategy (which I *think* is 
correct).  But first let's define a couple of convenient terms:

1)  Two voxels are "connected" (actually "26-connected") if they meet at a 
    face, an edge, or a corner.  We will say that a collection of voxels is
    connected if there is a path of connected voxels between any two of them.

2)  A "regular" BSP tree is one in which each axis-orthogonal partition 
    divides the parent volume in half, and the partitions cycle: X, Y, Z, X,
    Y, Z, etc. (Actually, we can weaken both of these requirements 
    considerably and still make the proof work.  If we're dealing with 
    "standard" octrees, the regularity is automatic.)

Here is a sequence of little theorems which leads to the main result:

    THEOREM 1:  A ray pierces O(N) voxels.

    THEOREM 2:  The voxels pierced by a ray form a connected set.

    THEOREM 3:  Given a collection of voxels defined by a "regular" BSP 
                tree, any connected subset of K voxels generates a unique 
                subtree with O( K ) nodes.

    THEOREM 4:  The "partitioning" algorithm visits exactly the nodes of 
                the subtree generated by the voxels pierced by a ray.  
                Furthermore, each of these nodes is visited exaclty once 
                per ray.

    THEOREM 5:  The "partitioning" algorithm has a worst case complexity 
                of O( N ) for walking the voxels pierced by a ray.

Theorems 1 and 2 are trivial.  With the exception of the "uniqueness" part, 
theorem 3 is a little tricky to prove.  I found that if I completely removed
either of the "regularity" properties of the BSP tree (as opposed to just
weakening them), I could construct a counterexample.  I think that 
theorem 3 is true as stated, but I don't like my "proof" yet.  I'm looking 
for an easy and intuitive proof.  Theorem 4 is not hard to prove at all.  
All the facts become fairly clear if you see what the algorithm is doing.  
Finally, theorem 5, the main result, follows immediately from theorems 1 
through 4.


SOME PRACTICAL MATTERS:

Since log N is typically going to be very small -- bounded by 10, say -- 
this whole discussion may be purely academic.  However, just for the heck 
of it, I'll mention some things which could make this a (maybe) 
competative algorithm for real-life situations (in as much as ray tracing 
can ever be considered to be "real life").

First of all, it would probably be advisable to avoid recursive procedure
calls in the "inner loop" of a voxel walker.  This means maintaining an 
explicit stack.  At the very least one should "longjump" out of the 
recursion once an intersection is found.

The calculation of "dist" is very simple for axis-orthogonal planes, 
consisting of a subtract and a multiply (assuming that the reciprocals of 
the direction components are computed once up front, before the recursion 
begins).

A nice thing which falls out for free is that arbitrary partitioning 
planes can be used if desired.  The only penalty is a more costly distance
calculation.  The rest of the algorithm works without modification.  There 
may be some situations in which this extra cost is justified.

Sigh.  This turned out to be much longer than I had planned...

>>>>>> A followup message:

Here is a slightly improved version of the algorithm in my previous mail.
It turns out that you never need to explicitly compute the points of 
intersection with the partitioning planes.  This makes it a little more
attractive. 

                                                                 -- Jim


FUNCTION BSP_Intersect( Ray, Node, min, max ) RETURNING "intersection results"

BEGIN

    IF Node is NIL THEN RETURN( "no intersection" )

    IF Node is a leaf THEN BEGIN  /* Do the real intersection checking */

        intersect Ray with each object in the candidate 
        list discarding those farther away than "max." 

        RETURN( "the closest resulting intersection" )

        END IF

    dist := signed distance along Ray to plane defined by Node

    near := child of Node for half-space containing the origin of Ray

    far  := the "other" child of Node -- i.e. not equal to near.

    IF dist > max OR dist < 0 THEN  /* Whole interval is on near side. */

        RETURN( BSP_Intersect( Ray, near, min, max ) )

    ELSE IF dist < min THEN  /* Whole interval is on far side. */

        RETURN( BSP_Intersect( Ray, far , min, max ) )

    ELSE BEGIN  /* the interval intersects the plane */

        hit_data := BSP_Intersect( Ray, near, min, dist ) /* Test near side */

        IF hit_data indicates that there was a hit THEN RETURN( hit_data )

        RETURN( BSP_Intersect( Ray, far, dist, max ) )  /* Test far side. */

        END IF

    END


------------------------------------------------------------------------

Some people turn out to be on the e-mail mailing list but not the hardcopy
list for the RT News.  In case you don't get the RT News in hardcopy form, I'm
including the Efficiency Tricks article & the puzzle from it in this issue.


Efficiency Tricks, by Eric Haines
---------------------------------

Given a ray-tracer which has some basic efficiency scheme in use, how can we
make it faster?  Some of my tricks are below - what are yours?

[HBV stands for Hierarchical Bounding Volumes]

Speed-up #1:  [HBV and probably Octree] Keep track of the closest intersection
distance.  Whenever a primitive (i.e. something that exists - not a bounding
volume) is hit, keep its distance as the maximum distance to search.  During
further intersection testing use this distance to cut short the intersection
calculations.

Speed-up #2:  [HBV and possibly Octree] When building the ray tree, keep the
ray-tree around which was previously built.  For each ray-tree node, intersect
the object in the old ray tree, then proceed to intersect the new ray tree.
By intersecting the old object first you can usually obtain a maximum distance
immediately, which can then be used to aid Speed-up #1.

Speed-up #3:  When shadow testing, keep the opaque object (if any) which
shadowed each light for each ray-tree node.  Try these objects immediately
during the next shadow testing at that ray-tree node.  Odds are that whatever
shadowed your last intersection point will shadow again.  If the object is hit
you can immediately stop testing because the light is not seen.

Speed-up #4:  When shadow testing, save transparent objects for later
intersection.  Only if no opaque object is hit should the transparent objects
be tested.

Speed-up #5:  Don't calculate the normal for each intersection.  Get the
normal only after all intersection calculations are done and the closest object
for each node is know: after all, each ray can have only one intersection point
and one normal.  (Saving intermediate results is recommended for some
intersection calculations.)

Speed-up #6:  [HBV only] When shooting rays from a surface (e.g. reflection,
refraction, or shadow rays), get the initial list of objects to intersect
from the bounding volume hierarchy.  For example, a ray beginning on a sphere
must hit the sphere's bounding volume, so include all other objects in this
bounding volume in the immediate test list.  The bounding volume which
is the father of the sphere's bounding volume must also automatically be hit,
and its other sons should automatically be added to the test list, and so on
up the object tree.  Note also that this list can be calculated once for any
object, and so could be created and kept around under a least-recently-used
storage scheme.

------------------------------------------

A Rendering Trick and a Puzzle, by Eric Haines
----------------------------------------------

One common trick is to put a light at the eye to do better ambient lighting.
Normally if a surface is lit by only ambient light, its shading is pretty
crummy.  For example, a non-reflective cube totally in shadow will have all of
its faces shaded the exact same shade - very unrealistic.  The light at the eye
gives the cube definition.  Note that a light at the eye does not need shadow
testing - wherever the eye can see, the light can see, and vice versa.

The puzzle:  Actually, I lied.  This technique can cause a subtle error.  Do you
know what shading error the above technique would cause? [hint: assume the Hall
model is used for shading].

---------------------------------------------------------------------------

USENET roundup:

Other than a hilarious set of messages begun when Paul Heckbert's Jell-O (TM)
article was posted to USENET, and the perennial question "How do I find if a
point is inside a polygon?", not much of interest.  However, I did get a copy
of the errata in _Procedural Elements for Computer Graphics_ from David Rogers.

I updated my edition (the Second) with these corrections, which was generally
a time drain: my advice is to keep the errata sheets in this edition, checking
them only if you are planning to use an algorithm.  However, the third edition
corrections are mercifully short.


From: "David F. Rogers" <rochester!harvard!USNA.MIL!dfr@cornell.UUCP>
From:     David F. Rogers  <dfr@USNA.MIL>
Subject:  PECG correction
Date:     Thu, 10 Mar 88 13:21:11 EST

Correction list for PECG  2/26/86
David F. Rogers

There have been 3 printings of this book to date.
The 3rd printing occurred in approximately March 85.

To see if you have the 3rd printing look on page 386,
3rd line down and see if the word magenta is spelled
correctly.  If it is, you have the 3rd printing. If not, then
you have the 2nd or 1st printing.

To see if you have the 2nd printing look on page 90.  If
the 15th printed line in the algorithm is

  while Pixel(x,y) <> Boundary value

you have the 2nd printing.  If not you have the 1st printing.

Please send any additional corrections to me at

Professor David F. Rogers
Aerospace Engineering Department
United States Naval Academy
Annapolis, Maryland 21402

uucp:decvax!brl-bmd!usna!dfr
arpa:dfr@usna
_____________________________________________________________

Known corrections to the third printing:

Page	Para./Eq.	Line	Was		Should be

72	2		11	(5,5)		(5,1)
82	1 example	4	(8,5)		delete
100	5th equation		upper limit on integral should be 2
				vice 1
143	Fig. 3-14		yes branch of t < 0 and t > 1 decision blocks
				should extend down to Exit-line invisible
144	Cyrus-Beck
	algorithm	7	then 3		then 4
			11	then 3		then 4

145	Table 3-7	1	value for w
				[2 1]		[-2 1]

147	1st eq. 	23	V sub e sub x j V sub e sub y j
______________________________________________________________

Known corrections to the second printing:  (above plus)

text:

19	2		5	Britian 	Britain
36	Eq. 3		10	replace 2nd + with =
47	4		6	delta' > 0      delta'< = 0
82	1		6	set		complement
99	1		6	multipled	multiplied
100	1		6	Fig. 2-50a	Fig. 2-57a
100	1		8	Fig. 2-50b	Fig. 2-57b
122	write for new page
186	2		6	Fig. 3-37a	Fig. 3-38a
186	2		9	Fig. 3-38	Fig. 3-38b
187	Ref. 3-5		to appear	Vol. 3, pp. 1-23, 1984
194	Eq. 1			xn +		xn -
224	14 lines from bottom	t = 1/4 	t = 3/4
329	last eq.		-0.04		-0.13
	next to last eq.	-0.04 twice	-0.13 twice
	3rd from bottom 	0.14		-0.14
330	1st eq. 		-0.09		-0.14
	2nd eq. 		-0.09		-0.14
	3rd eq. 		-0.17		-0.27
	4th eq. 		0.36		0.30
				5.25		4.65
	last eq.		5.25		4.65
332			4	beta <		beta >
			6	beta <		beta >
355	2nd eq. 		w = s(u,w)	w = s(theta,phi)
385	2		5	magneta 	magenta
386			3	magneta 	magenta

algorithms: (send self-addressed long stamped envelope for xeroxed
	     corrections)

97	Bresenham	1	insert words  first quadrant  after modified
			10	remove ()
			12	1/2		I/2
			14	delta x 	x sub 2

117	Explicit	18	Icount = 0	delete
	clipping
			18	insert		m = Large
120			9	P'2             P'1
			12	insert after	Icount = 0
				end if
			13	insert after	1 if Icount <> 0 then
				neither end	   P' = P0
			14			removed statement label 1
			15	>=		>
			17			delete
			18			delete
			43	y>		yT>

122-124 Sutherland-	write for new pages
	Cohen

128	midpoint	4	insert after	initialize i
						i = 1
129			6	i = 1		delete
			6	insert		save original p1
						Temp = P1
			8	i = 2		i > 2
			11,12	save original.. delete
				Temp = P1
			14	add statement label 2
130			19-22	delete
			24	i = 2		i = i + 1
			29	<>		<> 0
			33	P1		P

143			3	wdotn		Wdotn
144			20	>=		>

176	Sutherland-	1	then 5		then 4
	Hodgman
177			9	4 x 4		2 x 2

198	floating	21,22	x,y		Xprev,Yprev
	horizon
199			4	Lower		Upper
200			11-19	rewrite as
				if y < Upper(x) and y > Lower(x) then Cflag = 0
				if y> = Upper(x) then Cflag = 1
				if y< = Lower(x) then Cflag = -1
			29	delete
			31	Xinc		(x2-x1)
			36	step Xinc	step 1
201			4	delete
			6	Xinc = 0       (x2-x1) = 0
			12	Y1 -		Y1 + Slope -
			12	insert after	Csign = Ysign
			13	Yi = Y1 	Yi = Y1 + Slope
			13	insert after	Xi = X1 + 1
			14-end	rewrite as	while(Csign = Ysign)
						   Yi = Yi + Slope
						    Xi = Xi + 1
						    Csign = Sign(Yi - Array(Xi))
						end while
						select nearest integer value
						if |Yi -Slope -Array(Xi - 1)| <=
						  |Yi - Array(Xi)| then
						    Yi = Yi - Slope
						    Xi = Xi -1
						end if
					     end if
					 return

258	subroutine Compute	N		i

402	HSV to Rgb	12	insert after	end if
			25	end if		delete

404	HLS to RGB	2	M1 = L*(1 - S)	M2 = L*(1 + S)
			4	M1		M2
			6	M2 = 2*L - M1	M1 = 2*L - M2
			10-12	=1		=L
			18	H		H + 120
			19	Value + 120	Value
			22	H		H - 120
			23	Value - 120	Value

405	RGB to HLS	22	M1 + M2 	M1 - M2

figures:

77	Fig. 2-39a		interchange Edge labels for scanlines 5 & 6
	Fig. 2-39b		interchange information for lists 1 & 3, 2 & 4

96	Fig. 2-57a,b		y sub i + 1	y sub(i+1)

99	Fig. 2-59		abcissa of lowest plot should be xi vice x

118	Fig. 3-4		first initialization block - add m = Large
				add F entry point just above IFLAG = -1
				decision block
119				to both IFLAG=-1 blocks add exit points to F

125	Fig. 3-5		line f - interchange Pm1 & Pm2

128	Fig. 3-6a		add initialization block immediately after Start
				initialize i, i=1

				immediately below new initialization block add
				entry point C

				in Look for the farthest vissible point from P1
				block - delete i=1

				in decision block i = 2 - change to i > 2

129	Fig. 3-6b		move return to below Save P1 , T = P1 block

				remove Switch end point codes block

				in Reset counter block replace i=2 with i=i + 1

180	Fig. 3-34b		Reverse direction of arrows of box surrounding
				word Start.

330	Fig. 5-16a		add P where rays meet surface

374	Fig. 5-42		delete unlabelled third exit from decision
				box r ray?

377	Fig. 5-44		in lowest box I=I+I sub(l (sub j)) replace
				S with S sub(j)
_________________________________________________________________________

Known corrections to the first printing:

90,91	scan line seed		write for xeroxed corrections
	fill algorithm
________________________________________________________________________
END OF RTNEWS