A few weeks ago I had an email conversation with Dale Innis about llTargetOmega support in OpenSim. This script function lets you set the angular velocity on a prim, which the client then interprets and displays spinning objects. It is not guarunteed to be synchronized between all clients, but it provides a rather useful visual effect regardless.
llTargetOmega didn't work for us a week ago, which confused me, as I saw that in the LSL portion of our code it was doing exactly the right thing and setting the angular velocity correctly. I should work, but it didn't. In the lack of it working people were setting fast timers that pushed out rotation updates. This caused a lot of extra load on the server, and was really the wrong approach for this.
Take 1: Terse Updates
OpenSim has 2 paths to sending information about Prims to the client (we'll get to the first one later). Terse Updates are a small update packet that contains just a bit of information on updated textures and some of the vectors used to establish prim position, velocity, acceleration, rotation, and angular velocity. When a prim is updated in the environment, Terse Updates are used to tell all the other clients about that change. One of the heavy users of the Terse Update path is the physics engine, as all the vectors the physics engine changes are in there. We've seen a lot of work on the Terse Update path as physics have gotten more and more tested.
On Tuesday I finally dug in and traced our Terse Update path, and found an interesting thing. When the object was physical (i.e. movement coming out of the physics engine) we did the right thing for Terse Updates. When it wasn't, we hard coded all the velocities to zero. So even if things were rotating, any time we sent an update we'd stop them.
Date: 2008-05-06 15:17:00 -0700 (Tue, 06 May 2008)
New Revision: 4543
send actual velocity and angular velocity in terse updates
instead of hardcoding to zero when the primitive is non physical.
llTargetOmega should work now.
Ok, so life is good, the issue is fixed, and we move on.
Except... it wasn't.
CSI: OpenSim, getting to the bottom of this
At this point a whole bunch of people on the IBM side jumped in. Mike Osias had a build that was on it's knees due to use of fake rotation, so he had all the good test cases, and opened mantis 1166. I'm not a scripter, so I needed some examples to know what should work. Alan Webb started to dive in and try to figure what was going on as well. I figured I'd spend an hour on it to try to figure out where things were at before getting back to avatar appearance bits.
After abount an hour Alan and I started comparing notes. The code in this area is extra confusing because we've got 2 vectors for angular velocity. An, no, they aren't actually different in any real way. Lots of people have tried to rationalize that they do different things, but they don't. This is cruft, and is part of what happens in an organically growing open project. The AngularVelocity / RotationalVelocity thing an opensim appendix, and should be surgically removed at some time in the near future.
But the behavior was even odder. I could set llTargetOmega on an object, and it wouldn't move. Then I'd touch it, and it would. I got Mike into my test environment and was looking at a spinning cube.
"Ok, you see that cube spinning?"
I grab it and move it. "What about now?"
"Yes, spinning now."
At this point I was confused a lot. Why would that be?
Take 2: Full Updates
I said there were 2 ways of a client finding out about prims, and this gets us back to the first one. In addition to Terse Update, there is what we call Full Updates, which are really just the full prim definition being sent down the wire. This is everything we know about the prim. This packet is also marked as reliable, to make sure the client doesn't drop it (terse updates are droppable).
And now we get back to organic code bases. One of the big activities since October was working physics in opensim. Lots and lots of work were spent on Terse Updates. Very little work was spent on full updates. It turned out that Full Updates were always hardcoding all the motion vectors to zero. The SendPrimitiveToClient function predates both physics and scripting by months. In a pre-physical opensim world passing the motion vectors didn't make any sense, as there wasn't anyway to set those values. The code worked well, so no one was really looking at it again, at least not in this specific area.
TerseUpdates (sent on minor prim movement) would make things spin. Full Updates (sent on initial prim rez, or after calls to osSetDynamicTextureURL) stopped the spinning. My earth projector turned out to be the perfect test case for this once I added rotation to the globe.
Originally I was going to punt on this and leave it to someone else, but then the thrill of the chase got to be too much. But there was one problem. This information is sent to the client in a 60 byte array, with basically undocumented positions. It was easy to fix terse updates because someone had already sorted it out, and I just needed to copy the decoding pattern there. For FullUpdates, it was more of a trial and error approach, represented by a series of checkins, reverts, and new attempts.
You know what happens when you get that array wrong? Spectacular fail. 3/4 of prims aren't in the right place, and touching an image board (user of osSetDynamicTextureURL) makes it fly away to some other part of your sim. Maybe in space. I eventually figured out a workable serialization:
Date: 2008-05-07 12:44:22 -0700 (Wed, 07 May 2008)
New Revision: 4566
seriously hope this gives us rotation and rotational velocity
As you can see, I was getting a little punchy on changelog entries.
So we're done and fixed, and back to work.... well not quite.
Take 3: Deselected Objects
When you edit an object the client stops it's motion, as nothing would be more evil than trying to edit an object that is flying away from you at 60 m/s. When you've deselected the object it tells the server. But the object is stopped. The client needs to be told again that it is spinning. I got that critical information from melanie on IRC, which was enough to pass on the buck.
We had Mike almost working, and Mike is no slouch on our code base (he's sent in a couple dozen patches in the past), so I flipped this one back to him with "we're almost done, but you'll need to find the right place in the deselect path to generate a Terse Update. Then I think we've got full llTargetOmega support."
A day later Mike sent in this final patch:
Date: 2008-05-08 05:48:29 -0700 (Thu, 08 May 2008)
New Revision: 4585
From: Michael Osias <firstname.lastname@example.org>
Patch to schedule terse update on deselect, specifically so llTargetOmega
sets rotational velocity on deselect.
This should complete our llTargetOmega support and fix:
And now. For real. llTargetOmega works.
Avartar Appearance as a User Service isn't coming this week, sorry folks. The above epic took much of my hacking time this week. It was a pretty solid educational experience for me in the way we actually communicate the contents of the Scene to the client, which was good to learn after a year on the project. 🙂
Something else to take away from this. Lot's of focus is currently on the OpenSim scripting implementation, as it should be, as that's a huge user visible portion of our function. llTargetOmega it self is < 6 lines of implementation. But our supporting scene model needed some work to actually get that info to the client.
I get asked all the time "how long until my favorite feature X is implemented", and the answer is always an unsatisfying (to me and them), "I'm not sure". Sometimes the plumbing is already there, and it's quick. Other times we're doing deep dives into our code base to implement what seems to the user to be a very simple function.
We're making constant forward progress, I'd even say rapid constant forward progress, but patience is always a good thing. Also, if you want OpenSim to work for whatever you application is, you should be trying to use it now and filing bug reports. That's how we function, personal itches, and knocking of mantis reports. Any ability that you have to narrow the bug to a specific section of code (even if you don't have a fix), helps a lot as well, as it removes possibly hours of core developer time trying to track down where things fail.