Castle Paradox

Mr B · Joined: 20 Mar 2003 Posts: 382

Hi guys,

I am having a bit of difficulty deciding whether to use an array or a linked list for a certain OHR application.

I have an array of 20 element sets. My game has a master script that loops continuously, checking the array during every cycle. It only needs to check the "active" elements of the array, which are probably going to be around 2-5 in general, though it can briefly explode to 18 or so.

Each element set contains the ID of a script and the clock cycle time at which the script is to be executed. Since these scripts will have all sorts of delays, there is no way to guarantee that the elements inserted last will be the first to finish.

If I cycle through the array as an array, then I may need to cycle up to the last element even if I have only one active element! This would be a waste of time.

If I make the elements into a singly-linked-list, then I will only have to traverse as many elements as are active. However, this will mean creating several additional scripts to handle insertion, deletion, pointer re-allocation, etc.

Either way, I will be having a stack of references to unused elements so that insertion doesn't require cycling.

So here is my question: since I will be having only 20 or so elements in my list, which method will take the least time to use? I hate the inefficiencies of cycling through all of those elements (even if I stop the cycling after it passes the active number of elements), but I am concerned about the time overhead of calling a large number of additional scripts.

At this point I am leaning towards a linked list, but I would appreciate your input before jumping into several hours of scripting.

Moogle1 · Posted: Sat Nov 12, 2005 9:03 pm Post subject:

Linked list will definitely be faster if you don't anticipate a large array size. If it only has 2-5 on the average case, you're looking at 50% less traversal time at least.
_________________

Slime Salad :: Independent Game Design

Inferior Minion · Posted: Sun Nov 13, 2005 12:02 am Post subject:

Mike Caron · Posted: Tue Nov 15, 2005 10:26 am Post subject:

You could always allow 20+1 elements, and then put a marker (such as -1) to mark the end of the list. Why not zero? Because the default is zero. If a script in the middle finishes early, there will be fragmentation. So if you have a map like this:

[0]: 251
[1]: 0
[2]: 12

Then 2 will never be accessed.

Vs.

[0]: 251
[1]: 0
[2]: 12
[3]: -1

Then your script will ignore 0s, and stop at -1.

Often, in other languages, I find myself storing an arbitrary number of <whatever>s, and having to return IDs. Simply going up by one every time isn't practical, since they're often stored in a flat array. So, my code usually looks something like this (in pseudo code, aObj() is the array):

Mr B · Joined: 20 Mar 2003 Posts: 382

Okay; it looks as if the advantage of linked lists completely overwhelms everything else. I am going to do a singly-linked-list, because there are only so many global variables. I will use an array for the stack, though.

Some more info on what I am doing; the nature of the game I am making is such that I will need scripts to be called with some sort of delay after a trigger (AI scripts, event coordination, etc.), but other scripts need to be processed in the mean time.

Therefore I will have a master script that will cycle continuously, each time incrementing a variable (which will double back on itself after it reaches a certain value). I will also have a linked list of global variable groups, each one containing the reference ID of a script and the cycle time at which that script is to be executed (among other things, such as values to be passed to the scripts).

At each cycle of the master script, the list will be examined to see if any scripts need to be executed.

As for the list itself, I think that I will insert the elements in terms of their execution time, from soonest to furthest off. I am also thinking of having a global variable that contains the cycle time of the next script that needs to be executed (this will be updated every time that a script is added or deleted). This way I only need to check a single variable once per cycle to see if I should even bother going through the list or not.

Ordinarily I would just examine the execution time of the first element in the list, but because of the cycling around that the cycle variables do, the smallest execution time is not always the next to be executed (if I wrap the cycle values around at 1000, and I am at 990 and have a script that needs to be executed in 20 cycles, it would be executed when the global cycle variable = 10, not 1010; however, I may have a script that needs to be executed at 998, which is sooner).

Does anyone see any pitfalls to this approach?

Inferior Minion · Posted: Tue Nov 15, 2005 1:08 pm Post subject:

It sounds like you're tryin got build a Priority Queue using a linked list. The downside is in the worst case it'll take O(n) to insert elements. Keeping things sorted should make your life easier, but I don't think the Linked List is your best solution.

Check out Heaps. They're Trees implemented using Arrays to create Priority Queues.

The following link builds a Max Heap (the root is the largest value in the tree) but a Min Heap can easily be created by flipping a few if statements:

http://www.iti.fh-flensburg.de/lang/algorithmen/sortieren/heap/heapen.htm

Implementation in HS requires an array, a variable for numElements, and the Heap Sort functions from that link. This should speed up insertion, deletion, and give you access to the raw data via the array if needed. The way I see your event scheduler working is a simple external counter, as you mentioned. You insert elements into your Priority Queue using an offset relative to that counter. Each time you increment your counter, you simply compare it against the first element in the array. When it's time to execute an event, you pop off the root element of your Priority Queue, processing each element until the root no longer matches the counter.

Checking becomes O(1), deleting becomes O(log n), insertion becomes O(log n), and if you ever really feel like traversing the array, it's O(n) because you know exactly how many elements are in the tree.

Hope that helps.

Edit: Fixed the deletion Big O, was a bit off. Also, was thinking about the external counter. If it loops, you'll screw up your heap. When your counter loops, those internal triggers will all shoot to smaller values than anything in your heap. You'll want to compare Time To Live (internal trigger minus external counter, taking looping into consideration), not simply the internal trigger values. This adds an extra calculation per compare, but should still be faster than using a Linked List or standard array implementation.
_________________

TMC · Posted: Wed Nov 16, 2005 4:12 am Post subject:

Horray, complicated uses of HS! :D:D:D

But Mike, why would we want an expanding array here? It's sort of irrelevant (and dangerous to use... at least, I tend to squeeze in my arrays as tightly and haphazardly as possible)

Yeah, there is no point doing this if you are going to only sort depending on the counter value instead of calculating time to live. I assume that each event needs to be only called once for one tick. If an event needs to be called every tick for 10, say, then things change and IM might correct me, but I don't think heaps will efficiently handle that (no deleting means recursive traversion). So, what exactly did you mean by "active elements"?

However, moving globals around in HS is slow, there is no command to do so (I've been meaning to implement one) which means it has to be done manually. So it would be a bad idea to use the heap idea directly. You could instead have a heap with pointers to the elements in the array along with activation time, an array of 20 elements, and a stack of unused elements. This is getting a little complicated and global hungry.

Incase you do want to use a heap, and because I'd never implemented one before (especially not in HS!) and wanted to have some fun, I got carried away and wrote these scripts (which are entirely not bug proof but 90% logic proof):

Inferior Minion · Posted: Wed Nov 16, 2005 9:39 am Post subject:

Mike Caron · Posted: Wed Nov 16, 2005 10:31 am Post subject:

Hmm, I suppose that my idea could be intepreted as a linked list.

But, I inteded it to be a means of speeding up a fixed size array. In his very first post, he says that he will only use 2-5 elements, with around 18 being a heavy-usage situation.

So, using that crazy big O notation, an insertion would be between O(1) and O(n), depending on how much fragmentation there is, a deletion would be O(1) (you already know where it is, just set it to zero), and accessing it would be O(1), for the same reason.

Unless, of course, multiplication gets slower depending on how many elements there are >_^
_________________
I stand corrected. No rivers ran blood today. At least, none that were caused by us.

Final Fantasy Q
OHR Developer BLOG
Official OHRRPGCE Wiki and FAQ

Mr B · Joined: 20 Mar 2003 Posts: 382

Wow..! I haven't thought about heaps for a year.

I think that I will end up using a linked list for this particular problem. The heap looks really attractive, but in this particular case I don't think that it will offset the expense of additional variables and scripting complication. I don't know exactly how many actions I will end up storing (~2-5 on average was just a random guess), but I don't anticipate having so many as would make a heap shine.

I intend to have a global variable containing the cycle value at which the next most recent action is to be executed (updated for additions and deletions), which should allow me to avoid cycling through the list when there are no scripts to be executed.

I considered swapping variables and and out of other save slots, but I wasn't confident about the time delay. If it is as low as you seem to be saying, I may very well have to look in to it (considering how much data is popping up in this project).

Thanks for all of this data. I wasn't really certain whether such a thing could be done in the OHR, but this really boosts my confidence.

So um yeah. Thanks for taking the trouble for all of this research -- I really appreciate it.

TMC · Posted: Wed Nov 16, 2005 5:06 pm Post subject:

Inferior Minion · Posted: Wed Nov 16, 2005 5:46 pm Post subject:

Mr B · Joined: 20 Mar 2003 Posts: 382

Wow...my head is spinning.

How can an unsorted linked list consistently require fewer actions than a sorted one?

The heap sounds good, but I don't know if it is taking into account my specific needs. Let me run over what I am thinking of doing with the linked list.

1.) The linked list will contain elements. Each element will be a set of global variables. The variables will contain A) a pointer to the next element in the list (or 0/-1 if the end), B) a cycle stamp showing the cycle it will be executed at, C) the ID of the script to be executed, D,E,F,G) four variables with data specific to the script being called.

2.) The linked list will contain x elements sorted by execution time; the element at the top of the list has the lowest cycle signature, and the one at the end has the largest. The element with the lowest cycle signature is not necessarily the element to be executed next, due to integer wrapping (or whatever it's called). However, the element to be executed next will almost always be at or near the top. The element at the bottom will almost always be the last one to be executed.

3.) There is a global variable that points to the first element in the list -- the one with the lowest cycle stamp (but not necessarily the one to be executed next).

4.) There is a global variable that points to the final element in the list -- the one with the highest cycle stamp. Whenever an element is inserted, it is checked against the final element variable to see if it is larger. If so, it is inserted after the final element and then the pointer is advanced.

5.) There is a global variable that registers the cycle at which the next action/element will be executed. This allows the scripts to completely bypass searching through the list if there is no script to be executed that cycle. This variable is updated every time an action is inserted or deleted.

6.) Whenever the scripts determine that an action/element needs to be executed, it begins at the first element in the list and continues/executes/deletes until it reaches an element with a cycle stamp larger than the current cycle.

It seems to me that the only time a heap would have an advantage is when integer wrapping occurs -- other than that, the execution process is linear right down the list, right?

Inferior Minion · Posted: Thu Nov 17, 2005 7:27 pm Post subject:

Mr B · Joined: 20 Mar 2003 Posts: 382

You're right; it should be sorted by TTL instead of the cycle value. However, if integer wrapping were ignored, the cycle value would be identical to the TTL, right?