This really pains me because I love using foreach. @AngryMoose, on the other hand, will probably rejoice :)
From Gama:
From Gama:
So while it should be ok to foreach, thanks to a bug in Unity's (very ooooold) Mono compiler version it really isn't unless you are compiling (not editing) your code in Visual Studio, and that's a pretty limiting restriction:
Matthew Hanlon pointed my attention to an unfortunate (yet also very interesting) discrepancy between Microsoft's current C# compiler and the older Mono/C# compiler that Unity uses 'under the hood' to compile your scripts on-the-fly. You probably know that you can use Microsoft Visual Studio to develop and even compile Unity/Mono compatible code. You just drop the respective assembly into the 'Assets' folder. All code is then executed in a Unity/Mono runtime environment. However, results can still differ depending on who compiled the code! foreach loops are just such a case, as I've only now figured out. While both compilers recognize whether a collection's GetEnumerator() returns a struct or a class, the Mono/C# has a bug which 'boxes' (see below, on boxing) a struct-enumerator to create a reference type.
So should you avoid foreach loops?
Don't use them in C# code that you allow Unity to compile for you.
Do use them to iterate over the standard generic collections (List<T> etc.) in C# code that you compile yourself with a recent compiler. Visual Studio as well as the free .NET Framework SDK are fine, and I assume (but haven't verified) that the one that comes with the latest versions of Mono and MonoDevelop is fine as well.
Comments
I am interested, however, to see the measured performance penalty, as opposed to the theoretical "it's bad". There are lots of performance myths flying around in the Unity world.
Also, there is some limitations with the AOT compiler (used to compile C# code for iOS) that prevents certain list operations to be executed on generic lists which contains value types (for instance, Contains, Remove, etc. crash at runtime). To overcome this, I rolled out my own implementation of IList<...> (in my case, most of the functions delegate to an underlying list; I just provided alternative implementations for the methods that don't work). This is fairly easy to do, and I'd rather use my own list than forgo foreach, especially if you have an implementation at hand already....
Or you could write an extension method to get the right enumerator:
foreach(var item in list.SafeEnum) { ... } //or is this risky??
EditThe article says LINQ does not work on iOS. This is mostly false. Certain things do not work, but that has to do with the underlying collections rather than LINQ itself. Our grid library use LINQ extensively. (We had to do some work to get certain algorithms and methods working, but again, it was the collections and extremely generic code, not LINQ).
Bottom line, from experience, avoid anything and everything that generates garbage during interactive gameplay sections. Setup time and even pause menu areas are usually fine to go nuts with garbage code if it saves you time. Call a GC.Collect() whenever players wont notice a stutter (ie. when you pause the game, when you change pages in the menu, etc.) to be proactive about garbage build up and minimize future collects.
The biggest thing to avoid is any sort of String work at runtime... even just calling .toString() to get the player's current score every frame can land you in crazy garbage trouble quickly, so watch you!
Unity does have a great suite of profilers for you to make use of, especially WRT garbage generate every frame, so if you make sure of them it's ridiculously easy to track down and address those issues when you need to.
Also, those recent Gamasutra articles on Unity memory management cover a lot of decent practices and are a good staring place if you are having issues with using too much memory or too many garbage collect, but there are tons more tricks than what they cover, so keep your brains open!
I think articles such as this one is useful to understand what's going on and to know where to look when things go bad. However, I am a bit skeptical about following such practices in general, mostly because they don't tell how how bad something is compared to other bad things. (I mean, how many programmers still cache Unity's transforms because it is supposedly slow?)
Doing something extreme to avoid a little garbage (compared to overall garbage) does not seem so useful. (Of course, avoiding foreach is not very extreme). Avoiding something that genertes lots of garbage (compared to overall garbage) is indeed useful.
But I 100% agree with using profilers; that takes care of it!
On the amount of garbage this generates, it's not about lots of garbage, it's that every little bit adds up. The additional allocations from boxing may seem tiny, but the problem isn't their size but that they add one more thing the creaky old mono GC has to check. The more items the GC has to check, the longer the pause when it does run.
So yeah, while I agree you shouldn't do extreme things to avoid GC until you know you have an issue, If there's an easy rule of thumb that will help you avoid garbage generation like "don't use foreach in game-time code", it's worth sticking to it. Yes you could go and generate an alternative implementation, but without testing it explicitly you don't know that you won't have the same problem. And in my experience rolling your own version of something that already in the library but just missing one tiny little feature that you can live without is almost always the wrong choice.
On strings and garbage, something interesting that was pointed out in the article as well, was that string.Format is also a culprit. Because it boxes any value types that get passed to it. So really, the most garbage-efficient way to build up a string is using a StringBuilder and it's typed .Append methods.
Or shouldn't I worry about this until I'm much more pro? :|
http://feedback.unity3d.com/suggestions/update-built-in-mono-compiler-and-runtime-to-a-newer-version
Anyways, @mattbenic, what you say is sane. I did read the article with the data he reports, but what is 12 KB per frame? How much does a game typically generate? For me, that is just a number. For all I know a game could generate GBs of garbage per frame. That is what I mean with the utility with these types of articles. Yes, maybe I know it's probably not GBs... but other may not, and then blindly follow the rules of thumb, which in many games simply don't matter.
But again, I agree with you intellectually, but I think we have slightly different different philosophies. (The idea to not use LINQ makes me want jump into a black hole :P)
Basically a generational GC maintains a nursery pool of objects that is much smaller than the entire working memory of the program. This means most of the frame to frame cruft can be quickly GC'd (without stutter) and the long lived objects are deferred to a full blown GC at a later stage. Whilst I've never used a generational GC I imagine this would make the whole class of problems we are talking about here pretty irrelevant.
Also I think in line with what @hermantulleken is saying, it's always worth asking if something happens per frame or just once. Usually per frame can change any equation drastically. GUI with literal strings declared everywhere equals ALL THE GARBAGE.
http://feedback.unity3d.com/suggestions/scripting-garbage-collection-sg
As such, while I agree it's awesome, I prefer to only use it in setup and load time-basically places where I expect there to be tons of allocation already. But in those places, hell yeah it's amazing to use :)
A question for you @HermanTulleken: If you're such a big LINQ user in grids, have you perhaps profiled it for garbage allocation in one of your 30/30 games? I'd be very curious to see the results of that. Missed that because I didn't search for garbage, only for compiler and runtime. Doh ;) Voted.
Unity does unfortunately include an input allocation you can't get rid of (on PC) and I suspect that's because they're using an input call in .Net that's known to allocate unnecessarily-might even be fixed in Unity 4.
But really, 12k per frame is pretty damned high (as @AngryMoose pointed out, that'll hit a 1 meg limit in under 2 seconds). On The Harvest we were down to zero or single to double digit byte allocations per frame-enough to almost entirely eliminate allocations between menu sessions.
I did a few quick tests: on my machine a garbage collect (taking about 4.2ms) about after 70 seconds, and there after irregularly at about 200 frames or so. Taking out the AStar algorithm did not seem to make a difference, nor did putting in a foreach loop over the grid in Update... (But of course these are not scientific tests). 4.2 ms is not something I would normally be worried about.
Also, I see something creates about 120 meshes every second, and destroys them again after 5 frames... (certainly not something we are doing... maybe the profiler renderer? Does that show up in the profiler?) I would not be surprised if that contributes to garbage more than the program language overhead. I will do some more tests when I have some more time, and let you know what I find.
I did another quick test. I took out everything (including rendering), and just had the foreach and for in Update. The foreach still caused some garbage collects (but much less frequently). For does not at all. As we of course suspect given the article. (It did surprise me however, since I tested it on a grid, not a list, and our grid does have a custom enumerator that does not box values... so something else is going on...) Also, the mesh creation is indeed the profiler (if you show less profilers, it creates fewer meshes... just for interests sake. Also, it seems to have no effect on garbage).
((Just a quick aside before anyone thinks our library is a slow garbage generator, which you may well given what I said earlier. We took great pains to optimise it where it matters. For instance, grid elements can be accessed constant time, no matter how complicated your shape or how many elements you have. We _do_ use LINQ and IEnumerables heavily, but this allows you to write fast code that is also neat. For instance, the lazy computation nature of enumerators allow you to say for instance grid.Any(p => (p - q).Magnitude < 3), which stops computation as soon as the first point is found. Another one is grid.SampleRandom(3), which returns three random points (lazily); something that can be very complicated to do with for loops given arbitrary grid shapes. And we have made points structs so that you can generate thousands without worrying about garbage (so you can say (p + q)*2 every frame, even though we had to give up inheritance to do that, and had to use our own version of List internally so that it can compile on iOS. We _do_ care about performance! Sorry for this derailment!))
Based on what you're describing, I'd love to see a talk from you at a community meet about using linq. I've started using it a bit (in tools and load time code), but it sounds like you've got to know it (and it's performance implications) quite well and could give a good intro to it. It's been a while since we had a more technical talk anyway ;)
Check the answer not so much the actual question ->
http://stackoverflow.com/questions/4044400/linq-performance-faq
For instance, using the laziness of LINQ could indeed provide performance benefits in some cases. But it is difficult. You have to design your algorithms a certain way, and indeed may need to design your whole program a certain way. Furthermore, you'd have to have a quite deep understanding of how the LINQ functions are implemented: is Count O(n)? Is Reverse lazy? In many cases you do not even have access to the knowledge without breaking abstraction, and risking your performance going down the drain if one of the implementations change. As is often the case with performance issues, knowledge is a dangerous thing.
There is also the issue of multiple iteration. The answer gives a filtering example, but what if you need to do a query twice? Is it better to keep the laziness, or forego it so that you only need to iterate once? In general it may be difficult to know. And even if you do know there is again the risk that it will change...
And then there are all the strange indirect costs. I recently did a test on a very tough problem (by game dev standards, taking some 10 seconds to execute in the version I optimised the most heavily). One of my optimisations was to replace LINQ code with normal array and for loops. It made a huge difference, but it turns out a large part of it came from replacing IEnumerators with concrete types, and not LINQ per se. Unfortunately, I could not determine exactly what was going on; it seems that the Unity profiler does not report frames that take over a certain amount of time to execute o.0!
The good news is, in a very large number of games, LINQ code makes up such a small part of the execution time that these concerns are pretty much irrelevant.
My general rule of thumb is this: when your code is so intense that using LINQ makes a difference, it means you are doing very interesting things with your data... Those interesting things are 9/10 times a better place to start looking for optimisation, and very often saves much more than removing LINQ would have. Programming is not about writing the most efficient programs possible, it's about using your time as efficiently as possible to write correct maintainable programs that run fast enough.
As my general rule of thumb I avoid using LINQ in FixedUpdate/Update/LateUpdate methods, use it sparingly in collision-related and other event-like methods (or in Awake and Start methods on objects that are created dynamically), and use it freely in Awake and Start methods that only run when the scene is loaded (but preferably followed by a manual invocation of the garbage collector before the scene starts updating).
Though the performance cost of very complex LINQ queries can potentially impact the framerate at runtime, as part of scene initialization the impact on loading times is highly unlikely to be perceivable by anyone.
So I've just released Smooth.Slinq, a faster-than-Linq-and-allocation-free, Linq / Scala inspired enumeration API for Unity.
Asset store link
Unity forums post
Smooth.Slinq and Smooth.Base documentation (base docs include basic types like Option, Tuple, etc).
Support forums
I would like to see how it looks in actual code though; could you provide some simple examples?
Suppose I wanted to back my entire library with your Slinq; what would be the steps?
Can it be hidden from users of my library that I use Slinq?
(Out of the box, foo.Slinq() extension methods are provided for IList<T>, LinkedList<T>, and Enumerable<T>, including descending ordering and other options where applicable. Smooth.Slinq.Slinqable defines these extensions and also has methods for creating simple numeric sequences, generic sequences generated by a delegate, empty Slinqs, repeating Slinqs, etc.)
So something like:
Note that one of the ways that real-world LINQ usage leads to allocations is due to closures. To avoid closures Slinq provides method signatures that take an extra, user-supplied parameter that is passed to any delegates and can be used to capture state:
A Slinq is equivalent to an IEnumerator, not an IEnumerable, so manually looped Slinq usage would look like:
Of course, Slinq has ForEach() methods, so those are also one-liners. "Loop syntax" is only needed for times when you want to consume part of a list with a particular operation and leave the rest for other operation(s).
Oh, I guess I should have linked the following doc for people who are unfamiliar with Option semantics (slinq.current is an Option<T>, as are the return values from potentially empty operations like First, Last, Max, etc. An Option<T> is a value type that can be thought of as a far more robust version of Nullable<T> and has two fields, public readonly bool isSome, and public readonly T value, as well as various chainable, LINQ-like methods.):
https://www.smooth-games.com/docs/smooth-base/
Documentation for Option<T>, Tuple<T1, ..., Tn>, etc are there.
Your Asset Store page has a couple of screenshots of the profiler, and makes claims about performance relative to Linq. Do you perhaps have an article/blogpost/whatever that discusses theses in more detail? Your URL just goes straight to your product forums.
And on the topic of your website and product forums, you may want to have a chat to @hermantulleken about their experiences launching Grids, and what did and didn't work from a marketing perspective. Time for a Humble Unity Plugin Bundle, methinks? ;)
(The reason I ask is that I'm actually solving similar problems (but for different reasons). For instance, the AOT compiler won't generate all the code necessary to do method calls like Contains, Remove, etc. on Lists and IEnumerables that contain value types. So I too have to roll our alternatives, and I have not figured out the right "place" to solve the problem. It would be interesting to hear how you approached it.)
I haven't actually looked at the LINQ code, but by simple profiling, "reasonable" assumptions, and various things I've read, I figure that for constant space operations, LINQ and Slinq have a near-identical amount of overhead, with LINQ using virtual methods while Slinq uses delegate invokes. The minor performance gains likely come from the fact that Slinqs are inherently stack-based and thus do not require iterator allocations and have less dereferencing which may lead to slightly faster IL. Also, the "bottom layer" Slinq will generally know how to manually enumerate its underlying collection, which saves a method call on each enumeration step.
But these are tiny differences. I think it's fair to say that LINQ and Slinq are equivalent speed-wise for constant space operations, with Slinq having a small advantage in simple benchmarks due to micro-optimizations. But the difference in raw speed is likely insignificant in real world scenarios, especially when compared to the cost of temporary allocations / deferred GC.
For linear space operations, well, LINQ has to allocate a lot of space! And AFAIK, it uses List<T>s, which will need various numbers of resizes depending on the size of the data, as well as the data composition for grouping operations.
Slinq is not currently optimized for raw speed, it uses singly linked lists for grouping and sorting in order to use a predictable amount of space that is directly linear to the size of the data and to facilitate "zero-copy" operations on pooled values. So Slinq does more method calls (and locking) to build it's storage, but once the pools are warm these calls are non-allocating.
Slinqs are also enumerators, not enumerables, which potentially allows for various optimizations since you can only enumerate once. It's also possible that some parts of Unity's version of LINQ are simply implemented badly. I mean, would that be surprising in any way? ;)
At any rate, I don't know exactly where all the speed gains come from for linear space operations, and some of it depends on the data, but they are there, and they are significant.
AOT is a bitch. ;)
I have another asset called Smooth.Compare that I use to eliminate comparison allocations and prevent JIT exceptions. Basically it's a replacement for System.Collections.Comparer/EqualityComparer<T>.Default that lets you register comparers or automatically creates them for various types using emit if JIT is available.
All my code that performs generic comparisons use the "Smooth.Collections" comparers which are explicitly registered for all known-to-be-compared value types when running in an AOT-only context. For operations like Contains on a List<T> you unfortunately have to supply a comparer on every method call, but you could potentially do this with an extension method.
And the reason I don't have implicit conversions to Slinq is that a Slinq enumerator of element type T is actually a Slinq<T, C> where C is a context type. And that C can't be filled in implicitly with the features available in C#.
Edit: Clarification on JITException avoidance.
My stuff on the Asset Store comes as code, and I have no plans to have obfuscated DLLs or anything else. I'm a coder and understand how important having the code is. I absolutely HATE using things that come in DLLs and being unable to modify / fix things or even see how they work when the documentation doesn't give enough detail (see: Unity, uLink).
I'm also currently working on packaging my "Foundation" classes in a free to use, free to redistribute way. These would have hooks for Smooth.Compare and Slinq for people who have them, otherwise it would fallback to LINQ and/or the default comparers (or manually defined ones, which would give users a reasonably clean way to avoid JITExceptions).
My experience with the Asset Store so far tells me it would be at least a week before such a thing were posted even if I submitted today (and I need to do a bunch of documenting, grr!)... But if you want to check it out I can get you the code directly.
Edit: Actually, here is the code, some of it has been recently written and/or refactored, so let me know if you notice anything strange:
https://www.smooth-games.com/unity/foundations/smooth-foundations.unitypackage
Changes include a Slinq API that is more drop-in compatible with LINQ, and "hooks" in Foundations that allow compiling against Slinq when available with a fallback to LINQ. Foundations also includes some useful embellishments to the LINQ API that work on various IEnumerable<T>s to help accomplish this.
Foundations can use Compare or fallback to manually set comparers. This gives library writers a way to manually circumvent JIT exceptions, potentially using Compare to quickly and easily diagnose potential problems during debugging but distributing without it.
I've also substantially reduced the prices of Slinq and Compare... it seems what would be considered a ludicrously cheap productivity / performance boost in the real world translates into "OMG too expensive, I'll just keep doing things the hard way and/or spiking FPS" in the zero-income-zero-users-to-please-and-zero-value-on-time world of Unity development. :D
Anyway, check it out if you get the chance and let me know if you find it helpful!
I've been looking through the docs for Foundations, and I see you've re-implemented a lot of standard classes (such as Tuple) that I wouldn't have expected. Why is this? Why would one use these instead of the .Net library equivalents?
And the Comparer stuff is there to prevent allocations and JIT exceptions. Actually Smooth.Collections.Comparer<T> and EqualityComparer<T> aren't currently in the docs because I forgot to change a #define when building the html. I've been meaning to rebuild the docs but that would imply resubmitting the assets and I've already been waiting for a while for approval and don't want to back to the end of the queue and wait another 2 weeks. 8(
Edit: Updated the online docs and unitypackage for Foundations. The new docs also include the Slinq fallback classes. And now that I look at the docs with fresh eyes I realize there are still some things missing, which I will probably update later tonight.
About selling to hobyist and and selling time to developers, I can give you what we experienced (obviously our tools differ quite a bit, and this is just one data point, so take it with the required amount of salt).
We had a similar experience with selling our Grids package. It seemed so obvious that it will save developers tonnes of time (and in your case, the problems your libraries solve are even more fundamental). The problem comes in with making them believe it. When you sell a bomb explosion, anyone can look at a video, see immediately it's what they want, and buy it. When it comes to abstract tools, even professionals can't make a judgement so quickly. You have to wade through code and docs, and even then it's a bit iffy whether it will actually work. And a bomb explosion is easily replaced. But a library that sits at the core of your games is a way bigger risk - what if you die? I think the trick to realise is that you are in fact not selling time - what you are really selling is trust.
Our main selling point is also "time" on paper - but people did not really believe it. So we made 30 games in a month to show people that it really is true. It's been our biggest driver of sales since. (It also allowed us to get some coverage, make a video that is watched way more often than our official "marketing" video, do some write-ups on our experience; it gave us a tonne of extra examples which users ask for all the time, etc.)
Maybe what you can do is do a massive (make it remarkable, not just a few graphs) benchmark project. Design a whole bunch of compelling tests that shows performance (I bundle memory and GC and all that with this) to show how well your software does compared to the built-ins. (I would take each of the LINQ operators in turn, figure out a way to show how the GC can kill you using the built-in version vs. the Smooth version and show it!) Or if you wanna sell LINQ, have a productivity competition between a the two methods ran over a time period.
It's also surprising how much effort we spend on "marketing" compared to extending and maintaining the library. (We bunch making examples for our web site, doing support, and such all part of marketing. It's a rather pleasant form of marketing - we make lots of cool stuff. It's certainly not making ads (which we tried, and found not to work)).
There are also other smaller things you can do to get more trust in you as professional, and your tool-house as a company. Things such as blogging about LINQ and other related issues problems; using a physical address, real name and personal email on the site; be around in other places (maybe you are already)... these can all help to make people believe you know what you are doing, see where you are coming from, etc. and give them something tangible to give them the feeling you will stick around.
Well these are just some random thoughts.
I wrote Slinq and Compare because they were incredibly useful for me, and the fact they they were stimulating technical problems added to the motivation! But given the sales figures it doesn't seem worthwhile to continue trying to sell them. Even if things ramped up quite a bit, I'd be much better off getting a job in the real world, solving interesting problems all day long, and working on my game as a hobby. I'd also have a lot more money to pay for art if I wanted to. :)
Plus, I'm working on refactoring my input layer, which isn't just for bindings, but is a clean, extensible, OO system for both managing user level inputs and converting them into the format you need / want for your game, and it would be *so* much cleaner and easier if I could rely on Slinq and Compare being available. Now, this asset would have straightforward and obvious "this does X, and you know you need X" value, so I figure it wouldn't be as reliant on marketing. Getting it out there and in use could spur sales of the other assets, but the pain of writing it without them is just... well, painful...
So I've decided to release Slinq and Compare under the MIT license. The mindshare and branding effect of giving the pure productivity and technical-problem-solving assets away for free will hopefully (!) provide more value for me than the potential sales.
Direct download links to the assets and documentation are available on the Smooth-Games forums.
http://blogs.unity3d.com/2014/05/20/the-future-of-scripting-in-unity/