Stuff Gil Says: A Pauseless HashMap

Saturday, March 29, 2014

A Pauseless HashMap

HashMaps are great. And fast. Well, fast most of the time. If you keep growing them, you'll get elevator music every once in a while. Then they go fast again. For a while.

Wouldn't it be nice if HashMaps didn't stall your code even when they were resizing?

Some background: As those of you who have read my various past rants may have noticed, I spend a lot of my time thinking about the behavior of {latency, response-time, reaction-time}. In addition to trying to better understand or teach about the behavior (with monitoring and measurement tools like HdrHistogram, LatencyUtils, and jHiccup), I actually work on things that try to improve bad behavior. For some definitions of "improve" and "bad". Eliminating pausing behavior in GC was the lowest hanging fruit, but more recent work has focused on eliminating pauses due to other things that stand out once those pesky GC blips are gone. Things like at-market-open deoptimzations. Things like lock deflation, lock de-biasing, class unloading, weak reference processing, and all sorts of TTSP (time to safepoint) issues. I've also learned a lot about how to bring down Linux's contribution to latency spikes.

But the JVM and the OS are not the only things that cause latency spikes. Sometimes it's your code, and the code is doing something "spiky". In my day job, I keep running into in actual, real-world low latency system code that is typically super-fast, but occasionally spikes in actual work latency due to some rare but huge piece of work that needs to be done. This is most often associated with some state accumulation. Once we eliminate GC pauses (which tend to dominate latency spikes, but also tend to simply disappear when Zing is applied), we get to see the things that were hiding in the GC noise. We often run into "nice" patterns of growing latency spikes at growing intervals, with a near-perfect doubling in both magnitude and interval between the spikes. This happens so often that we've studied the common causes, and (by far) the most common culprits seem to be HashMaps. The kind used to accumulate something during the day, and which resize in powers-of-two steps as a result.

I've had "build a Pauseless HashMap" on my weekend project list for over a year now, but finally got around to actually building it (at the request of a friend on the mechanical sympathy group). There are probably at least 17 ways to skin a HashMap so it won't stall puts and gets when it resizes, but this is my simple take on it:

https://github.com/giltene/PauselessHashMap

Keep in mind that (so far) this is a "probably-working draft" that's gone through some bench testing, but is not yet battle hardened (scrutiny is welcome).

I intentionally based this version on Apache Harmony's version of HashMap, and not on OpenJDK's, in order to make it available without GPLv2 license restrictions (for those who may want to include it in non-GPL products). The background resizing concept itself is simple, and can be applied just as easily to the OpenJDK version (e.g. if some future Java SE version wants to use it). You can use (https://svn.apache.org/repos/asf/harmony/enhanced/java/trunk/classlib/modules/luni/src/main/java/java/util/HashMap.java) as a baseline comparison for the code I started with.

This is also a classic example of how GC makes this sort of concurrent programming thing both fast and simple. This is a classic case of an asymmetric speed need between two concurrent actors that share mutable state. I worked hard to make the fast path get() and put() cheap, and managed (I think) to not even use volatiles in the fast path. In doing this, I shifted all the cost I could think of to the background work, where latency doesn't matter nearly as much. This sort of trick would be much harder (or slower) to do if GC wasn't there to safely pick up the junk behind us, as it would (invariably, I think) add a need for additional synchronizing work in the fast path.

37 comments:

Rüdiger MöllerMarch 30, 2014 at 4:37 AM
Hi Gil,
usually one tries to avoid resizes by statically "guessing" the "right" size :-). Async resizing can be of particular interest when having very large tables >5 million entries. Many implementations out there show up resizing pauses up to several seconds in these cases.
Would be interesting what cost (if any) there exists compared to the original "blocking-resize" version (for the non-resizing case). Did you do any comparision ?
ReplyDelete
Replies
IanApril 15, 2016 at 3:57 AM
This comment has been removed by the author.
ReplyDelete
Replies
IanApril 15, 2016 at 3:58 AM
Why did not use one of the many published extendible hashing (or dynamic hashing) systems, I recalled them from my Comp Sci degree 20 years ago, but yet have not seen them used in “modem” systems.
http://arl.wustl.edu/~sailesh/download_files/Limited_Edition/hash/Dynamic%20Hash%20Tables.pdf is one of many papers that can be used as the start of a research search.
ReplyDelete
Replies
Gil TeneApril 15, 2016 at 8:13 AM
Because I wanted lookup times similar to HashTable. The various extensible schemes come with significantly higher get() costs.
ReplyDelete
Replies
AnonymousDecember 6, 2018 at 5:58 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Rithi RawatDecember 21, 2018 at 10:53 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
UnknownDecember 22, 2018 at 3:41 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Robotic Process Automation TutorialDecember 22, 2018 at 4:44 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
prasathDecember 22, 2018 at 9:19 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Diya shreeDecember 30, 2018 at 4:45 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Mamta PrasadFebruary 13, 2019 at 1:38 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
IT TutorialsAugust 12, 2019 at 3:48 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
joseSeptember 19, 2019 at 12:06 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
BenishSeptember 23, 2019 at 3:47 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Jack sparrowJune 29, 2020 at 3:52 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
harinijegan80December 9, 2020 at 2:04 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
HuongkvJanuary 17, 2021 at 10:38 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Bensu SoralJune 25, 2021 at 10:34 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
UnknownJuly 21, 2021 at 6:52 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
gipaş tekstilAugust 10, 2021 at 8:24 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
UnknownAugust 14, 2021 at 11:14 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Mike JohnsonSeptember 30, 2021 at 10:39 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Mike JohnsonSeptember 30, 2021 at 10:43 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Selena3January 15, 2022 at 6:45 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
نهالستان ایرانJanuary 27, 2022 at 1:30 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
UnknownJanuary 31, 2022 at 7:26 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Peter JohnsonFebruary 2, 2022 at 9:11 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
Back linkerMay 10, 2022 at 7:32 AM
Bandicam crack is a lightweight screen recorder for Windows that can catch anything on your PC screen as a great video.I has been moreover deferential. exact play-success! Bandicam Cracked
ReplyDelete
Replies
Back linkerMay 10, 2022 at 8:48 AM
Took me technology to entre all of the observations, however I absolutely loved the article. It proved to be Very cordial to me and i am unconditional to all the commenters here! Its constantly clean whilst you can't on your own be informed, however with entertained! Avast Driver Updater Activation Code
ReplyDelete
Replies
warezsoftsJuly 11, 2022 at 3:17 AM
This is great site and its explanation was very nice.
warez reddit
ReplyDelete
Replies
JohnJuly 24, 2025 at 7:16 AM
Why did not use one of the many published extendible hashing (or dynamic hashing) systems, I recalled them from my Comp Sci degree 20 years ago, but yet have not seen them used in “modem” systems.
persian music
ReplyDelete
Replies
المثالي كلينNovember 20, 2025 at 11:00 PM
اعمال مميزة تابع اعمالنا في معالجة مشكلة الحمام والنظافة للمنازل من الداخل ك، الغرف حيص يتمثل الحل الأمثل في التعاون مع مكافحة الحمام الرياض لتنفيز تركيب طارد الحمام كإجراء وقائي فعال، بالإضافة إلى أهمية الصيانة والنظافة المستمرة كما أن استخدام الحلول الطبيعيhة الفعالة في التخلص من الروائح، مثل التعطر والتنظيف الشامل شارك خدمة رش حشرات بالقطيف وستفيد من وجدنا بالمنطقة الشرقية .
ReplyDelete
Replies
مجموعة ولاد االبلدNovember 22, 2025 at 3:00 PM
كل الشكر اعمال مميزو زورو اعمال مؤسسة البلد وتعرف علي خدماتنا تسليك مجاري في عجمان الامارات فـ تعد مشكلة انسداد المجاري أحد أكثر المشكلات شيوعًا في المنازل، وقد تؤدي إلى روائح كريهة ومشكلات صحية. هنا تأتي أهمية خدمة تسليك المجاري التي تقدمها "ولاد البلد" تعتمد الشركة على تقنيات حديثة وأدوات متخصصة للتعامل مع هذه المشكلة.
ReplyDelete
Replies
HarishApril 7, 2026 at 5:47 AM
Great article with a clean and easy-to-follow explanation. This kind of content really helps learners grow. You can also explore AI powered Core JAVA Online Training in ameerpet.
ReplyDelete
Replies
Linda L YanceyApril 30, 2026 at 11:48 PM
I want to help people figure out what they have and what helped me to completely eradicate the Herpes virus infection.
I had unprotected sex and a few days later felt sick thinking I was just hungover. After spending a day at the beach I felt like I had severe sun exhaustion. For the next three days, I had a bad fever off and on. I noticed what I thought was a canker sore inside my lower lip. I started to suspect I had herpes.
I searched online and called different clinics and two days later I was in the midst of a full blown outbreak! Sores all over my labia and anus and mouth. I became very, very depressed and was even thinking of suicide.
I started to try everything I could find online. None of these remedies had any visible results.
Finally, I found a naturopathic doctor, who honestly told me I should contact Dr. Utu for my condition and recommended the herpes herbal cleanser.
I ordered the Herpes Herbal Cleanser through a DHL Express courier and started treatment
Within two days I started seeing good results with only one sore left on my genitals and my mouth was all healed.
I completed the herbal treatment under Dr Utu guidance and he asked me to go do a blood test which I curiously did
I got the results of my blood test and can't believe that I tested negative for HSV2 and negative for HSV1
People still cure herpes infection with herbal medicine and even other more deadly diseases with lifestyle changes and herbal medicine
While there are bad doctors and bad physicians in clinics and online, there are also the good ones and Dr Utu is one of the best.
Dr Utu can only be reached at
drutuherbalcure@gmail.com

ReplyDelete
Replies

Add comment