本文深刻分析並驗證了不一樣Java對象佔用內存空間大小的狀況。對於不一樣的jvm實現,Java對象佔用的內存空間大小可能不盡相同,本文主要分析HotSpot jvm中的狀況,實驗環境爲64位window10系統、JDK1.8,使用JProfiler進行結論驗證。html
Java對象的內存佈局包括:對象頭(Header),實例數據(Instance Data)和補齊填充(Padding)。java
關於對象頭的詳細介紹能夠參看個人博文:http://blog.csdn.net/codersha...,這裏只關注其內存佔用大小。在64位機器上,默認不開啓指針壓縮(-XX:-UseCompressedOops)的狀況下,對象頭佔用12bytes,開啓指針壓縮(-XX:+UseCompressedOops)則佔用16bytes。node
原生類型(primitive type)的內存佔用以下:程序員
Primitive Type | Memory Required(bytes) |
---|---|
byte, boolean | 1 byte |
short, char | 2 bytes |
int, float | 4 bytes |
long, double | 8 bytes |
對象引用(reference)類型在64位機器上,關閉指針壓縮時佔用8bytes, 開啓時佔用4bytes。編程
Java對象佔用空間是8字節對齊的,即全部Java對象佔用bytes數必須是8的倍數。例如,一個包含兩個屬性的對象:int和byte,並非佔用17bytes(12+4+1),而是佔用24bytes(對17bytes進行8字節對齊)bootstrap
首先根據以上的計算規則,進行一個簡單的驗證。使用下面的程序進行驗證:數組
public class Test { public static void main(String[] args) throws InterruptedException { TestObject testObject = new TestObject(); Thread.sleep(600 * 1000); System.out.println(testObject); } } class TestObject { private int i; private double d; private char[] c; public TestObject() { this.i = 1; this.d = 1.0; this.c = new char[]{'a', 'b', 'c'}; } }
TestObject對象有四個屬性,分別爲int, double, Byte, char[]類型。在打開指針壓縮(-XX:+UseCompressedOops)的狀況下,在64位機器上,TestObject佔用的內存大小應爲:數據結構
12(Header) + 4(int) + 8(double) + 4(reference) = 28 (bytes),加上8字節對齊,最終的大小應爲32 bytes。app
JProfiler中的結果爲:jvm
能夠看到,TestObject佔用的內存空間大小(Shallow Size)爲32 bytes。
關於Retained Size和Shallow Size的區別,能夠參看:
http://blog.csdn.net/e5945/ar...
當指針壓縮關閉時(-XX:-UseCompressedOops),在64位機器上,TestObject佔用的內存大小應爲:
16(Header) + 4(int) + 8(double) + 8(reference) = 36 (bytes), 8字節對齊後爲 40 bytes。
JProfile的結果爲:
包裝類(Boolean/Byte/Short/Character/Integer/Long/Double/Float)佔用內存的大小等於對象頭大小加上底層基礎數據類型的大小。
包裝類型的對象內存佔用狀況以下:
Numberic Wrappers | +useCompressedOops | -useCompressedOops |
---|---|---|
Byte, Boolean | 16 bytes | 24 bytes |
Short, Character | 16 bytes | 24 bytes |
Integer, Float | 16 bytes | 24 bytes |
Long, Double | 24 bytes | 24 bytes |
64位機器上,數組對象的對象頭佔用24 bytes,啓用壓縮後佔用16字節。比普通對象佔用內存可能是由於須要額外的空間存儲數組的長度。基礎數據類型數組佔用的空間包括數組對象頭以及基礎數據類型數據佔用的內存空間。因爲對象數組中存放的是對象的引用,因此對象數組自己的大小=數組對象頭+length * 引用指針大小,總大小爲對象數組自己大小+存放的數據的大小之和。
舉兩個例子:
關閉壓縮:24 + 10 * 4 = 64bytes。
關閉壓縮: Integer數組自己:24(header) + 3 * 8(Integer reference) = 48 bytes; 總共:48 + 3 * 24(Integer) = 120 bytes。
開啓壓縮: Integer數組自己:16(header) + 3 * 4(Integer reference) = 28(padding) -> 32 (bytes) 總共:32 + 3 * 16(Integer) = 80 (bytes)
在JDK1.7及以上版本中,String包含2個屬性,一個用於存放字符串數據的char[], 一個int類型的hashcode, 部分源代碼以下:
public final class String implements java.io.Serializable, Comparable<String>, CharSequence { /** The value is used for character storage. */ private final char value[]; /** Cache the hash code for the string */ private int hash; // Default to 0 }
所以,在關閉指針壓縮時,一個String自己須要 16(Header) + 8(char[] reference) + 4(int) = 32 bytes。
除此以外,一個char[]佔用24 + length * 2 bytes(8字節對齊), 即一個String佔用的內存空間大小爲:
56 + length * 2 bytes (8字節對齊)。
舉幾個例子。
String自己:12(Header) + 4(char[] reference) + 4(int hash) = 20(padding) -> 24 (bytes); 存儲數據:16(char[] header) + 5*2 = 26(padding) -> 32 (bytes) 總共:24 + 32 = 56 (bytes)
根據上面的內存佔用計算規則,能夠計算出一個對象在內存中的佔用空間大小狀況,下面舉例分析下Java中的Enum, ArrayList及HashMap的內存佔用狀況,讀者能夠仿照分析計算過程來計算其餘數據結構的內存佔用狀況。
注: 下面的分析計算基於HotSpot Jvm, JDK1.8, 64位機器,開啓指針壓縮。
建立enum時,編譯器會生成一個相關的類,這個類繼承自java.lang.Enum。Enum類擁有兩個屬性變量,分別爲int的ordinal和String的name, 相關源碼以下:
public abstract class Enum<E extends Enum<E>> implements Comparable<E>, Serializable { /** * The name of this enum constant, as declared in the enum declaration. * Most programmers should use the {@link #toString} method rather than * accessing this field. */ private final String name; /** * The ordinal of this enumeration constant (its position * in the enum declaration, where the initial constant is assigned * an ordinal of zero). * * Most programmers will have no use for this field. It is designed * for use by sophisticated enum-based data structures, such as * {@link java.util.EnumSet} and {@link java.util.EnumMap}. */ private final int ordinal; }
如下面的TestEnum爲例進行枚舉類的內存佔用分析
public enum TestEnum { ONE(1, "one"), TWO(2, "two"); private int code; private String desc; TestEnum(int code, String desc) { this.code = code; this.desc = desc; } public int getCode() { return code; } public String getDesc() { return desc; } }
這裏TestEnum的每一個實例除了父類的兩個屬性外,還擁有一個int的code及String的desc屬性,因此一個TestEnum的實例自己所佔用的內存大小爲:
12(header) + 4(ordinal) + 4(name reference) + 4(code) + 4(desc reference) = 28(padding) -> 32 bytes.
總共佔用的內存大小爲:
按照上面對字符串類型的分析,desc和name都佔用:48 bytes。
因此TestEnum.ONE佔用總內存大小爲:
12(header) + 4(ordinal) + 4(code) + 48 * 2(desc, name) + 4(desc reference) + 4(name reference) = 128 (bytes)
JProfiler中的結果能夠驗證上述分析:
在分析ArrayList的內存以前,有必須先了解下ArrayList的實現原理。
ArrayList實現List接口,底層使用數組保存全部元素。其操做基本上是對數組的操做。下面分析下源代碼:
/** * The array buffer into which the elements of the ArrayList are stored. * The capacity of the ArrayList is the length of this array buffer. Any * empty ArrayList with elementData == DEFAULTCAPACITY_EMPTY_ELEMENTDATA * will be expanded to DEFAULT_CAPACITY when the first element is added. */ transient Object[] elementData; // non-private to simplify nested class access
ArrayList提供了三種方式的構造器,能夠構造一個默認的空列表、構造一個指定初始容量的空列表及構造一個包含指定collection元素的列表,這些元素按照該collection的迭代器返回它們的順序排列。
/** * Shared empty array instance used for default sized empty instances. We * distinguish this from EMPTY_ELEMENTDATA to know how much to inflate when * first element is added. */ private static final Object[] DEFAULTCAPACITY_EMPTY_ELEMENTDATA = {}; /** * Constructs an empty list with the specified initial capacity. * * @ initialCapacity the initial capacity of the list * @throws IllegalArgumentException if the specified initial capacity * is negative */ public ArrayList(int initialCapacity) { if (initialCapacity > 0) { this.elementData = new Object[initialCapacity]; } else if (initialCapacity == 0) { this.elementData = EMPTY_ELEMENTDATA; } else { throw new IllegalArgumentException("Illegal Capacity: "+ initialCapacity); } } /** * Constructs an empty list with an initial capacity of ten. */ public ArrayList() { this.elementData = DEFAULTCAPACITY_EMPTY_ELEMENTDATA; } /** * Constructs a list containing the elements of the specified * collection, in the order they are returned by the collection's * iterator. * * @ c the collection whose elements are to be placed into this list * @throws NullPointerException if the specified collection is null */ public ArrayList(Collection<? extends E> c) { elementData = c.toArray(); if ((size = elementData.length) != 0) { // c.toArray might (incorrectly) not return Object[] (see 6260652) if (elementData.getClass() != Object[].class) elementData = Arrays.copyOf(elementData, size, Object[].class); } else { // replace with empty array. this.elementData = EMPTY_ELEMENTDATA; } }
ArrayList提供了set(int index, E element)、add(E e)、add(int index, E element)、addAll(Collection<? extends E> c)等,這裏着重介紹一下add(E e)方法。
/** * Appends the specified element to the end of this list. * * @ e element to be appended to this list * @return <tt>true</tt> (as specified by {@link Collection#add}) */ public boolean add(E e) { ensureCapacityInternal(size + 1); // Increments modCount!! elementData[size++] = e; return true; }
add方法將指定的元素添加到此列表的尾部。這裏注意下ensureCapacityInternal方法,這個方法會檢查添加後元素的個數是否會超過當前數組的長度,若是超出,數組將會進行擴容。
/** * Default initial capacity. */ private static final int DEFAULT_CAPACITY = 10; /** * Increases the capacity of this <tt>ArrayList</tt> instance, if * necessary, to ensure that it can hold at least the number of elements * specified by the minimum capacity argument. * * @ minCapacity the desired minimum capacity */ public void ensureCapacity(int minCapacity) { int minExpand = (elementData != DEFAULTCAPACITY_EMPTY_ELEMENTDATA) // any size if not default element table ? 0 // larger than default for default empty table. It's already // supposed to be at default size. : DEFAULT_CAPACITY; if (minCapacity > minExpand) { ensureExplicitCapacity(minCapacity); } } private void ensureCapacityInternal(int minCapacity) { if (elementData == DEFAULTCAPACITY_EMPTY_ELEMENTDATA) { minCapacity = Math.max(DEFAULT_CAPACITY, minCapacity); } ensureExplicitCapacity(minCapacity); } private void ensureExplicitCapacity(int minCapacity) { modCount++; // overflow-conscious code if (minCapacity - elementData.length > 0) grow(minCapacity); }
若是初始時沒有指定ArrayList大小,在第一次調用add方法時,會初始化數組默認最小容量爲10。看下grow方法的源碼:
/** * Increases the capacity to ensure that it can hold at least the * number of elements specified by the minimum capacity argument. * * @ minCapacity the desired minimum capacity */ private void grow(int minCapacity) { // overflow-conscious code int oldCapacity = elementData.length; int newCapacity = oldCapacity + (oldCapacity >> 1); if (newCapacity - minCapacity < 0) newCapacity = minCapacity; if (newCapacity - MAX_ARRAY_SIZE > 0) newCapacity = hugeCapacity(minCapacity); // minCapacity is usually close to size, so this is a win: elementData = Arrays.copyOf(elementData, newCapacity); }
從上述代碼能夠看出,數組進行擴容時,會將老數組中的元素從新拷貝一份到新的數組中,每次數組擴容的增加是原容量的1.5倍。這種操做的代價是很高的,所以在實際使用時,應該儘可能避免數組容量的擴張。當可預知要保存的元素的數量時,要在構造ArrayList實例時,就指定其容量,以免數組擴容的發生。或者根據實際需求,經過調用ensureCapacity方法來手動增長ArrayList實例的容量。
ArrayList其餘操做讀取刪除等原理這裏不做介紹了。
下面開始分析ArrayList的內存佔用狀況。ArrayList繼承AbstractList類,AbstractList擁有一個int類型的modCount屬性,ArrayList自己擁有一個int類型的size屬性和一個數組屬性。
因此一個ArrayList實例自己的的大小爲:
12(header) + 4(modCount) + 4(size) + 4(elementData reference) = 24 (bytes)
下面分析一個只有一個Integer(1)元素的ArrayList<Integer>實例佔用的內存大小。
ArrayList<Integer> testList = Lists.newArrayList(); testList.add(1);
根據上面對ArrayList原理的介紹,當調用add方法時,ArrayList會初始化一個默認大小爲10的數組,而數組中保存的Integer(1)實例大小爲16 bytes。
則testList佔用的內存大小爲:
24(ArrayList itselft) + 16(elementData array header) + 10 * 4(elemetData reference) + 16(Integer) = 96 (bytes)
JProfiler中的結果驗證了上述分析:
要分析HashMap的內存佔用,一樣須要先了解HashMap的實現原理。
HashMap是一個「鏈表散列」的數據結構,即數組和鏈表的結合體。
從圖上能夠看出,HashMap底層是一個數組結構,數組中的每一項又是一個鏈表。當新建一個HashMap的時候,初始化一個數組,源碼以下:
/** * The table, initialized on first use, and resized as * necessary. When allocated, length is always a power of two. * (We also tolerate length zero in some operations to allow * bootstrapping mechanics that are currently not needed.) */ transient Node<K,V>[] table;
Node是鏈表中一個結點,一個Node對象保存了一對HashMap的Key,Value以及指向下一個節點的指針,源碼以下:
/** * Basic hash bin node, used for most entries. (See below for * TreeNode subclass, and in LinkedHashMap for its Entry subclass.) */ static class Node<K,V> implements Map.Entry<K,V> { final int hash; final K key; V value; Node<K,V> next; Node(int hash, K key, V value, Node<K,V> next) { this.hash = hash; this.key = key; this.value = value; this.next = next; } }
HashMap提供了四種方式的構造器,分別爲指定初始容量及負載因子構造器,指定初始容量構造器,不指定初始容量及負載因子構造器,以及根據已有Map生成新Map的構造器。
/** * Constructs an empty <tt>HashMap</tt> with the specified initial * capacity and load factor. * * @ initialCapacity the initial capacity * @ loadFactor the load factor * @throws IllegalArgumentException if the initial capacity is negative * or the load factor is nonpositive */ public HashMap(int initialCapacity, float loadFactor) { if (initialCapacity < 0) throw new IllegalArgumentException("Illegal initial capacity: " + initialCapacity); if (initialCapacity > MAXIMUM_CAPACITY) initialCapacity = MAXIMUM_CAPACITY; if (loadFactor <= 0 || Float.isNaN(loadFactor)) throw new IllegalArgumentException("Illegal load factor: " + loadFactor); this.loadFactor = loadFactor; this.threshold = tableSizeFor(initialCapacity); } /** * Constructs an empty <tt>HashMap</tt> with the specified initial * capacity and the default load factor (0.75). * * @ initialCapacity the initial capacity. * @throws IllegalArgumentException if the initial capacity is negative. */ public HashMap(int initialCapacity) { this(initialCapacity, DEFAULT_LOAD_FACTOR); } /** * Constructs an empty <tt>HashMap</tt> with the default initial capacity * (16) and the default load factor (0.75). */ public HashMap() { this.loadFactor = DEFAULT_LOAD_FACTOR; // all other fields defaulted } /** * Constructs a new <tt>HashMap</tt> with the same mappings as the * specified <tt>Map</tt>. The <tt>HashMap</tt> is created with * default load factor (0.75) and an initial capacity sufficient to * hold the mappings in the specified <tt>Map</tt>. * * @ m the map whose mappings are to be placed in this map * @throws NullPointerException if the specified map is null */ public HashMap(Map<? extends K, ? extends V> m) { this.loadFactor = DEFAULT_LOAD_FACTOR; putMapEntries(m, false); }
若是不指定初始容量及負載因子,默認的初始容量爲16, 負載因子爲0.75。
負載因子衡量的是一個散列表的空間的使用程度,負載因子越大表示散列表的裝填程度越高,反之愈小。對於使用鏈表法的散列表來講,查找一個元素的平均時間是O(1+a),所以若是負載因子越大,對空間的利用更充分,然然後果是查找效率的下降;若是負載因子過小,那麼散列表的數據將過於稀疏,對空間形成嚴重浪費。
HashMap有一個容量閾值屬性threshold,是根據初始容量和負載因子計算得出threshold=capacity*loadfactor, 若是HashMap中數組元素的個數超過這個閾值,則HashMap會進行擴容。HashMap底層的數組長度老是2的n次方,每次擴容容量爲原來的2倍。
擴容的目的是爲了減小hash衝突,提升查詢效率。而在HashMap數組擴容以後,最消耗性能的點就出現了:原數組中的數據必須從新計算其在新數組中的位置,並放進去,這就是resize。
public V put(K key, V value) { return putVal(hash(key), key, value, false, true); } /** * Implements Map.put and related methods * * @ hash hash for key * @ key the key * @ value the value to put * @ onlyIfAbsent if true, don't change existing value * @ evict if false, the table is in creation mode. * @return previous value, or null if none */ final V putVal(int hash, K key, V value, boolean onlyIfAbsent, boolean evict) { Node<K,V>[] tab; Node<K,V> p; int n, i; //初始化數組的大小爲16,容量閾值爲16*0.75=12 if ((tab = table) == null || (n = tab.length) == 0) n = (tab = resize()).length; //若是key的hash值對應的數組位置沒有元素,則新建Node放入此位置 if ((p = tab[i = (n - 1) & hash]) == null) tab[i] = newNode(hash, key, value, null); else { Node<K,V> e; K k; if (p.hash == hash && ((k = p.key) == key || (key != null && key.equals(k)))) e = p; else if (p instanceof TreeNode) e = ((TreeNode<K,V>)p).putTreeVal(this, tab, hash, key, value); else { for (int binCount = 0; ; ++binCount) { if ((e = p.next) == null) { p.next = newNode(hash, key, value, null); if (binCount >= TREEIFY_THRESHOLD - 1) // -1 for 1st treeifyBin(tab, hash); break; } if (e.hash == hash && ((k = e.key) == key || (key != null && key.equals(k)))) break; p = e; } } if (e != null) { // existing mapping for key V oldValue = e.value; if (!onlyIfAbsent || oldValue == null) e.value = value; afterNodeAccess(e); return oldValue; } } ++modCount; if (++size > threshold) resize(); afterNodeInsertion(evict); return null; }
從上面的源代碼中能夠看出:當咱們往HashMap中put元素的時候,先根據key的hashCode從新計算hash值,根據hash值獲得這個元素在數組中的位置(即下標),若是數組該位置上已經存放有其餘元素了,那麼在這個位置上的元素將以鏈表的形式存放,新加入的放在鏈頭,最早加入的放在鏈尾。若是數組該位置上沒有元素,就直接將該元素放到此數組中的該位置上。
關於HashMap數據的讀取這裏就不做介紹了。
這裏分析一個只有一組鍵值對的HashMap, 結構以下:
Map<Integer, Integer> testMap = Maps.newHashMap(); testMap.put(1, 2);
首先分析HashMap自己的大小。HashMap對象擁有的屬性包括:
/** * The table, initialized on first use, and resized as * necessary. When allocated, length is always a power of two. * (We also tolerate length zero in some operations to allow * bootstrapping mechanics that are currently not needed.) */ transient Node<K,V>[] table; /** * Holds cached entrySet(). Note that AbstractMap fields are used * for keySet() and values(). */ transient Set<Map.Entry<K,V>> entrySet; /** * The number of key-value mappings contained in this map. */ transient int size; /** * The number of times this HashMap has been structurally modified * Structural modifications are those that change the number of mappings in * the HashMap or otherwise modify its internal structure (e.g., * rehash). This field is used to make iterators on Collection-views of * the HashMap fail-fast. (See ConcurrentModificationException). */ transient int modCount; /** * The next size value at which to resize (capacity * load factor). * * @serial */ // (The javadoc description is true upon serialization. // Additionally, if the table array has not been allocated, this // field holds the initial array capacity, or zero signifying // DEFAULT_INITIAL_CAPACITY.) int threshold; /** * The load factor for the hash table. * * @serial */ final float loadFactor;
HashMap繼承了AbstractMap<K,V>, AbstractMap有兩個屬性:
transient Set<K> keySet; transient Collection<V> values;
因此一個HashMap對象自己的大小爲:
12(header) + 4(table reference) + 4(entrySet reference) + 4(size) + 4(modCount) + 4(threshold) + 8(loadFactor) + 4(keySet reference) + 4(values reference) = 48(bytes)
接着分析testMap實例在總共佔用的內存大小。
根據上面對HashMap原理的介紹,可知每對鍵值對對應一個Node對象。根據上面的Node的數據結構,一個Node對象的大小爲:
12(header) + 4(hash reference) + 4(key reference) + 4(value reference) + 4(next pointer reference) = 28 (padding) -> 32(bytes)
加上Key和Value兩個Integer對象,一個Node佔用內存總大小爲:32 + 2 * 16 = 64(bytes)
JProfiler中結果:
下面分析HashMap的Node數組的大小。
根據上面HashMap的原理可知,在不指定容量大小的狀況下,HashMap初始容量爲16,因此testMap的Node[]佔用的內存大小爲:
16(header) + 16 * 4(Node reference) + 64(Node) = 144(bytes)
JProfile結果:
因此,testMap佔用的內存總大小爲:
48(map itself) + 144(Node[]) = 192(bytes)
JProfile結果:
這裏只用一個例子說明如何對HashMap進行佔用內存大小的計算,根據HashMap初始化容量的大小,以及擴容的影響,HashMap佔用內存大小要進行具體分析,
不過思路都是一致的。
以上對計算Java對象佔用內存的基本規則及方法進行了介紹,並經過分析枚舉類,ArrayList, HashMap的內存佔用狀況介紹了分析複雜對象內存佔用的基本方法,
實際工做中須要精確計算Java對象內存佔用狀況的場景很少,不過做爲Java程序員的基本素養,瞭解這方面內容仍是頗有必要的。
An overview of memory saving techniques in Java
Memory consumption of popular Java data types – part 1
Memory consumption of popular Java data types – part 2
memory usage compare hashmap, arraylist, and array
How does a HashMap work in JAVA
《Java編程思想》《深刻理解Java虛擬機》