bwlp/qemu.git - Experimental fork of QEMU with video encoding patches

	Commit message (Collapse)	Author	Age	Files	Lines
...
*	tcg/i386: Handle ctpop opcode	Richard Henderson	2017-01-10	1	-1/+11
\| \| \| \|	Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Rely on undefined/undocumented behaviour of BSF/BSR	Richard Henderson	2017-01-10	1	-13/+22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The ISA manual documents the output is undefined if the input was zero. However, we document in target-i386 that the behavior of real silicon is to preserve the contents of the output register. We also mention that there are real applications that depend on this. That this is baked into silicon is mentioned as a potential cause for some false sharing behaviour wrt lzcnt/tzcnt. Taking advantage of this allows us to save 2 insns in the normal case, and 4 insns for i686 emulating a 64-bit clz. Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Handle ctz and clz opcodes	Richard Henderson	2017-01-10	1	-9/+116
\| \| \| \|	Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Allow bmi2 shiftx to have non-matching operands	Richard Henderson	2017-01-10	1	-14/+19
\| \| \| \| \| \| \| \| \| \|	Previously we could not have different constraints for different ISA levels, which prevented us from eliding the matching constraint for shifts. We do now have to make sure that the operands match for constant shifts. We can also handle some small left shifts via lea. Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Hoist common arguments in tcg_out_op	Richard Henderson	2017-01-10	1	-102/+95
\| \| \| \|	Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Fuly convert tcg_target_op_def	Richard Henderson	2017-01-10	1	-142/+198
\| \| \| \| \| \| \|	Use a switch instead of searching a table. Share constraints between 32-bit and 64-bit, when at all possible. Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: Pass the opcode width to target_parse_constraint	Richard Henderson	2017-01-10	1	-9/+5
\| \| \| \| \| \| \| \| \| \| \| \|	This will let us choose how to interpret a given constraint depending on whether the opcode is 32- or 64-bit. Which will let us share more constraint combinations between opcodes. At the same time, change the interface to return the advanced pointer instead of passing it in/out by reference. Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: Transition flat op_defs array to a target callback	Richard Henderson	2017-01-10	1	-2/+12
\| \| \| \| \| \| \| \|	This will allow the target to tailor the constraints to the auto-detected ISA extensions. Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Implement field extraction opcodes	Richard Henderson	2017-01-10	1	-0/+38
\| \| \| \|	Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Extend TARGET_PAGE_MASK to the proper type	Richard Henderson	2016-09-20	1	-1/+1
\| \| \| \| \| \| \|	TARGET_PAGE_MASK, as defined, has type "int". We need to extend that to the proper target width before oring in an "unsigned". Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Add support for fence	Pranith Kumar	2016-09-16	1	-0/+17
\| \| \| \| \| \| \| \| \|	Generate a 'lock orl $0,0(%esp)' instruction for ordering instead of mfence which has similar ordering semantics. Signed-off-by: Pranith Kumar <bobby.prani@gmail.com> Message-Id: <20160714202026.9727-3-bobby.prani@gmail.com> Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: Support arbitrary size + alignment	Richard Henderson	2016-09-16	1	-9/+10
\| \| \| \| \| \| \| \| \| \| \|	Previously we allowed fully unaligned operations, but not operations that are aligned but with less alignment than the operation size. In addition, arm32, ia64, mips, and sparc had been omitted from the previous overalignment patch, which would have led to that alignment being enforced. Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: Improve the alignment check infrastructure	Sergey Sorokin	2016-07-06	1	-6/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Some architectures (e.g. ARMv8) need the address which is aligned to a size more than the size of the memory access. To support such check it's enough the current costless alignment check implementation in QEMU, but we need to support an alignment size specifying. Signed-off-by: Sergey Sorokin <afarallax@yandex.ru> Message-Id: <1466705806-679898-1-git-send-email-afarallax@yandex.ru> Signed-off-by: Richard Henderson <rth@twiddle.net> [rth: Assert in tcg_canonicalize_memop. Leave get_alignment_bits available for, though unused by, user-mode. Retain logging difference based on ALIGNED_ONLY.]
*	tcg: Optimize spills of constants	Richard Henderson	2016-07-06	1	-7/+14
\| \| \| \| \| \| \| \| \| \| \|	While we can store constants via constrants on INDEX_op_st_i32 et al, we weren't able to spill constants to backing store. Add a new backend interface, tcg_out_sti, which may store the constant (and is allowed to fail). Rearrange the temp_* helpers so that we only attempt to directly store a constant when the temp is becoming dead/free. Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: Clean up direct block chaining data fields	Sergey Fedorov	2016-05-13	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Briefly describe in a comment how direct block chaining is done. It should help in understanding of the following data fields. Rename some fields in TranslationBlock and TCGContext structures to better reflect their purpose (dropping excessive 'tb_' prefix in TranslationBlock but keeping it in TCGContext): tb_next_offset => jmp_reset_offset tb_jmp_offset => jmp_insn_offset tb_next => jmp_target_addr jmp_next => jmp_list_next jmp_first => jmp_list_first Avoid using a magic constant as an invalid offset which is used to indicate that there's no n-th jump generated. Signed-off-by: Sergey Fedorov <serge.fdrv@gmail.com> Signed-off-by: Sergey Fedorov <sergey.fedorov@linaro.org> Reviewed-by: Alex Bennée <alex.bennee@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg/i386: Make direct jump patching thread-safe	Sergey Fedorov	2016-05-13	1	-0/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Ensure direct jump patching in i386 is atomic by: * naturally aligning a location of direct jump address; * using atomic_read()/atomic_set() for code patching. tcg_out_nopn() implementation: Suggested-by: Richard Henderson <rth@twiddle.net>. Signed-off-by: Sergey Fedorov <serge.fdrv@gmail.com> Signed-off-by: Sergey Fedorov <sergey.fedorov@linaro.org> Message-Id: <1461341333-19646-6-git-send-email-sergey.fedorov@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: check for CONFIG_DEBUG_TCG instead of NDEBUG	Aurelien Jarno	2016-04-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Check for CONFIG_DEBUG_TCG instead of NDEBUG, drop now useless code. Cc: Richard Henderson <rth@twiddle.net> Signed-off-by: Aurelien Jarno <aurelien@aurel32.net> Message-id: 1461228530-14852-2-git-send-email-aurelien@aurel32.net Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
*	tcg: use tcg_debug_assert instead of assert (fix performance regression)	Aurelien Jarno	2016-04-21	1	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The TCG code is quite performance sensitive, but at the same time can also be quite tricky. That is why asserts that can be enabled with the --enable-debug-tcg configure option. This used to work the following way: \| #include "config.h" \| \| ... \| \| #if !defined(CONFIG_DEBUG_TCG) && !defined(NDEBUG) \| /* define it to suppress various consistency checks (faster) */ \| #define NDEBUG \| #endif \| \| ... \| \| #include <assert.h> Since commit 757e725b (tcg: Clean up includes) "config.h" as been replaced by "qemu/osdep.h" which itself includes <assert.h>. As a consequence the assertions are always enabled, even when using --disable-debug-tcg, causing a performance regression, especially on targets with many registers. For instance on qemu-system-ppc the speed difference is about 15%. tcg_debug_assert is controlled directly by CONFIG_DEBUG_TCG and already uses in some places. This patch replaces all the calls to assert into calss to tcg_debug_assert. Cc: Peter Maydell <peter.maydell@linaro.org> Cc: Richard Henderson <rth@twiddle.net> Signed-off-by: Aurelien Jarno <aurelien@aurel32.net> Message-id: 1461228530-14852-1-git-send-email-aurelien@aurel32.net Reviewed-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
*	tcg: Remove unnecessary osdep.h includes from tcg-target.inc.c	Peter Maydell	2016-02-23	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commit 757e725b58c57d added a number of #include "qemu/osdep.h" files to the tcg-target.c files (as they were named at the time). These are unnecessary because these files are not standalone C files, and the tcg/tcg.c file which includes them will have already included osdep.h on their behalf. Remove the unneeded include directives. Reviewed-by: Eric Blake <eblake@redhat.com> Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Message-Id: <1456238983-10160-4-git-send-email-peter.maydell@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>
*	tcg: Rename tcg-target.c to tcg-target.inc.c	Peter Maydell	2016-02-23	1	-0/+2464
	Rename the per-architecture tcg-target.c files to tcg-target.inc.c. This makes it clearer that they are not intended to be standalone C files, but are instead #included into another source file. Reviewed-by: Eric Blake <eblake@redhat.com> Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Message-Id: <1456238983-10160-2-git-send-email-peter.maydell@linaro.org> Signed-off-by: Richard Henderson <rth@twiddle.net>