summaryrefslogtreecommitdiff
path: root/volk
AgeCommit message (Collapse)Author
2012-06-15volk: adding new kernels to test and profile.Tom Rondeau
2012-06-15filter: adding ssc and fsf versions of filter with associated new Volk kernels.Tom Rondeau
These routines work and pass QA. They could use some performance work. the FSF is just slightly slower than before; the SCC version is more noticably slower. Both could benefit, probably, by using SSE2 intrinsics to handle the shorts.
2012-06-14filter: added a ccf Volk dot product to use with ccf filters and used it in ↵Tom Rondeau
fir_filter_ccf. Produces improved results to previous version.
2012-06-14volk: fixes for 32f dot_prodTom Rondeau
Accepts num_points like everything else and handles splitting up numbers itself, not expected to be done externally. Adds AVX version, both aligned and unaligned.
2012-06-13volk: dot_produce for floats does 16 at a time.Tom Rondeau
This was done to make this have the same performance as float_dotprod from before. This makes all flavors of the 32f dotprod work the same way. Because it's expecting the input to have 4x more samples than specified, it's making qa for these fail.
2012-06-13filter: process 4 vectors each time in volk dot_prod to speed up fir filters.Tom Rondeau
This makes the volk version of the SSE FIR filter the same speed as using the hand-crafted float_dotprod from before.
2012-04-23volk: force kwargs keys to be of type str, not unicode for py25Josh Blum
2012-04-19volk: code simplification, overrule macro and python optsJosh Blum
2012-04-19volk: avoid sse2 saturation issue 32768->32767Josh Blum
2012-04-19volk: added set_float_rounding to volk_cpu_initJosh Blum
2012-04-19volk: avx overrule is gcc4.4, make prints matchJosh Blum
2012-04-19volk: disable AVX if GCC version < 4.6.0Nick Foster
2012-04-19volk: gcc version check without __GNUC_PREREQJosh Blum
2012-04-19volk: added gcc version check to xgetbvJosh Blum
Reference https://code.google.com/p/pcsx2/issues/detail?id=1195
2012-04-19volk: remove norc, implement machine overruleJosh Blum
2012-04-19volk: use archs.xml to specify compiler flags + supportJosh Blum
2012-04-19volk: fix volk_profile install ruleJosh Blum
2012-04-19volk: move avx cpuid_x86_bit check in archs.xmlJosh Blum
2012-04-19volk: fix msvc __cpuid pointer castJosh Blum
2012-04-19Volk: redo the archs.xml language to make checks generic. no more "type", no ↵Nick Foster
more piles of #if crap in the template.
2012-04-19volk: fix for cpuid_eax check with hardcoded valuesJosh Blum
2012-04-19volk: removed old generator python codeJosh Blum
2012-04-19volk: updated build system for avx checking supportJosh Blum
updated copy of cpuid.h with the latest from gcc 4.6
2012-04-19volk: build system work, can build stand-alone msvcJosh Blum
2012-04-19volk: python checks and build system stuffJosh Blum
2012-04-19volk: make orc a normal arch with overruleJosh Blum
2012-04-19volk: added compile utils and cleanup cmakelistsJosh Blum
2012-04-19volk: working build w/ cmakelistsJosh Blum
2012-04-19volk: created other templates for runtime + machinesJosh Blum
2012-04-19volk: added kernel defs and typedefsJosh Blum
2012-04-19volk: work on template stuffJosh Blum
2012-04-19Merge branch 'maint'Johnathan Corgan
2012-04-18volk: gcc version check without __GNUC_PREREQJosh Blum
2012-04-18volk: added xgetbv stuff from volk_work to maintJosh Blum
This ensures that the compiler has support for xgetbv. This also fixes MSVC by checking for _xgetbv. Also, restored copy of cpuid.h, this should not be modified.
2012-04-16Merge branch 'maint'Johnathan Corgan
Conflicts: volk/gen/make_cpuid_c.py
2012-04-16Volk: also check to make sure OSXSAVE is enabled so you don't check XGETBV ↵Nick Foster
when OS has it disabled.
2012-04-16Volk: add support for checking AVX enable state of OS.Nick Foster
Some systems (notably Xen hypervisor) appear to use XSETBV to disable AVX. This causes SIGILL when running AVX instructions. This commit makes Volk check XCR0 on the AVX arch before proceeding.
2012-04-13Removed whitespace and added dtools/bin/remove-whitespace as a tool to do ↵Tom Rondeau
this in the future. The sed script was provided by Moritz Fischer.
2012-03-22Merge branch 'master' into nextJohnathan Corgan
2012-03-22volk: fix for load prefs (config path)Josh Blum
1) add APPDATA to search path for windows 2) graceful fail and env vars not defined
2012-03-21volk: add include for config.h to cc fileJosh Blum
volk_32f_s32f_convert_16i_a was also a problem, previously volk_32f_s32f_convert_16i_u was fixed to have config.h. I heard putting config h stuff into public headers was bad practice. Including config.h into the generated cc file should fix this issue from now on.
2012-03-19Merge branch 'master' into nextTom Rondeau
Conflicts: volk/config/orc.m4
2012-03-19volk: change minimum acceptable ORC version to 4.12 (in cmake and updating ↵Tom Rondeau
version in autotools).
2012-03-15Merge branch 'master' into nextTom Rondeau
2012-03-15Merge branch 'volk_32bit_fixes_maint' into maintTom Rondeau
2012-03-15Merge branch 'master' into nextJohnathan Corgan
2012-03-15volk: set local includes first for precedenceJosh Blum
2012-03-15volk: turning off sse implementation of complex dot product for 32-bit ↵Tom Rondeau
machines until it's fixed.
2012-03-15volk: makes the float-to-int conversion consistent and fixes an overflow bug ↵Tom Rondeau
on 32-bit machines. Not currently used in GNU Radio, so no change in behavior there.
2012-03-03volk: include config.h to have rintf in windows/msvc.Tom Rondeau