path: root/volk/include
Age | Commit message | Author
2012-06-15 | filter: adding ssc and fsf versions of filter with associated new Volk kernels. | Tom Rondeau
These routines work and pass QA. They could use some performance work. The FSF is just slightly slower than before; the SCC version is more noticeably slower. Both could probably benefit from using SSE2 intrinsics to handle the shorts.
2012-06-14 | filter: added a ccf Volk dot product to use with ccf filters and used it in fir_filter_ccf. Produces improved results over the previous version. | Tom Rondeau
2012-06-14 | volk: fixes for 32f dot_prod | Tom Rondeau
Accepts num_points like everything else and handles splitting up the numbers itself rather than expecting that to be done externally. Adds AVX version, both aligned and unaligned.
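As a rough illustration of what "accepts num_points and handles the splitting itself" can look like, here is a minimal scalar sketch in C. It is not the Volk kernel: the function name dot_prod_32f_sketch, the block size of 16, and the four partial accumulators are assumptions chosen to mirror the entries in this log, and a real SSE or AVX flavor would use intrinsics in place of the inner loops.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical sketch, not the actual Volk kernel. The caller passes the true
 * number of points; the routine splits the work into 16-sample blocks plus a
 * scalar tail on its own. */
static void dot_prod_32f_sketch(float *result, const float *input,
                                const float *taps, size_t num_points)
{
    assert(result != NULL);

    const size_t block = 16;                      /* assumed block size */
    const size_t bulk = num_points - (num_points % block);
    float acc[4] = {0.0f, 0.0f, 0.0f, 0.0f};      /* four partial sums */
    size_t i;

    /* Bulk part: each 16-sample block is treated as four groups of four,
     * with one accumulator per group. */
    for (i = 0; i < bulk; i += block) {
        for (size_t lane = 0; lane < 4; ++lane) {
            for (size_t k = 0; k < 4; ++k) {
                const size_t idx = i + 4 * lane + k;
                acc[lane] += input[idx] * taps[idx];
            }
        }
    }

    float sum = acc[0] + acc[1] + acc[2] + acc[3];

    /* Tail part: the remaining num_points % 16 samples, one at a time. */
    for (; i < num_points; ++i) {
        sum += input[i] * taps[i];
    }

    *result = sum;
}
```

With this shape the caller never has to pad or pre-divide the length; a call with 19 points processes one block of 16 and then 3 tail samples.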
2012-06-13 | volk: dot_prod for floats does 16 at a time. | Tom Rondeau
This was done to give it the same performance as float_dotprod from before. This makes all flavors of the 32f dotprod work the same way. Because it's expecting the input to have 4x more samples than specified, it's making QA for these fail.
2012-06-13 | filter: process 4 vectors each time in volk dot_prod to speed up fir filters. | Tom Rondeau
This makes the volk version of the SSE FIR filter the same speed as using the hand-crafted float_dotprod from before.
2012-04-13 | Removed whitespace and added dtools/bin/remove-whitespace as a tool to do this in the future. The sed script was provided by Moritz Fischer. | Tom Rondeau
2012-03-22 | Merge branch 'master' into next | Johnathan Corgan
2012-03-21 | volk: add include for config.h to cc file | Josh Blum
volk_32f_s32f_convert_16i_a was also a problem; previously volk_32f_s32f_convert_16i_u was fixed to have config.h. I heard putting config.h stuff into public headers was bad practice. Including config.h into the generated cc file should fix this issue from now on.
2012-03-15 | Merge branch 'master' into next | Tom Rondeau
2012-03-15 | volk: turning off sse implementation of complex dot product for 32-bit machines until it's fixed. | Tom Rondeau
2012-03-15 | volk: makes the float-to-int conversion consistent and fixes an overflow bug on 32-bit machines. Not currently used in GNU Radio, so no change in behavior there. | Tom Rondeau
2012-03-03 | volk: include config.h to have rintf in windows/msvc. | Tom Rondeau
2012-03-03 | volk: include config.h to have rintf in windows/msvc. | Tom Rondeau
2012-03-01 | Merge branch 'master' into next | Johnathan Corgan
Conflicts: config/grc_volk.m4 gnuradio-core/src/guile/tests/gengen_ctors.test gnuradio-core/src/lib/Makefile.am gnuradio-core/src/lib/general/.gitignore gnuradio-core/src/lib/general/Makefile.am gnuradio-core/src/lib/gengen/.gitignore gnuradio-core/src/python/gnuradio/gr/Makefile.am volk/include/volk/Makefile.am
2012-02-23 | volk: float to short conversion is consistent between archs and tail cases. Rounds to nearest number. | Tom Rondeau
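A minimal C sketch of a float-to-short conversion that rounds to nearest and clips instead of wrapping (the clipping is what the 2012-01-26 float_to_short entry further down this log describes). Everything here is illustrative: the name convert_32f_to_16i_sketch is hypothetical, and ties are rounded away from zero by adding or subtracting 0.5, which is an assumption and may differ from what the real kernels do with rintf or the SSE conversion instructions.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical sketch, not the Volk kernel: scale, clip to the int16_t range,
 * then round to the nearest integer. NaN inputs are not handled. */
static void convert_32f_to_16i_sketch(int16_t *out, const float *in,
                                      float scalar, size_t num_points)
{
    assert(out != NULL && in != NULL);

    for (size_t i = 0; i < num_points; ++i) {
        float r = in[i] * scalar;

        /* Clip first so out-of-range values saturate instead of wrapping. */
        if (r > 32767.0f) {
            r = 32767.0f;
        } else if (r < -32768.0f) {
            r = -32768.0f;
        }

        /* Round to nearest; ties go away from zero (an assumption). The cast
         * truncates toward zero, so the result stays within int16_t range. */
        out[i] = (int16_t)(r >= 0.0f ? r + 0.5f : r - 0.5f);
    }
}
```

Running the same scalar path for every element, including whatever is left over after a vectorized loop, is one simple way to keep the tail case consistent with the bulk of the data.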
2012-02-23 | Merge remote branch 'bistromath/volk_fix' into safe_align | Tom Rondeau
Conflicts: volk/apps/volk_profile.cc
2012-02-22 | Remove ORC invocation since // doesn't dissuade the generator. | Nick Foster
2012-02-21 | Volk: add scalar const support to the profiler/QA code. Disabled volk_32fc_s32fc_multiply_32fc_a's Orc impl due to it not working. | Nick Foster
2012-02-14 | build: fix autotools for gnuradio-core volkification | Johnathan Corgan
2012-02-13 | volk: adding complex conjugate kernel. | Tom Rondeau
2012-02-13 | volk: fixed complex multiply and conjugate kernel to use xor for conjugation. | Tom Rondeau
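The idea behind "xor for conjugation" is that negating an IEEE 754 float only requires flipping its sign bit, so conjugating a complex sample amounts to XORing the imaginary part with 0x80000000. The scalar C sketch below illustrates this under the assumption of interleaved (real, imaginary) 32-bit floats; the name conjugate_32fc_sketch is hypothetical, and an SSE version would presumably apply the same mask to several samples at once with a packed XOR.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical sketch, not the Volk kernel. Input and output hold num_points
 * complex samples stored as interleaved (re, im) pairs of 32-bit floats. */
static void conjugate_32fc_sketch(float *out, const float *in, size_t num_points)
{
    /* The bit trick assumes a 32-bit float representation. */
    assert(sizeof(float) == sizeof(uint32_t));

    for (size_t i = 0; i < num_points; ++i) {
        uint32_t bits;

        out[2 * i] = in[2 * i];                  /* real part is unchanged */

        /* Flip only the sign bit of the imaginary part. memcpy is used for
         * the float/integer reinterpretation to avoid aliasing problems. */
        memcpy(&bits, &in[2 * i + 1], sizeof bits);
        bits ^= UINT32_C(0x80000000);
        memcpy(&out[2 * i + 1], &bits, sizeof bits);
    }
}
```

Compared with multiplying the imaginary part by -1, the XOR form needs no floating-point arithmetic at all, which is the usual motivation for the trick.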
2012-02-13 | volk: added a 32fc multiply conjugate kernel. | Tom Rondeau
2012-02-13 | volk: added unaligned version of adding 2 vectors. | Tom Rondeau
2012-02-13 | volk: new unaligned versions of float multipliers. | Tom Rondeau
2012-02-13 | core: moving multiply_const_ff from gengen to general to take advantage of volk. | Tom Rondeau
Also adds SSE and AVX and unaligned Volk versions for this.
2012-02-02 | volk: adding new functions to Makefile. | Tom Rondeau
2012-02-02 | volk: improving performance of multiply_const and multiply two streams. | Tom Rondeau
2012-02-02 | volk: adding unaligned versions of complex multiply a constant and complex multiply 2 streams. | Tom Rondeau
2012-01-30 | volk: adding complex to imag kernel. | Tom Rondeau
2012-01-29 | volk: added volk magnitude squared functions (aligned/unaligned) for complex numbers. | Tom Rondeau
2012-01-29 | volk: added unaligned volk function for magnitude of a complex number. | Tom Rondeau
2012-01-28 | volk: fix lower bound of int conversion. | Tom Rondeau
2012-01-26 | volk: float_to_int and float_to_char updated to clip instead of wrap around. The float to int clips at smaller than 2^32 because of the limits of the float representation. | Tom Rondeau
2012-01-26 | volk: fix a warning. | Tom Rondeau
2012-01-26 | volk: float_to_short now clips the values instead of wrapping around. | Tom Rondeau
2012-01-06 | Merge branch 'master' into next | Tom Rondeau
2012-01-06 | volk: minor changes to fix some warnings. | Tom Rondeau
2011-12-11 | Merge branch 'master' into next | Johnathan Corgan
Conflicts: config/lf_warnings.m4 volk/config/lf_warnings.m4
2011-12-11 | volk: fixed some of the simpler warnings. | Tom Rondeau
2011-12-08 | Removed autotools, gr-waveform, some cleanup | Johnathan Corgan
Nick Foster owes Nick Corgan a six-pack of beer!
2011-12-05 | volk: updated autotools Makefile to build with new headers. | Tom Rondeau
2011-12-01 | volk: squashed changes 32f_s32f_multiply_32f_a 32fc_x2_dot_prod_32fc_u | Josh Blum
2011-12-01 | Volk: 32f_s32f_multiply_32f | Nick Foster
2011-12-01 | Volk: added 32fc x scalar multiply, implemented in Orc & generic. Orc/SSE tested 10x faster than generic. | Nick Foster
2011-11-30 | volk: conversion tweaks to build avx on MSVC | Josh Blum
2011-10-20 | removes gcc warning, don't need symbol export on plain c structs | Josh Blum
2011-10-20 | volk: fixed signed/unsigned comparison warnings. | Tom Rondeau
2011-05-18 | Volk: renamed everything else to _a instead of _a16, makefiles included | Nick Foster
2011-05-18 | Volk: rename functions _a instead of _a16 | Nick Foster
2011-05-18 | Volk: rename aligned functions to just _a instead of _a16 | Nick Foster