US5457783A - Adaptive speech coder having code excited linear prediction - Google Patents
- Publication number
- US5457783A (application US07/927,137)
- Authority
- US
- United States
- Prior art keywords
- codevector
- speech
- pulse
- transmission
- coder
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
- G10L2019/0001—Codebooks
- G10L2019/0007—Codebook element generation
- G10L2019/0013—Codebook search algorithms
Definitions
- the present invention relates to the field of speech coding, and more particularly, to improvements in the field of adaptive coding of speech or voice signals wherein code excited linear prediction (CELP) techniques are utilized.
- CELP code excited linear prediction
- Digital telecommunication carrier systems have existed in the United States since approximately 1962 when the T1 system was introduced. This system utilized a 24-voice channel digital signal transmitted at an overall rate of 1.544 Mb/s. In view of cost advantages over existing analog systems, the T1 system became widely deployed.
- An individual voice channel in the T1 system was typically generated by band limiting a voice signal in a frequency range from about 300 to 3400 Hz, sampling the limited signal at a rate of 8 kHz, and thereafter encoding the sampled signal with an 8 bit logarithmic quantizer.
- the resultant digital voice signal was a 64 kb/s signal.
- 24 individual digital voice signals were multiplexed into a single data stream.
- the T1 system is limited to 24 voice channels if 64 kb/s voice signals are used.
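The channel arithmetic above can be checked with a short sketch. The single framing bit per frame is an assumption not stated in the text (it is what makes 24 channels at 64 kb/s add up to 1.544 Mb/s), and the 16 kb/s rate in the usage note is purely hypothetical.

```python
SAMPLE_RATE_HZ = 8000        # sampling rate given in the text
BITS_PER_SAMPLE = 8          # 8 bit logarithmic quantizer
CHANNELS = 24                # voice channels in one T1 stream
FRAMING_BITS_PER_FRAME = 1   # assumption: one framing bit per frame

# Rate of one voice channel: 8000 samples/s * 8 bits = 64 kb/s.
channel_rate = SAMPLE_RATE_HZ * BITS_PER_SAMPLE

# Payload of the multiplexed stream, then the overall line rate.
payload_rate = CHANNELS * channel_rate
line_rate = payload_rate + FRAMING_BITS_PER_FRAME * SAMPLE_RATE_HZ


def channels_at(rate_bps):
    """Number of voice channels that fit in the same payload at a lower rate."""
    return payload_rate // rate_bps
```

For example, `channels_at(16000)` shows how many channels the same payload would carry if each voice signal were coded at a hypothetical 16 kb/s.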
- the individual signal transmission rate must be reduced from 64 kb/s to some lower rate.
- The problem with lowering the transmission rate in the typical T1 voice signal generation scheme, by either reducing the sampling rate or reducing the size of the quantizer, is that certain portions of the voice signal essential for accurate reproduction of the original speech are lost.
- TC transform coding
- ATC adaptive transform coding
- LPC linear prediction coding
- a speech signal is divided into sequential blocks of speech samples.
- the samples in each block are arranged in a vector and transformed from the time domain to an alternate domain, such as the frequency domain.
- each block of speech samples is analyzed in order to determine the linear prediction coefficients for that block and other information such as long term predictors (LTP).
- LTP long term predictors
- Linear prediction coefficients are equation components which reflect certain aspects of the spectral envelope associated with a particular block of speech signal samples. Such spectral information represents the dynamic properties of speech, namely formants.
- Speech is produced by generating an excitation signal which is either periodic (voiced sounds), aperiodic (unvoiced sounds), or a mixture (e.g. voiced fricatives).
- the periodic component of the excitation signal is known as the pitch.
- the excitation signal is filtered by a vocal tract filter, determined by the position of the mouth, jaw, lips, nasal cavity, etc. This filter has resonances or formants which determine the nature of the sound being heard.
- the vocal tract filter provides an envelope to the excitation signal. Since this envelope contains the filter formants, it is known as the formant or spectral envelope. It is this spectral envelope which is reflected in the linear prediction coefficients.
- Long Term Predictors are filters reflective of redundant pitch structure in the speech signal. Such structure is removed by estimating the LTP values for each block and subtracting those values from current signal values. The removal of such information permits the speech signal to be converted to a digital signal using fewer bits. The LTP values are transmitted separately and added back to the remaining speech signal at the receiver.
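The removal and restoration of pitch structure described above can be sketched as follows. This is a minimal illustration only, assuming a single-tap predictor with a hypothetical gain `beta` and pitch `lag` in samples; the text does not specify the predictor order, and the function names are not from the patent.

```python
def ltp_residual(signal, lag, beta):
    """Transmitter side: subtract the prediction beta * signal[n - lag]."""
    return [
        s - (beta * signal[n - lag] if n >= lag else 0.0)
        for n, s in enumerate(signal)
    ]


def ltp_restore(residual, lag, beta):
    """Receiver side: add the prediction back using already restored samples."""
    restored = []
    for n, r in enumerate(residual):
        prediction = beta * restored[n - lag] if n >= lag else 0.0
        restored.append(r + prediction)
    return restored
```

For a perfectly periodic signal whose period equals `lag`, the residual after the first period is zero, which is why fewer bits are needed once the pitch redundancy is removed.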
- In order to understand how a speech signal is reduced and converted to digital form using LPC techniques, consider the generation of a synthesized or reproduced speech signal by an LPC vocoder.
- A generalized prior art LPC vocoder is shown in FIG. 1.
- the device shown converts transmitted digital signals into synthesized voice signals, i.e., blocks of synthesized speech samples.
- a synthesis filter utilizing the LPCs determined for a given block of samples, produces a synthesized speech output by filtering the excitation signal in relation to the LPCs.
- Both the synthesis filter coefficients (LPCs) and the excitation signal are updated for each sample block or frame (i.e. every 20-30 milliseconds).
- the excitation signal can be either a periodic excitation signal or a noise excitation signal.
- synthesized speech produced by an LPC vocoder can be broken down into three basic elements:
- the speech signal has a definite pitch period (or periodicity) and this is accounted for by the periodic excitation signal which is composed largely of pulses spaced at the pitch period (determined from the LTP);
- the speech signal is much more like random noise and has no periodicity and this is provided for by the noise excitation signal.
- a switch controls which form of excitation signal is fed to the synthesis filter.
- the gain controls the actual volume level of the output speech.
- Both types of excitation (2) and (3) are, therefore, very different in the time domain (one being made up of equally spaced pulses while the other is noise-like) but both have the common property of a flat spectrum in the frequency domain.
- the correct spectral shape will be provided at the output of the synthesis by the LPCs.
- LPC vocoder requires the transmission of only the LPCs and the excitation information, i.e., whether the switch provides periodic or noise-like excitation to the speech synthesizer. Consequently, a reduced bit rate can be used to transmit speech signals processed in an LPC vocoder.
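The LPC vocoder structure described above (a switch between periodic and noise excitation, a gain, and a synthesis filter) can be sketched as below. This is an illustration under stated assumptions: the all-pole filter uses the sign convention out[n] = e[n] + sum of lpcs[k] * out[n-1-k], the periodic excitation is a unit pulse train, and the noise excitation is Gaussian. None of these details are given in the text.

```python
import random


def synthesis_filter(excitation, lpcs):
    """All-pole filter starting from zero memory (assumed sign convention)."""
    history = [0.0] * len(lpcs)  # most recent output first
    out = []
    for e in excitation:
        y = e + sum(a * h for a, h in zip(lpcs, history))
        out.append(y)
        history = [y] + history[:-1]
    return out


def lpc_vocoder_frame(lpcs, gain, voiced, pitch_period, frame_len, seed=0):
    """Synthesize one frame; `voiced` plays the role of the excitation switch."""
    if voiced:
        # Periodic excitation: pulses spaced at the pitch period.
        excitation = [1.0 if n % pitch_period == 0 else 0.0
                      for n in range(frame_len)]
    else:
        # Noise excitation: no periodicity.
        rng = random.Random(seed)
        excitation = [rng.gauss(0.0, 1.0) for _ in range(frame_len)]
    return synthesis_filter([gain * e for e in excitation], lpcs)
```

Only `lpcs`, `gain`, the switch setting and the pitch period would need to be transmitted for each frame in this sketch.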
- CELP vocoders overcome this problem by leaving ON both the periodic and noise-like signals at the same time.
- the degree to which each of these signals makes up the excitation signal (e(n)) for provision to the synthesis filter is determined by separate gains which are assigned to each of the two excitations.
- the excitation will be a mixture of the two if the gains are both non-zero.
- CELP CELP and simple LPC vocoders.
- the input speech is analyzed in a step-by-step manner to determine what the most likely value is for the pitch period of the input speech. The important point to note is that this decision about the best pitch period is final. There is no comparison made against other possible pitch periods.
- the CELP vocoder has stored within it several hundred (or possibly several thousand) noise-like signals each of which is one frame long.
- the CELP vocoder uses each of these noise-like signals, in turn, to synthesize output speech and chooses the one which produces the minimum error between the input and synthesized speech signals, i.e., another closed-loop procedure.
- This stored set of noise-like signals is known as a codebook and the process of searching through each of the codebook signals in turn to find the best one is known as a codebook search.
- the major advantage of the closed-loop CELP approach is that, at the end of the search, the best possible values have been chosen for a given input speech signal--leading to major improvements in speech quality.
- CELP coding techniques require the transmission of only the LPC values, LTP values and address of the chosen codebook signal. It is not necessary to transmit an excitation signal. Consequently, CELP coding techniques are particularly desirable to increase the number of voice channels in the T1 system.
- The primary disadvantage with current CELP coding techniques is the amount of computing power required. In CELP coding it is necessary to search a large set of possible pitch values and codebook entries. The high complexity of the traditional CELP approach is only incurred at the transmitter since the receiver consists of just the simple synthesis structure shown in FIG. 2. The present invention overcomes the need to perform traditional codebook searching. In order to understand the significance of such an improvement, it is helpful to review the traditional CELP coding techniques.
- the general CELP speech signal conversion operation is shown in FIG. 3.
- The order of conversion processes is as follows: (i) compute LPC coefficients, (ii) use LPC coefficients in determining LTP parameters (i.e., best pitch period and corresponding gain β), (iii) use LPC coefficients and LTP parameters in a codebook search to determine the codebook parameters (i.e., the best codeword c(n) and corresponding gain g). In the present invention, it is this final process which has been improved.
- the codebook search strategy consists of taking each codebook vector (c(n)) in turn, passing it through the synthesis filter, comparing the output signal with the input speech signal and minimizing the error. Certain preprocessing steps are required. At the start of any particular frame, the excitation components associated with the LTP (p(n)) and the codebook (c(n)) are still to be computed. However even if both of these signals were to be completely zero for the whole frame, the synthesis filter nonetheless has some memory associated with it, thereby producing an output for the current frame even with no input. This frame of output due to the synthesis filter memory is known as the ringing vector r(n).
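The ringing vector described above is the zero-input response of the synthesis filter. A minimal sketch, assuming an all-pole filter with the convention out[n] = e[n] + sum of lpcs[k] * out[n-1-k] and a `memory` list holding the last outputs of the previous frame, most recent first (both are assumptions for illustration):

```python
def ringing_vector(lpcs, memory, frame_len):
    """Output of the synthesis filter for one frame when its input is all zeros."""
    history = list(memory)  # previous outputs, most recent first
    ringing = []
    for _ in range(frame_len):
        # With zero excitation, the output depends only on the filter memory.
        y = sum(a * h for a, h in zip(lpcs, history))
        ringing.append(y)
        history = [y] + history[:-1]
    return ringing
```

Because r(n) depends only on the previous frame, it can be computed once per frame and subtracted from the input speech before any codebook vector is tried.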
- the codebook signal c(n) can be represented in matrix form by an (N-by-1) vector c. This vector will have exactly the same elements as c(n) except in matrix form.
- The operation of filtering c by the impulse response of the LPC synthesis filter A can be represented by the matrix product Ac. This product produces the same result as the signal y(n) in equation (3) for β equal to zero.
- the synthesized output speech vector s' can be represented in matrix form as:
- equation (6) can be rearranged as:
- A typical prior art codebook search is shown in FIG. 4, which sets forth the implementation of equations 5, 6 and 7 above.
- the input speech signal has the ringing vector r removed.
- The LTP vector p (i.e., the pitch or periodic component p(n) of the excitation) is filtered by the LPC synthesis filter to form Ap.
- The codebook is populated by many hundreds of possible vectors c. Consequently, it is desirable not to form Ac or $c^T A^T$ for each possible codebook vector.
- This result is achieved by precomputing two variables before the codebook search, the (N-by-1) vector d and the (N-by-N) matrix F such that:
- The selected codebook vector is the vector associated with the largest value of $C_i^2 / G_i$.
- The correct gain g for a given codebook vector is given by $g = C_i / G_i$.
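Equation 9 is not reproduced in this text. The sketch below assumes the usual definitions $d = A^T x$ and $F = A^T A$, so that $C_i = c^T d$ and $G_i = c^T F c$, with A taken to be the lower-triangular matrix built from the synthesis filter impulse response. These definitions and all function names are assumptions for illustration, not the patent's own formulation.

```python
def synthesis_matrix(h, n):
    """Lower-triangular matrix A with A[r][c] = h[r - c] (assumed form of A)."""
    return [[h[r - c] if 0 <= r - c < len(h) else 0.0 for c in range(n)]
            for r in range(n)]


def precompute(A, x):
    """Form d = A^T x and F = A^T A once, before the codebook search."""
    n = len(x)
    d = [sum(A[r][c] * x[r] for r in range(n)) for c in range(n)]
    F = [[sum(A[r][i] * A[r][j] for r in range(n)) for j in range(n)]
         for i in range(n)]
    return d, F


def search(codebook, d, F):
    """Return (index, score, gain) maximizing C^2 / G, with gain = C / G."""
    n = len(d)
    best = (-1, 0.0, 0.0)
    for index, c in enumerate(codebook):
        C = sum(c[k] * d[k] for k in range(n))
        G = sum(c[i] * F[i][j] * c[j] for i in range(n) for j in range(n))
        if G > 0.0 and C * C / G > best[1]:
            best = (index, C * C / G, C / G)
    return best
```

With d and F in hand, no codebook vector has to be passed through the synthesis filter individually during the search.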
- codebook search involves the following steps for each vector: scaling the vector; filtering the vector by long term predictor components to add pitch information to the vector; filtering the vector by short term predictors to add spectral information; subtracting the scaled and double filtered vector from the original speech signal and analyzing the answer to determine whether the best codebook vector has been chosen.
- the problems of the prior art are overcome and the advantages of the invention are achieved in an apparatus and method for speech coding in which analog speech signals are converted to digital speech signals for transmission.
- the speech coder utilizing CELP techniques, includes a first filter for filtering out the spectral information from the speech signal. The spectral information is provided for transmission.
- a second filter is provided for filtering out the pitch information from the speech signal and such pitch information is also provided for transmission.
- a codevector generator determines, in one embodiment, the characteristics of a bi-pulse codevector representative of the speech signal. In this embodiment the impulse response of the first filter is truncated for determining the codevector characteristics.
- the codevector generator includes a transformer for transforming codevector possibilities from being representative of pulse-like sound to being representative of noise-like sound. It is especially preferred for the transform to be a Hadamard transform. It is also preferred to scramble the transformed codevector to modify the sequency properties.
- the bi-pulse codevector generator and the scrambled codevector generator are combined with a single pulse codevector generator. In such an embodiment, it is preferred to include a comparator for evaluating the characteristics determined by the three codebook generators and choosing the output of the one providing the best codebook vector.
- FIG. 1 is a block diagram of a prior art generalized LPC vocoder
- FIG. 2 is a block diagram of a prior art generalized CELP vocoder-receiver
- FIG. 3 is a block diagram of a prior art generalized CELP vocoder-transmitter
- FIG. 4 is a flow chart of a prior art CELP codebook search
- FIG. 5 is a schematic view of an adaptive speech coder in accordance with the present invention.
- FIG. 6 is a general flow chart of those operations performed in the adaptive coder shown in FIG. 5, prior to transmission;
- FIG. 7 is a flow chart of a codebook search technique in accordance with the present invention.
- FIG. 8 is a flow chart of another codebook search technique in accordance with the present invention.
- FIG. 9 is a flow chart of those operations performed in the adaptive transform coder shown in FIG. 5, subsequent to reception to perform speech synthesis.
- the present invention is embodied in a new and novel apparatus and method for adaptive speech coding wherein rates have been significantly reduced.
- the present invention enhances CELP coding for reduced transmission rates by providing more efficient methods for performing a codebook search.
- An adaptive CELP coder constructed in accordance with the present invention is depicted in FIG. 5 and is generally referred to as 10.
- the heart of coder 10 is a digital signal processor 12, which in the preferred embodiment is a TMS320C51 digital signal processor manufactured and sold by Texas Instruments, Inc. of Houston, Tex. Such a processor is capable of processing pulse code modulated signals having a word length of 16 bits.
- Processor 12 is shown to be connected to three major bus networks, namely serial port bus 14, address bus 16, and data bus 18.
- Program memory 20 is provided for storing the programming to be utilized by processor 12 in order to perform CELP coding techniques in accordance with the present invention. Such programming is explained in greater detail in reference to FIGS. 6 through 9.
- Program memory 20 can be of any conventional design, provided it has sufficient speed to meet the specification requirements of processor 12. It should be noted that the processor of the preferred embodiment (TMS320C51) is equipped with an internal memory.
- Data memory 22 is provided for the storing of data which may be needed during the operation of processor 12.
- a clock signal is provided by conventional clock signal generation circuitry (not shown) to clock input 24.
- the clock signal provided to input 24 is a 20 MHz clock signal.
- a reset input 26 is also provided for resetting processor 12 at appropriate times, such as when processor 12 is first activated. Any conventional circuitry may be utilized for providing a signal to input 26, as long as such signal meets the specifications called for by the chosen processor.
- Processor 12 is connected to transmit and receive telecommunication signals in two ways. First, when communicating with CELP coders constructed in accordance with the present invention, processor 12 is connected to receive and transmit signals via serial port bus 14. Channel interface 28 is provided in order to interface bus 14 with the compressed voice data stream. Interface 28 can be any known interface capable of transmitting and receiving data in conjunction with a data stream operating at the prescribed transmission rate.
- Second, when communicating with existing 64 kb/s channels or with analog devices, processor 12 is connected to receive and transmit signals via data bus 18.
- Converter 30 is provided to convert individual 64 kb/s channels appearing at input 32 from a serial format to a parallel format for application to bus 18. As will be appreciated, such conversion is accomplished utilizing known codecs and serial/parallel devices which are capable of use with the types of signals utilized by processor 12.
- processor 12 receives and transmits parallel 16 bit signals on bus 18.
- an interrupt signal is provided to processor 12 at input 34.
- analog interface 36 serves to convert analog signals by sampling such signals at a predetermined rate for presentation to converter 30.
- interface 36 converts the sampled signal from converter 30 to a continuous signal.
- With reference to FIGS. 6-9, the programming will be explained which, when utilized in conjunction with those components shown in FIG. 5, provides a new and novel CELP coder.
- Adaptive speech coding for transmission of telecommunications signals in accordance with the CELP techniques of the present invention is shown in FIG. 6.
- Telecommunication signals to be coded and transmitted appear on bus 18 and are presented to input buffer 40.
- Such telecommunication signals are sampled signals made up of 16 bit PCM representations of each sample where sampling occurs at a frequency of 8 kHz. For purposes of the present description, assume that a voice signal sampled at 8 kHz is to be coded for transmission.
- Buffer 40 accumulates a predetermined number of samples into a sample block.
- LPCs are determined for each block of speech samples at 42.
- The technique for determining the LPCs can be any desired technique such as that described in U.S. Pat. No. 5,012,517--Wilson et al., incorporated herein by reference. It is noted that the cited U.S. Patent concerns adaptive transform coding; however, the techniques described for determining LPCs are applicable to the present invention.
- the determined LPCs are formatted for transmission as side information at 44.
- the determined LPCs are also provided for LTP processing at 46, particularly to form the LPC synthesis filter.
- LTPs are determined for each block of speech samples at 46.
- the periodicity or pitch based information can be determined through the use of any known technique such as that described previously.
- the fundamental prerequisite for deriving an LTP filter is the calculation of a precise pitch or fundamental frequency estimate.
- the determined LTPs are also formatted for transmission as side information.
- the ringing vector associated with the synthesis filter is removed from the speech signal and the vector p (representative of LTP pitch information) is removed from the speech signal in accordance with equation (7), thereby forming the target vector x.
- the so-modified speech signal is thereafter provided for codebook searching in accordance with the present invention.
- As will be described herein, three forms of codebook searching are performed in the present invention, namely, bi-pulse searching at 50, scrambled searching at 52 and single pulse searching at 54.
- The bi-pulse searching technique is shown in FIG. 7. It will be recalled that codebooks can be populated by many hundreds of possible vectors c. Since it is not desirable to form Ac or $c^T A^T$ for each possible vector, two variables are precomputed before the codebook search: the (N-by-1) vector d and the (N-by-N) matrix F (equation 9). The process of pre-forming d by backward filtering is performed at 60.
- Two major requirements on codebook vectors c are (i) that they have a flat frequency spectrum (since they will be shaped into the correct form for each particular sound by the synthesis filter) and (ii) that each codeword is sufficiently different from every other so that entries in the codebook are not wasted by having several almost identical to each other.
- all the entries in the codebook effectively consist of an (N-by-1) vector which is zero in all of its N samples except for two entries which are +1 and -1 respectively.
- The preferred value of N is 64; however, in order to illustrate the principles of the invention, a smaller number of samples per vector is shown.
- Each codevector c is of the form $c = [0, \ldots, 0, +1, 0, \ldots, 0, -1, 0, \ldots, 0]^T$ (the two pulses may appear in either order).
- This form of vector is called a bi-pulse vector since it has only two non-zero pulses.
- This vector has the property of being spectrally flat as desired for codebook vectors. Since the +1 pulse can be in any of N possible positions and the -1 pulse can be in any one of (N-1) positions, the total number of combinations allowed is N(N-1). Since it is preferred that N equal 64, the potential size of the codebook is 4032 vectors. It is noted that use of a bi-pulse vector for the form of the codebook vector permits all the speech synthesis calculations by knowing the positioning of the +1, -1 pulses in the codevector c. Since only position information is required, no codebook need be stored. Therefore, the effect of a very large codebook can be achieved without requiring a large storage capacity.
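The point that no codebook need be stored can be illustrated with a short sketch. It assumes the correlation term is $C = c^T d$ (equation 9 is not reproduced in this text), so that for a bi-pulse vector it reduces to a difference of two entries of d. The function names are illustrative only.

```python
def bipulse_vector(i, j, n):
    """Explicit codevector with +1 at position i and -1 at position j."""
    c = [0] * n
    c[i], c[j] = 1, -1
    return c


def bipulse_positions(n):
    """Enumerate every (i, j) pair; this stands in for a stored codebook."""
    return [(i, j) for i in range(n) for j in range(n) if i != j]


def correlation_from_positions(i, j, d):
    """c^T d for a bi-pulse vector, using only the two pulse positions."""
    return d[i] - d[j]
```

Enumerating `bipulse_positions(64)` yields the N(N-1) combinations mentioned above without ever materializing the vectors themselves.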
- The problem of an ignored portion of the filtered vector is overcome by truncating the impulse response $\{a_n\}$ of the LPC synthesis filter to a small number of values, i.e., using a new impulse response $\{a'_n\}$ that equals $a_n$ for the first NTRUNC samples and is zero thereafter. This calculation of the impulse response and its truncation are performed at 62 in FIG. 7.
- the original impulse response is chopped off after a certain number of samples. Therefore, the energy produced by the filtered vector Ac will now be mostly concentrated in this frame wherever the pulses happen to be. It is presently preferred for the value of NTRUNC to be 8.
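A minimal sketch of computing and truncating the impulse response follows. It assumes zero-based sample indexing and an all-pole filter with the convention out[n] = e[n] + sum of lpcs[k] * out[n-1-k]; neither convention is stated in the text.

```python
NTRUNC = 8  # preferred truncation length given in the text


def impulse_response(lpcs, length):
    """Response of the assumed all-pole synthesis filter to a unit impulse."""
    history = [0.0] * len(lpcs)  # most recent output first
    a = []
    for n in range(length):
        e = 1.0 if n == 0 else 0.0
        y = e + sum(c * h for c, h in zip(lpcs, history))
        a.append(y)
        history = [y] + history[:-1]
    return a


def truncate_impulse_response(a, ntrunc=NTRUNC):
    """Keep the first ntrunc samples and zero the rest."""
    return [v if n < ntrunc else 0.0 for n, v in enumerate(a)]
```

After truncation, a pulse late in the frame and a pulse early in the frame both contribute at most `ntrunc` non-zero filtered samples, which is what evens out their chances of being selected.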
- Precomputing the (N-by-N) matrix F (equation 9), based on the truncated impulse response, is performed at 64.
- The full response computation is used for the gain calculation since, although the truncated impulse response evens up the chances of all pulse positions being picked for a particular frame, the values of $C_i$, $G_i$ produced by the bi-pulse process are not quite "exact" in the sense that they no longer exactly minimize the error between the gain-scaled filtered codevector gAc and the target vector x. Therefore, the untruncated response must be used to compute the value of the gain g which does actually minimize this error.
- The ratios $C_i^2/G_i$ and $C_i/G_i$ were also used in traditional codebook searching in order to find the best codeword and the appropriate gain. By use of the present invention, these values are calculated more quickly. However, the time necessary to calculate the best codebook vector and the efficiency of such calculations can be improved even further.
- With N equal to 64, even the simplified truncated search described above still requires the computation of $C_i$, $G_i$ for N(N-1) or 4,032 vectors, which would be prohibitive in terms of the processing power required. In the present invention only a very small subset of these possible codewords is searched. This reduced search yields almost identical performance to the full codebook search.
- This equation states that $G_i$ is actually the correlation of the filtered codebook vector y with itself (i.e., the total energy in this signal). If the two pulses in the codebook vector are widely spaced, the filter response to the +1 pulse will not interact with the response to the -1 pulse and thus the total energy in the filtered vector y will be very consistent and fairly independent of where these +1, -1 pulses actually are located within the frame.
- the codebook search procedure just consists of scanning the d vector for its largest positive component which reveals i (the position of the +1 within the codebook vector c) and the largest negative component which reveals j (the position of the -1 within the codebook vector c).
- The numerator-only search is much simpler than the alternative of computing $C_i$, $G_i$ for each codevector. However, it relies on the assumption that $G_i$ remains constant for all pulse positions and this assumption is only approximately valid--especially if the +1, -1 pulses are close together.
- A search is made for a number (NDBUF) of the largest positive values (where NDBUF is a number greater than 1) and NDBUF largest negative values.
- The assumption is now made that, even allowing for the slight variation in $G_i$ with pulse position, the "best" codeword will still come from the pulse positions corresponding to these two sets $\{d(i_{\max,k})\}$, $\{d(j_{\min,l})\}$.
- this numerator only search to select NDBUF largest positive elements and NDBUF largest negative elements is performed at 66.
- the energy value E is set to zero at 68.
- The complexity reduction process of doing a numerator-only search has the effect of winnowing down the number of codevectors to be searched from approximately 4000 to around 25 by calculating the largest set of $C_i$ values based on the assumption that $G_i$ is approximately constant. For each of these 25, both $C_i$ and $G_i$ (using the truncated impulse response) are then computed and the best codeword (position of +1 and -1) is found. For this one best codeword, the un-truncated impulse response is then used to compute the codebook gain g at 80. Both positions i and j as well as the gain g are provided for transmission.
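The reduced search just described can be sketched as follows. Several points are assumptions made for illustration: $C = d(i) - d(j)$ and $G = F_{ii} + F_{jj} - 2F_{ij}$ follow from taking $C = c^T d$ and $G = c^T F c$ for a bi-pulse vector; NDBUF = 5 is inferred from "around 25" candidates; and the energy value E is interpreted as the running best value of $C^2/G$. `F_trunc` and `F_full` are hypothetical names for the F matrix built from the truncated and untruncated impulse responses.

```python
def pulse_terms(i, j, d, F):
    """C and G for the bi-pulse vector with +1 at i and -1 at j (assumed forms)."""
    return d[i] - d[j], F[i][i] + F[j][j] - 2.0 * F[i][j]


def reduced_bipulse_search(d, F_trunc, F_full, ndbuf=5):
    """Numerator-only pre-selection followed by a small exact search."""
    order = sorted(range(len(d)), key=lambda k: d[k])
    negatives = order[:ndbuf]        # candidate positions for the -1 pulse
    positives = order[::-1][:ndbuf]  # candidate positions for the +1 pulse

    best_i, best_j, E = None, None, 0.0  # E starts at zero
    for i in positives:
        for j in negatives:
            if i == j:
                continue
            C, G = pulse_terms(i, j, d, F_trunc)
            if G > 0.0 and C * C / G > E:
                best_i, best_j, E = i, j, C * C / G
    if best_i is None:
        return None

    # Gain for the single winner uses the untruncated response.
    C, G = pulse_terms(best_i, best_j, d, F_full)
    return best_i, best_j, C / G
```

With `ndbuf=5` the inner loops evaluate at most 25 candidate pairs instead of all 4,032.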
- Unvoiced sounds can be classified into definite types.
- plosives e.g. t, p, k
- the speech waveform resembles a sharp pulse which quickly decays to almost zero.
- the bi-pulse codebook described above is very effective at representing these signals since it itself consists of pulses.
- the other class of unvoiced signals is the fricatives (e.g. s, sh, f) which have a speech waveform which resembles random noise.
- This type of signal is not well modeled by the sequence of pulses produced by the bi-pulse codebook and the effect of using bi-pulses on these signals is the introduction of a very coarse raspiness to the output speech.
- the ideal solution would be to take the bi-pulse codebook vectors and transform them in some way such that they produced noise-like waveforms. Such an operation has the additional constraint that the transformation be easy to compute since this computation will be done many times in each frame.
- the transformation of the preferred embodiment is achieved using the Hadamard Transform. While the Hadamard Transform is known, its use for the purpose described below is new.
- the Hadamard transform is associated with an (N-by-N) transform matrix H which operates on the codebook vector c.
- the transformed codevector c' will have elements which take one of the three values 0, -2, +2. The proportions of these three values occurring within c' will be 1/2, 1/4, 1/4 respectively.
- This form of codevector is called a ternary codevector (since it assumes three distinct values). While ternary vectors have been used in traditional random CELP codebooks, the ternary vector processing of the invention is new.
- the transform matrix H has a very wide range of sequencies within its columns. Since c' is composed of a combination of columns of H as in equation (19), the vector c' will have similar sequency properties to H in the respect that in some speech frames there will be many changes of sign within c' while other frames will have c' vectors with relatively few changes. The actual sequency will depend on the +1,-1 pulse positions within c.
- a high sequency c' vector has the frequency transform characteristic of being dominated by lots of energy at high frequencies while a low sequency c' has mainly low frequency components.
- the effect of this wide range of sequency is that there are very rapid changes in the frequency content of the output speech from one frame to the next. This has the effect of introducing a warbly, almost underwater effect to the synthesized speech.
- the result of more consistent sequency is achieved by introducing a "scrambling matrix" S of the form $S = \mathrm{diag}(\pm 1, \pm 1, \ldots, \pm 1)$, an (N-by-N) diagonal matrix where the elements along the main diagonal are randomly chosen as +1 or -1.
- a predetermined, fixed choice of +1 and -1 is used which does not change with time or on a frame-to-frame basis. It will be recalled that in the preferred embodiment N is 64.
- the preferred 64 diagonal values for the scrambling matrix S are as follows: -1, -1, -1, -1, -1, -1, 1, -1, 1, 1, -1, -1, -1, 1, 1, 1, -1, -1, 1, 1, 1, 1, 1, 1, -1, -1, 1, 1, 1, 1, 1, 1, 1, -1, 1, 1, 1, 1, 1, -1, -1, 1, 1, -1, 1, -1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, -1, 1, 1, 1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, 1, -1, -1, 1, -1, -1, -1.
- the target vector x, having been previously generated at 46, is again backward filtered to form vector d at 82.
- This computation is made up of three stages: (i) the calculation of A^t x is just the backward filtering operation described above; (ii) the multiplication by the scrambling matrix S is trivial since it just involves inverting the sign of certain entries (only the +1, -1 diagonal entries of S need be stored in memory rather than the whole (N-by-N) matrix); (iii) the Hadamard transform can be computed efficiently by fast algorithms.
- the single pulse codebook is made up of vectors that are zero in every sample except one which has a +1 value.
- This codebook is similar to the bi-pulse codebook not only in form but also in its computational details. Consequently, a flow chart similar to that shown in FIG. 7 has not been shown. If the +1 value occurs in row k of the codeword c, the values C_i, G_i are now computed as C_i = d_k and G_i = F_kk (equation 27).
- this codebook is identical to the bi-pulse codebook so that the concepts of a truncated impulse response for the codebook search and a numerator-only search are again utilized.
- the reason for the modification is that the SHC was designed to operate well for fricative unvoiced sounds (e.g. s, f, sh).
- the speech waveforms associated with these sounds are best described as being made up of a noise-like waveform with occasional large spikes/pulses.
- the bi-pulse codebook will represent these spikes very well but not the noise component, while the SHC will model the noise component but perform relatively poorly on the spikes.
- In FIG. 9, a receiver constructed in accordance with the present invention is disclosed. It is noted that FIG. 9, similar to FIG. 6, is representative of programming used in conjunction with device 10 shown in FIG. 5. Transmitted telecommunication signals appearing on bus 18 are first buffered at 120 in order to assure that all of the bits associated with a single block are operated upon relatively simultaneously. The buffered signals are thereafter deformatted at 122. LPC information is provided to synthesis filter 124. LTP information is provided to the periodic excitation generator 126. The output of generator 126 is multiplied by the gain β at multiplier 128. The i and j information, together with the identification of the particular search method chosen at 100 in FIG. 5, is provided to codevector construction generator 130. The output of generator 130 is multiplied by the gain g at multiplier 132. The outputs of multipliers 128 and 132 are summed in summer 134. The summed signal is provided to synthesis filter 124 as the excitation signal.
- the codevector will be a bi-pulse having a +1 at row i and a -1 at row j. If the scrambled search technique is used, the codevector c for the SHC can be readily formed since the pulse positions are known. This vector is then transformed and scrambled. Thereafter it is gain-scaled at 132 and filtered at 124 to form output speech vector gASHc. If the single pulse method was used, the codevector c is still capable of quick construction.
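The receiver-side codevector construction described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the mode names `"bipulse"`, `"shc"` and `"single"`, the function names, and the Sylvester ordering of the Hadamard matrix are all assumptions made for the example. The excitation follows equation (1), and the synthesis filter at 124 is left out.

```python
def hadamard_entry(r, k):
    """Entry (r, k) of a Sylvester-ordered Hadamard matrix (assumed ordering)."""
    return -1 if bin(r & k).count("1") % 2 else 1


def build_codevector(mode, n, i, j=None, s_diag=None):
    """Rebuild the codevector from the transmitted pulse positions.

    mode is a hypothetical label for the search method chosen at the encoder.
    """
    if mode == "bipulse":
        c = [0] * n
        c[i], c[j] = 1, -1          # +1 at row i, -1 at row j
        return c
    if mode == "single":
        c = [0] * n
        c[i] = 1                    # single +1 pulse
        return c
    if mode == "shc":
        # c'' = S H c: column i of H minus column j of H, then sign flips from S
        return [s_diag[r] * (hadamard_entry(r, i) - hadamard_entry(r, j))
                for r in range(n)]
    raise ValueError("unknown mode: " + mode)


def excitation(p, beta, c, g):
    """e(n) = beta * p(n) + g * c(n), equation (1)."""
    return [beta * pv + g * cv for pv, cv in zip(p, c)]


bi = build_codevector("bipulse", 8, 2, 5)
shc = build_codevector("shc", 8, 1, 2, s_diag=[1] * 8)
e = excitation([1.0, 1.0, 1.0, 1.0], 0.5, [0, 1, -1, 0], 2.0)
```

With an all-ones scrambling diagonal, the `"shc"` branch reduces to the plain Hadamard-transformed bi-pulse, which makes the ternary structure easy to inspect.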
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
$$e(n) = \beta \cdot p(n) + g \cdot c(n) \tag{1}$$
$$s'(n) = r(n) + y(n) \tag{4}$$
$$s' = r + Ae$$
$$gAc \cong s - r - \beta A p \tag{7}$$
$$C_i = c^{t} A^{t} x$$
$$G_i = c^{t} A^{t} A c \tag{8}$$
$$d = A^{t} x \quad \text{and} \quad F = A^{t} A \tag{9}$$
$$C_i = c^{t} d$$
$$G_i = c^{t} F c \tag{10}$$
$$C_i = (d_i - d_j)$$
$$G_i = (F_{ii} + F_{jj} - 2F_{ij}) \tag{11}$$
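Equations (8) through (11) can be checked numerically with a short sketch. It assumes that A is the lower-triangular Toeplitz matrix built from a (possibly truncated) impulse response h, which is the usual CELP convention but is not spelled out in this section; the function names and the sample values are illustrative only.

```python
def impulse_matrix(h, n):
    """Lower-triangular Toeplitz A with A[r][c] = h[r - c] (assumed form of A)."""
    return [[h[r - c] if 0 <= r - c < len(h) else 0.0 for c in range(n)]
            for r in range(n)]


def backward_filter(A, x):
    """d = A^t x, equation (9)."""
    n = len(x)
    return [sum(A[r][c] * x[r] for r in range(n)) for c in range(n)]


def gram(A):
    """F = A^t A, equation (9)."""
    n = len(A)
    return [[sum(A[r][a] * A[r][b] for r in range(n)) for b in range(n)]
            for a in range(n)]


def bipulse_terms(d, F, i, j):
    """C and G for +1 at row i and -1 at row j, equation (11)."""
    return d[i] - d[j], F[i][i] + F[j][j] - 2.0 * F[i][j]


def direct_terms(A, x, c):
    """C = c^t A^t x and G = c^t A^t A c computed the long way, equation (8)."""
    n = len(c)
    y = [sum(A[r][k] * c[k] for k in range(n)) for r in range(n)]
    return sum(y[r] * x[r] for r in range(n)), sum(v * v for v in y)


h = [1.0, 0.5, 0.25, 0.125]                       # illustrative impulse response
x = [0.3, -1.2, 0.8, 0.1, -0.4, 0.9, -0.7, 0.2]   # illustrative target vector
A = impulse_matrix(h, len(x))
d, F = backward_filter(A, x), gram(A)

c = [0.0] * len(x)
c[2], c[5] = 1.0, -1.0
fast = bipulse_terms(d, F, 2, 5)
slow = direct_terms(A, x, c)
```

Because d and F are computed once per frame, each bi-pulse candidate then costs one subtraction for C and three lookups for G, which is what makes the per-codevector search cheap.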
$$y = Ac \tag{13}$$
$$y^{t} = c^{t} A^{t} \tag{14}$$
$$C_i = d(i_{\max_k}) - d(j_{\min_l})$$
$$G_i = F(i_{\max_k}, i_{\max_k}) + F(j_{\min_l}, j_{\min_l}) - 2F(i_{\max_k}, j_{\min_l}) \tag{16}$$
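The shortlist search behind equation (16) might look like the sketch below. It takes the NDBUF largest and NDBUF smallest entries of d as candidate +1 and -1 positions, then evaluates C and G only for those pairs. Two points are assumptions rather than statements from this section: the selection criterion (maximizing C²/G) and the gain formula g = C/G, both of which are the conventional CELP choices. The function name and sample data are hypothetical.

```python
def shortlist_search(d, F, ndbuf):
    """Numerator-only shortlist, then a full C, G evaluation on ndbuf * ndbuf pairs."""
    order = sorted(range(len(d)), key=lambda k: d[k])
    j_min = order[:ndbuf]      # positions of the most negative d values (-1 pulse)
    i_max = order[-ndbuf:]     # positions of the most positive d values (+1 pulse)

    best = None
    for i in i_max:
        for j in j_min:
            if i == j:
                continue
            C = d[i] - d[j]
            G = F[i][i] + F[j][j] - 2.0 * F[i][j]
            if G <= 0.0:
                continue
            score = C * C / G                  # assumed selection criterion
            if best is None or score > best[3]:
                best = (i, j, C / G, score)    # (i, j, assumed gain, score)
    return best


d = [0.2, -1.5, 0.7, 2.0, -0.3, 0.1, -2.2, 1.1]
F = [[1.0 if a == b else 0.0 for b in range(8)] for a in range(8)]
best = shortlist_search(d, F, ndbuf=2)
```

With NDBUF set to 5 the inner loops visit 25 pairs, which matches the figure of around 25 candidates quoted in the description.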
$$c' = Hc \tag{18}$$
$$c' = (\text{column } i \text{ of } H) - (\text{column } j \text{ of } H) \tag{19}$$
$$c'' = SHc \tag{21}$$
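The ternary structure implied by equations (18) to (21) can be verified with a small sketch. It assumes a Sylvester-ordered Hadamard matrix, which this section does not specify, and it uses a made-up scrambling diagonal purely for illustration (not the patent's preferred values).

```python
from collections import Counter


def hadamard_entry(r, k):
    """Entry (r, k) of a Sylvester-ordered Hadamard matrix (assumed ordering)."""
    return -1 if bin(r & k).count("1") % 2 else 1


def transformed_bipulse(n, i, j):
    """c' = (column i of H) - (column j of H), equation (19)."""
    return [hadamard_entry(r, i) - hadamard_entry(r, j) for r in range(n)]


def scramble(v, s_diag):
    """c'' = S c' for a diagonal S: flip the sign wherever the diagonal is -1."""
    return [s * e for s, e in zip(s_diag, v)]


N = 64
c_prime = transformed_bipulse(N, 5, 20)
counts = Counter(c_prime)

# Hypothetical +1/-1 diagonal, chosen only to exercise the code.
s_diag = [-1 if k % 3 == 0 else 1 for k in range(N)]
c_dprime = scramble(c_prime, s_diag)
magnitudes = Counter(abs(v) for v in c_dprime)
```

In this ordering the 1/2, 1/4, 1/4 split holds when i and j are distinct and neither selects the all-ones column (index 0). Scrambling changes only signs, so the number of zero and nonzero entries is unchanged.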
$$C_i = c''^{t} A^{t} x$$
$$G_i = c''^{t} A^{t} A c'' \tag{22}$$
$$C_i = c^{t} H^{t} S^{t} A^{t} x = c^{t} H S A^{t} x$$
$$G_i = c^{t} H^{t} S^{t} A^{t} A S H c = c^{t} H S A^{t} A S H c \tag{23}$$
$$d'' = H S A^{t} x \tag{24}$$
$$C_i = c^{t} d'' \tag{25}$$
$$G_i = y''^{t} y'' \tag{26}$$
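Equations (22) to (26) can be exercised with the sketch below, which computes d'' once and then reads C for a bi-pulse directly from it. The assumptions are that A is a lower-triangular Toeplitz matrix built from an impulse response h, that H is in Sylvester ordering (so that H is symmetric), and that y'' = A c''. All names and sample values are illustrative.

```python
def fwht(v):
    """Unnormalised fast Walsh-Hadamard transform; len(v) must be a power of two."""
    out = list(v)
    h = 1
    while h < len(out):
        for start in range(0, len(out), 2 * h):
            for k in range(start, start + h):
                a, b = out[k], out[k + h]
                out[k], out[k + h] = a + b, a - b
        h *= 2
    return out


def backward_filter(h, x):
    """d = A^t x for the assumed Toeplitz A built from h."""
    n = len(x)
    return [sum(h[r - c] * x[r] for r in range(c, min(n, c + len(h))))
            for c in range(n)]


def forward_filter(h, e):
    """y = A e for the same A."""
    return [sum(h[r - c] * e[c] for c in range(max(0, r - len(h) + 1), r + 1))
            for r in range(len(e))]


def shc_terms(h, s_diag, x, i, j):
    """Return (C, G, C_direct) for a bi-pulse with +1 at i and -1 at j."""
    d = backward_filter(h, x)
    d2 = fwht([s * v for s, v in zip(s_diag, d)])      # d'' = H S A^t x, eq. (24)
    C = d2[i] - d2[j]                                  # C = c^t d'', eq. (25)

    c = [0.0] * len(x)
    c[i], c[j] = 1.0, -1.0
    c2 = [s * v for s, v in zip(s_diag, fwht(c))]      # c'' = S H c, eq. (21)
    y2 = forward_filter(h, c2)                         # assumed y'' = A c''
    G = sum(v * v for v in y2)                         # G = y''^t y'', eq. (26)
    C_direct = sum(a * b for a, b in zip(y2, x))       # c''^t A^t x, eq. (22)
    return C, G, C_direct


h = [1.0, 0.6, 0.3]
x = [0.4, -0.9, 1.3, 0.2, -0.5, 0.7, -1.1, 0.6]
s_diag = [1, -1, -1, 1, 1, 1, -1, 1]                   # hypothetical diagonal
C, G, C_direct = shc_terms(h, s_diag, x, 1, 6)
```

The agreement between `C` and `C_direct` relies on H and S both being symmetric, which is the step that turns the left-hand form of equation (23) into the right-hand form.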
$$C_i = d_k$$
$$G_i = F_{kk} \tag{27}$$
Claims (16)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/927,137 US5457783A (en) | 1992-08-07 | 1992-08-07 | Adaptive speech coder having code excited linear prediction |
US08/163,089 US5717824A (en) | 1992-08-07 | 1993-12-07 | Adaptive speech coder having code excited linear predictor with multiple codebook searches |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/927,137 US5457783A (en) | 1992-08-07 | 1992-08-07 | Adaptive speech coder having code excited linear prediction |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/163,089 Continuation-In-Part US5717824A (en) | 1992-08-07 | 1993-12-07 | Adaptive speech coder having code excited linear predictor with multiple codebook searches |
Publications (1)
Publication Number | Publication Date |
---|---|
US5457783A true US5457783A (en) | 1995-10-10 |
Family
ID=25454248
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US07/927,137 Expired - Lifetime US5457783A (en) | 1992-08-07 | 1992-08-07 | Adaptive speech coder having code excited linear prediction |
Country Status (1)
Country | Link |
---|---|
US (1) | US5457783A (en) |
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4817157A (en) * | 1988-01-07 | 1989-03-28 | Motorola, Inc. | Digital speech coder having improved vector excitation source |
US5119423A (en) * | 1989-03-24 | 1992-06-02 | Mitsubishi Denki Kabushiki Kaisha | Signal processor for analyzing distortion of speech signals |
US5138662A (en) * | 1989-04-13 | 1992-08-11 | Fujitsu Limited | Speech coding apparatus |
US5012517A (en) * | 1989-04-18 | 1991-04-30 | Pacific Communication Science, Inc. | Adaptive transform coder having long term predictor |
US5042069A (en) * | 1989-04-18 | 1991-08-20 | Pacific Communications Sciences, Inc. | Methods and apparatus for reconstructing non-quantized adaptively transformed voice signals |
US5060269A (en) * | 1989-05-18 | 1991-10-22 | General Electric Company | Hybrid switched multi-pulse/stochastic speech coding technique |
US5105464A (en) * | 1989-05-18 | 1992-04-14 | General Electric Company | Means for improving the speech quality in multi-pulse excited linear predictive coding |
US5224167A (en) * | 1989-09-11 | 1993-06-29 | Fujitsu Limited | Speech coding apparatus using multimode coding |
US5138661A (en) * | 1990-11-13 | 1992-08-11 | General Electric Company | Linear predictive codeword excited speech synthesizer |
US5195168A (en) * | 1991-03-15 | 1993-03-16 | Codex Corporation | Speech coder and method having spectral interpolation and fast codebook search |
US5187745A (en) * | 1991-06-27 | 1993-02-16 | Motorola, Inc. | Efficient codebook search for CELP vocoders |
Non-Patent Citations (2)
Title |
---|
Bergstrom et al, "High Temporal Resolution in Multi-Pulse Coding", 1989 Int'l Conf on Acoustics, Speech, & Signal Processing, May 23-26, 1989, pp. 770-773 vol. 2. * |
Cited By (45)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5701392A (en) * | 1990-02-23 | 1997-12-23 | Universite De Sherbrooke | Depth-first algebraic-codebook search for fast coding of speech |
US5754976A (en) * | 1990-02-23 | 1998-05-19 | Universite De Sherbrooke | Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech |
US5729654A (en) * | 1993-05-07 | 1998-03-17 | Ant Nachrichtentechnik Gmbh | Vector encoding method, in particular for voice signals |
EP0803117A4 (en) * | 1993-08-27 | 1997-10-29 | ||
EP0803117A1 (en) * | 1993-08-27 | 1997-10-29 | Pacific Communication Sciences, Inc. | Adaptive speech coder having code excited linear prediction |
US5657421A (en) * | 1993-12-13 | 1997-08-12 | U.S. Philips Corporation | Speech signal transmitter wherein coding is maintained during speech pauses despite substantial shut down of the transmitter |
US5668924A (en) * | 1995-01-18 | 1997-09-16 | Olympus Optical Co. Ltd. | Digital sound recording and reproduction device using a coding technique to compress data for reduction of memory requirements |
US5832180A (en) * | 1995-02-23 | 1998-11-03 | Nec Corporation | Determination of gain for pitch period in coding of speech signal |
US5878387A (en) * | 1995-03-23 | 1999-03-02 | Kabushiki Kaisha Toshiba | Coding apparatus having adaptive coding at different bit rates and pitch emphasis |
US5806034A (en) * | 1995-08-02 | 1998-09-08 | Itt Corporation | Speaker independent speech recognition method utilizing multiple training iterations |
US5826223A (en) * | 1995-11-29 | 1998-10-20 | Samsung Electronics Xco., Ltd. | Method for generating random code book of code-excited linear predictive coding |
US5797121A (en) * | 1995-12-26 | 1998-08-18 | Motorola, Inc. | Method and apparatus for implementing vector quantization of speech parameters |
US5819224A (en) * | 1996-04-01 | 1998-10-06 | The Victoria University Of Manchester | Split matrix quantization |
US5943644A (en) * | 1996-06-21 | 1999-08-24 | Ricoh Company, Ltd. | Speech compression coding with discrete cosine transformation of stochastic elements |
US5799271A (en) * | 1996-06-24 | 1998-08-25 | Electronics And Telecommunications Research Institute | Method for reducing pitch search time for vocoder |
US6408268B1 (en) * | 1997-03-12 | 2002-06-18 | Mitsubishi Denki Kabushiki Kaisha | Voice encoder, voice decoder, voice encoder/decoder, voice encoding method, voice decoding method and voice encoding/decoding method |
US6389006B1 (en) * | 1997-05-06 | 2002-05-14 | Audiocodes Ltd. | Systems and methods for encoding and decoding speech for lossy transmission networks |
US20020159472A1 (en) * | 1997-05-06 | 2002-10-31 | Leon Bialik | Systems and methods for encoding & decoding speech for lossy transmission networks |
US7554969B2 (en) | 1997-05-06 | 2009-06-30 | Audiocodes, Ltd. | Systems and methods for encoding and decoding speech for lossy transmission networks |
US5924062A (en) * | 1997-07-01 | 1999-07-13 | Nokia Mobile Phones | ACLEP codec with modified autocorrelation matrix storage and search |
US6499008B2 (en) * | 1998-05-26 | 2002-12-24 | Koninklijke Philips Electronics N.V. | Transceiver for selecting a source coder based on signal distortion estimate |
CZ304212B6 (en) * | 2000-11-27 | 2014-01-08 | Nippon Telegraph And Telephone Corporation | Method, coder and decoder, and programs for coding and decoding speech parameters, and method, coder and decoder, and programs for coding and decoding speech |
KR100438175B1 (en) * | 2001-10-23 | 2004-07-01 | 엘지전자 주식회사 | Search method for codebook |
US20050010402A1 (en) * | 2003-07-10 | 2005-01-13 | Sung Ho Sang | Wide-band speech coder/decoder and method thereof |
US20050084111A1 (en) * | 2003-09-09 | 2005-04-21 | Peter Lablans | Ternary and higher multi-value digital scramblers/descramblers |
US7002490B2 (en) * | 2003-09-09 | 2006-02-21 | Ternarylogic Llc | Ternary and higher multi-value digital scramblers/descramblers |
US7696785B2 (en) | 2004-02-25 | 2010-04-13 | Ternarylogic Llc | Implementing logic functions with non-magnitude based physical phenomena |
US8374289B2 (en) | 2004-02-25 | 2013-02-12 | Ternarylogic Llc | Generation and detection of non-binary digital sequences |
US20050184888A1 (en) * | 2004-02-25 | 2005-08-25 | Peter Lablans | Generation and detection of non-binary digital sequences |
US7580472B2 (en) | 2004-02-25 | 2009-08-25 | Ternarylogic Llc | Generation and detection of non-binary digital sequences |
US20090128190A1 (en) * | 2004-02-25 | 2009-05-21 | Peter Lablans | Implementing Logic Functions with Non-Magnitude Based Physical Phenomena |
US7548092B2 (en) | 2004-02-25 | 2009-06-16 | Ternarylogic Llc | Implementing logic functions with non-magnitude based physical phenomena |
US20050194993A1 (en) * | 2004-02-25 | 2005-09-08 | Peter Lablans | Single and composite binary and multi-valued logic functions from gates and inverters |
US7218144B2 (en) | 2004-02-25 | 2007-05-15 | Ternarylogic Llc | Single and composite binary and multi-valued logic functions from gates and inverters |
US20060021003A1 (en) * | 2004-06-23 | 2006-01-26 | Janus Software, Inc | Biometric authentication system |
US20060031278A1 (en) * | 2004-08-07 | 2006-02-09 | Peter Lablans | Multi-value digital calculating circuits, including multipliers |
US7562106B2 (en) | 2004-08-07 | 2009-07-14 | Ternarylogic Llc | Multi-value digital calculating circuits, including multipliers |
US20100164548A1 (en) * | 2004-09-08 | 2010-07-01 | Ternarylogic Llc | Implementing Logic Functions With Non-Magnitude Based Physical Phenomena |
KR100633391B1 (en) | 2004-11-03 | 2006-10-16 | 한국전자통신연구원 | Codec transparent analog scrambler encryption / decoding device and method thereof |
US20100232540A1 (en) * | 2009-03-13 | 2010-09-16 | Huawei Technologies Co., Ltd. | Preprocessing method, preprocessing apparatus and coding device |
CN101609677B (en) * | 2009-03-13 | 2012-01-04 | 华为技术有限公司 | Preprocessing method, preprocessing device and preprocessing encoding equipment |
US8566085B2 (en) | 2009-03-13 | 2013-10-22 | Huawei Technologies Co., Ltd. | Preprocessing method, preprocessing apparatus and coding device |
US8831961B2 (en) | 2009-03-13 | 2014-09-09 | Huawei Technologies Co., Ltd. | Preprocessing method, preprocessing apparatus and coding device |
CN111724814A (en) * | 2020-06-22 | 2020-09-29 | 广东西欧克实业有限公司 | A one-button intelligent voice interactive microphone system and using method |
CN111724814B (en) * | 2020-06-22 | 2025-01-03 | 广东西欧克实业有限公司 | One-key intelligent voice interaction microphone system and use method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5457783A (en) | Adaptive speech coder having code excited linear prediction | |
US5717824A (en) | Adaptive speech coder having code excited linear predictor with multiple codebook searches | |
US5327520A (en) | Method of use of voice message coder/decoder | |
US5884253A (en) | Prototype waveform speech coding with interpolation of pitch, pitch-period waveforms, and synthesis filter | |
EP0470975B1 (en) | Methods and apparatus for reconstructing non-quantized adaptively transformed voice signals | |
EP0360265B1 (en) | Communication system capable of improving a speech quality by classifying speech signals | |
US6055496A (en) | Vector quantization in celp speech coder | |
US6006174A (en) | Multiple impulse excitation speech encoder and decoder | |
EP0523979A2 (en) | Low bit rate vocoder means and method | |
WO1980002211A1 (en) | Residual excited predictive speech coding system | |
US4945565A (en) | Low bit-rate pattern encoding and decoding with a reduced number of excitation pulses | |
US5027405A (en) | Communication system capable of improving a speech quality by a pair of pulse producing units | |
US5091946A (en) | Communication system capable of improving a speech quality by effectively calculating excitation multipulses | |
US5570453A (en) | Method for generating a spectral noise weighting filter for use in a speech coder | |
US5839098A (en) | Speech coder methods and systems | |
US5235670A (en) | Multiple impulse excitation speech encoder and decoder | |
US5692101A (en) | Speech coding method and apparatus using mean squared error modifier for selected speech coder parameters using VSELP techniques | |
US5673361A (en) | System and method for performing predictive scaling in computing LPC speech coding coefficients | |
WO1995006310A1 (en) | Adaptive speech coder having code excited linear prediction | |
EP0573215A2 (en) | Vocoder synchronization | |
JP2946528B2 (en) | Voice encoding / decoding method and apparatus | |
JP2615862B2 (en) | Voice encoding / decoding method and apparatus | |
Viswanathan et al. | A harmonic deviations linear prediction vocoder for improved narrowband speech transmission | |
JPH05127700A (en) | Method and device for speech encoding and decoding | |
Easton | Fast algorithms for low bit rate digital speech coding |
Legal Events
Code | Title | Description |
---|---|---|
AS | Assignment | Owner name: PACIFIC COMMUNICATION SCIENCES, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST.;ASSIGNOR:CHHATWAL, HARPRIT S.;REEL/FRAME:006255/0773 Effective date: 19920903 |
STCF | Information on status: patent grant | Free format text: PATENTED CASE |
CC | Certificate of correction | |
AS | Assignment | Owner name: BANK OF AMERICA NATIONAL TRUST & SAVINGS ASSOCIATI Free format text: SECURITY INTEREST;ASSIGNOR:PACIFIC COMMUNICATION SCIENCES, INC.;REEL/FRAME:007936/0861 Effective date: 19960430 |
AS | Assignment | Owner name: PACIFIC COMMUNICATIONS SCIENCES, INC., CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST IN CERTAIN ASSETS (PATENTS);ASSIGNOR:BANK OF AMERICA NATIONAL TRUST AND SAVINGS ASSOCIATION, AS AGENT;REEL/FRAME:008587/0343 Effective date: 19961212 |
AS | Assignment | Owner name: NUERA COMMUNICATIONS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PACIFIC COMMUNICATION SCIENCES, INC. (PCSI);REEL/FRAME:008811/0177 Effective date: 19971121 Owner name: NUERA COMMUNICATIONS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PACIFIC COMMUNICATION SCIENCES, INC. (PCSI);REEL/FRAME:008811/0079 Effective date: 19971119 |
AS | Assignment | Owner name: NEUERA COMMUNICATIONS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PACIFIC COMMUNICATION SCIENCES, INC (PCSI);REEL/FRAME:008848/0188 Effective date: 19971211 |
AS | Assignment | Owner name: NUERA OPERATING COMPANY, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NUERA COMMUNICATIONS, INC.;REEL/FRAME:008861/0280 Effective date: 19971219 |
AS | Assignment | Owner name: NUERA COMMUNICATIONS, INC., A CORP. OF DE, CALIFOR Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PACIFIC COMMUNICATIONS SCIENCES, INC., A DELAWARE CORPORATION;REEL/FRAME:008886/0535 Effective date: 19960101 |
AS | Assignment | Owner name: CREDIT SUISSE FIRST BOSTON, NEW YORK Free format text: SECURITY INTEREST;ASSIGNORS:CONEXANT SYSTEMS, INC.;BROOKTREE CORPORATION;BROOKTREE WORLDWIDE SALES CORPORATION;AND OTHERS;REEL/FRAME:009719/0537 Effective date: 19981221 |
FPAY | Fee payment | Year of fee payment: 4 |
AS | Assignment | Owner name: NVERA HOLDINGS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NVERA OPETATING COMPANY, INC.;REEL/FRAME:011122/0720 Effective date: 19971219 Owner name: NUERA COMMUNICATIONS, INC., A CORPORATION OF DELAW Free format text: CHANGE OF NAME;ASSIGNOR:NUERA HOLDINGS, INC., A CORPORATION OF DELAWARE;REEL/FRAME:011137/0042 Effective date: 19980319 |
AS | Assignment | Owner name: CONEXANT SYSTEMS, INC., CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 Owner name: BROOKTREE CORPORATION, CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 Owner name: BROOKTREE WORLDWIDE SALES CORPORATION, CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 Owner name: CONEXANT SYSTEMS WORLDWIDE, INC., CALIFORNIA Free format text: RELEASE OF SECURITY INTEREST;ASSIGNOR:CREDIT SUISSE FIRST BOSTON;REEL/FRAME:012252/0413 Effective date: 20011018 |
AS | Assignment | Owner name: SILICON VALLEY BANK, CALIFORNIA Free format text: SECURITY AGREEMENT;ASSIGNOR:NUERA COMMUNICATIONS, INC.;REEL/FRAME:013045/0219 Effective date: 20020630 |
FEPP | Fee payment procedure | Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAT HOLDER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO SMALL (ORIGINAL EVENT CODE: LTOS); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAT HOLDER NO LONGER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: STOL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
FPAY | Fee payment | Year of fee payment: 8 |
AS | Assignment | Owner name: MINDSPEED TECHNOLOGIES, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:CONEXANT SYSTEMS, INC.;REEL/FRAME:014468/0137 Effective date: 20030627 |
AS | Assignment | Owner name: CONEXANT SYSTEMS, INC., CALIFORNIA Free format text: SECURITY AGREEMENT;ASSIGNOR:MINDSPEED TECHNOLOGIES, INC.;REEL/FRAME:014546/0305 Effective date: 20030930 |
AS | Assignment | Owner name: NUERA COMMUNICATIONS INC., CALIFORNIA Free format text: RELEASE;ASSIGNOR:SILICON VALLEY BANK;REEL/FRAME:016164/0486 Effective date: 20050105 |
FEPP | Fee payment procedure | Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
FPAY | Fee payment | Year of fee payment: 12 |
AS | Assignment | Owner name: AUDIOCODES INC., NEW JERSEY Free format text: MERGER;ASSIGNOR:AUDIOCODES SAN DIEGO INC.;REEL/FRAME:021763/0963 Effective date: 20071212 Owner name: AUDIOCODES SAN DIEGO INC., CALIFORNIA Free format text: CHANGE OF NAME;ASSIGNOR:NUERA COMMUNICATIONS INC.;REEL/FRAME:021763/0968 Effective date: 20070228 |
AS | Assignment | Owner name: CIRRUS LOGIC INC., TEXAS Free format text: MERGER;ASSIGNOR:PACIFIC COMMUNICATION SCIENCES INC.;REEL/FRAME:045630/0333 Effective date: 20150929 |