GitHub - golden1232004/SPTK

Branches Tags
Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src		src
README		README
README.jp		README.jp
Repository files navigation

            Speech Signal Processing Toolkit (SPTK)
             version 3.0 release December 25, 2002

The Speech Signal Processing Toolkit (SPTK) was developed and
has been used in the research group of Prof. Satoshi Imai
(currently he is with Chiba Institute of Technology) and
Prof. Takao Kobayashi (currently he is with Interdisciplinary
Graduate School of Science and Engineering, Tokyo Institute of
Technology) at P&I laboratory, Tokyo Institute of Technology.
This time some tools are chosen and arranged for distribution by
Associate Prof. Keiichi Tokuda (currently he is with Department
of Computer Science, Nagoya Institute of Technology) as an
organizer in cooperation with Takashi Masuko (currently he is a
Research Associate at Department of Interdisciplinary Graduate
School of Science and Engineering, Tokyo Institute of
Technology), Dr. Kazuhito Koishida (currently he is with
Microsoft Research), and some graduate students.

The original source codes have been written by many people who
took part in research of the research group.  The most original
source codes of this distribution were written by Takao
Kobayashi (graph, data processing, FFT, sampling rate
conversion, etc.), Keiichi Tokuda (speech analysis, speech
synthesis, etc.), and Kazuhito Koishida (LSP, vector
quantization, etc.).

This version is accompanied by a Reference Manual (a total of
180 pages written in Japanese). A User's Manual "Examples for
using SPTK" is also attached.  We are planing to translate the
Reference Manual into English.

****************************************************************
                            Copying
****************************************************************

The Speech Signal Processing Toolkit (SPTK) version 3.0 is
released under a free license, without the commercial
restrictions we imposed on early versions (1.0 and 2.0).  The
toolkit has been placed under an X11 type license which as free
licenses go is pretty free.  We have deliberately chosen a
license that should be compatible with our commercial partners
and our free software users.  Note that there is no difference
between version 2.0 and 3.0 except the license.

However, although the code is free, we still offer no warranties
and no maintenance.  We will continue to endeavor to fix bugs
and answer queries when can, but are not in a position to
guarantee it.  We will consider maintenance contracts and
consultancy if desired, please contacts us for details.

If you are using SPTK in commercial environment, even though no
license is required, we would be grateful if you let us know as
it helps justify ourselves to our various sponsors.

The current copyright on the toolkit is

----------------------------------------------------------------
      Speech Signal Processing Toolkit (SPTK): version 3.0
                       SPTK Working Group

                 Department of Computer Science
                 Nagoya Institute of Technology
                              and
  Interdisciplinary Graduate School of Science and Engineering
                 Tokyo Institute of Technology
                    Copyright (c) 1984-2000
                      All Rights Reserved.

Permission is hereby granted, free of charge, to use and
distribute this software and its documentation without
restriction, including without limitation the rights to use,
copy, modify, merge, publish, distribute, sublicense, and/or
sell copies of this work, and to permit persons to whom this
work is furnished to do so, subject to the following conditions:

  1. The code must retain the above copyright notice, this list
     of conditions and the following disclaimer.

  2. Any modifications must be clearly marked as such.
                                                                       
NAGOYA INSTITUTE OF TECHNOLOGY, TOKYO INSITITUTE OF TECHNOLOGY,
SPTK WORKING GROUP, AND THE CONTRIBUTORS TO THIS WORK DISCLAIM
ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT
SHALL NAGOYA INSTITUTE OF TECHNOLOGY, TOKYO INSITITUTE OF
TECHNOLOGY, SPTK WORKING GROUP, NOR THE CONTRIBUTORS BE LIABLE
FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY
DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTUOUS
ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR
PERFORMANCE OF THIS SOFTWARE.
----------------------------------------------------------------

****************************************************************
                          Environment
****************************************************************

We expect that all programs can be compiled and can work on most
UNIX workstations.  Although we have confirmed that all programs
can be compiled in the following environments:

  SunOS, Solaris, HP-UX, FreeBSD, OSF1, Linux

we have not confirmed that all programs can work properly in all
these environments.

Note that some commands need C-shell (/bin/csh) since they are
implemented by C-shell scripts.

****************************************************************
                          Installation
****************************************************************

0) Enter 'src' directory.

1) Read the comments in "Makefile" and make any necessary changes.

2) To build and install all tools, type:

        % make
        % make install

The X-window library is not required for compilation of all tools
except a command 'xgr' since only 'xgr' uses the X-window library.

****************************************************************
                             Notice
****************************************************************

As the version advances, specifications for the Speech Signal
Processing Toolkit will be changed without notifications.

The distributed tools are a sub-set of the whole toolkit.
Therefore, some commands which appear at EXAMPLE and SEE ALSO in
the Reference Manual may not be included in this version.

****************************************************************
                           Bug report
****************************************************************

Bug reports, comments, questions for the Speech Signal
Processing Toolkit are very welcome.  Please send them to

   dsp-cmnd@ip.titech.ac.jp

by email. We will reply to any questions as much as possible but
we cannot assure that we will reply to all messages.

Please see

   http://kt-lab.ics.nitech.ac.jp/~tokuda/SPTK/

for more information about the Speech Signal Processing Toolkit,
(e.g., Examples for using Speech Signal Processing Toolkit).

****************************************************************
                             Notes
****************************************************************

Generic properties of each command are summarized as follows:

 i) Data has float-type format, i.e., single-precision floating
    point format.  This can be changed to double-type format by
    specifying a compile option. It is noted that in the current
    version the "float" specifications in the help messages do
    not change by the compile option.

 ii) Data files do not have headers or any structures, i.e.,
     they are flat row files.

iii) Basically they manipulate data through standard input and
     standard output.

 iv) To display (error) messages they use standard error output
     rather than standard output.

  v) They do not request interactive key inputs.

 vi) Options are specified on the command line.

****************************************************************
                        Acknowledgments
****************************************************************

The following people have contributed to the development of SPTK
in various ways.  It is their work that makes it all possible.
In no special order:

Takao Kobayashi
Keiichi Tokuda
Takashi Masuko
Chiyomi Miyajima
Masatsune Tamura
Takayoshi Yoshimura
Shinji Sako
Yoshihiko Nankaku
Fernando Gil Resende Junior
Toshihiko Kato
Gou Hirabayashi
Naohiro Isshiki
Noboru Miyazaki
Toshio Kanno
Kenji Chiba
Toshiaki Fukada
Satoshi Imai
Tadashi Kitamura

and others.

****************************************************************
                           Who we are
****************************************************************

SPTK working group is a voluntary group for developing the
Speech Signal Processing Toolkit.  Current SPTK working group
members are

  Keiichi Tokuda (Organizer) tokuda@ics.nitech.ac.jp
  Takashi Masuko             masuko@ip.titech.ac.jp
  Shinji Sako                sako@ics.nitech.ac.jp
  Yoshihiko Nankaku          nankaku@ics.nitech.ac.jp
  Jun-ichi Yamagishi         Junichi.Yamagishi@ip.titech.ac.jp

and the members are dynamically changing.  The current formal
contact address of SPTK working group is
dsp-cmnd@ip.titech.ac.jp.