如何从TextGrid文件中提取特定数据?

2024-09-19 20:31:35 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一个TextGrid文件,它是从一个强制对齐器webMAUS输出的,我主要想了解如何使用python脚本从中提取特定的数据并输入到csv文件中,但我不确定如何进行。我需要能够运行大量的数据这一点

我想提取句子的持续时间,它在第一层,复合词“试用价”的持续时间,以及复合词中音素的持续时间,“试用价”在最后一层

多谢各位

File type = "ooTextFile" Object class = "TextGrid" xmin = 0 xmax = 4.360703 tiers? <exists> size = 3 item []: item [1]: class = "IntervalTier" name = "ORT-MAU" xmin = 0 xmax = 4.360703 intervals: size = 9 intervals [1]: xmin = 0.000000 xmax = 1.380000 text = "" intervals [2]: xmin = 1.380000 xmax = 1.570000 text = "She" intervals [3]: xmin = 1.570000 xmax = 1.800000 text = "told" intervals [4]: xmin = 1.800000 xmax = 1.920000 text = "me" intervals [5]: xmin = 1.920000 xmax = 2.150000 text = "about" **intervals [6]: xmin = 2.150000 xmax = 2.230000 text = "the" intervals [7]: xmin = 2.230000 xmax = 2.700000 text = "trial" intervals [8]: xmin = 2.700000 xmax = 3.010000 text = "offer"** intervals [9]: xmin = 3.010000 xmax = 4.360703 text = "" item [2]: class = "IntervalTier" name = "KAN-MAU" xmin = 0 xmax = 4.360703 intervals: size = 9 intervals [1]: xmin = 0.000000 xmax = 1.380000 text = "" intervals [2]: xmin = 1.380000 xmax = 1.570000 text = "S i:" intervals [3]: xmin = 1.570000 xmax = 1.800000 text = "t @U l d" intervals [4]: xmin = 1.800000 xmax = 1.920000 text = "m i:" intervals [5]: xmin = 1.920000 xmax = 2.150000 text = "@ b aU t" intervals [6]: xmin = 2.150000 xmax = 2.230000 text = "D @" intervals [7]: xmin = 2.230000 xmax = 2.700000 text = "t r aI @ l" intervals [8]: xmin = 2.700000 xmax = 3.010000 text = "Q f @" intervals [9]: xmin = 3.010000 xmax = 4.360703 text = "" item [3]: class = "IntervalTier" name = "MAU" xmin = 0 xmax = 4.360703 intervals: size = 23 intervals [1]: xmin = 0.000000 xmax = 1.380000 text = "<p:>" intervals [2]: xmin = 1.380000 xmax = 1.490000 text = "S" intervals [3]: xmin = 1.490000 xmax = 1.570000 text = "I" intervals [4]: xmin = 1.570000 xmax = 1.700000 text = "t" intervals [5]: xmin = 1.700000 xmax = 1.740000 text = "@U" intervals [6]: xmin = 1.740000 xmax = 1.800000 text = "l" intervals [7]: xmin = 1.800000 xmax = 1.860000 text = "m" intervals [8]: xmin = 1.860000 xmax = 1.920000 text = "I" intervals [9]: xmin = 1.920000 xmax = 1.960000 text = "@" intervals [10]: xmin = 1.960000 xmax = 2.010000 text = "b" intervals [11]: xmin = 2.010000 xmax = 2.120000 text = "aU" intervals [12]: xmin = 2.120000 xmax = 2.150000 text = "t" intervals [13]: xmin = 2.150000 xmax = 2.180000 text = "D" intervals [14]: xmin = 2.180000 xmax = 2.230000 text = "@" intervals [15]: xmin = 2.230000 xmax = 2.370000 text = "t" intervals [16]: xmin = 2.370000 xmax = 2.430000 text = "r" intervals [17]: xmin = 2.430000 xmax = 2.580000 text = "aI" intervals [18]: xmin = 2.580000 xmax = 2.610000 text = "@" intervals [19]: xmin = 2.610000 xmax = 2.700000 text = "l" intervals [20]: xmin = 2.700000 xmax = 2.820000 text = "Q" intervals [21]: xmin = 2.820000 xmax = 2.920000 text = "f" intervals [22]: xmin = 2.920000 xmax = 3.010000 text = "@" intervals [23]: xmin = 3.010000 xmax = 4.360703 text = "<p:>"

Tags: 文件数据textnamesizeitemclass持续时间