首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >将文本文件转换为json格式或筛选出键和值对。

将文本文件转换为json格式或筛选出键和值对。
EN

Stack Overflow用户
提问于 2017-02-21 03:49:57
回答 1查看 778关注 0票数 1

我试图执行一个任务,该任务可以从txt文件中找到键和值。我脑子里有些想法。1.使用regex 2.将txt转换为json。

我很难过滤和转换格式。有什么好的方法来完成这个任务,或者任何可用的api都可以执行这样的任务吗?

txt文件:

代码语言:javascript
复制
System:    Host: ict-vm Kernel: 4.4.0-53-generic i686 (32 bit)
           Desktop: MATE 1.16.1  Distro: Linux Mint 18.1 Serena
Machine:   System: ASUSTeK (portable) product: N43SN v: 1.0
           Mobo: ASUSTeK model: N43SN v: 1.0 serial: NB-1234567890
           Bios: American Megatrends v: N43SN.412 date: 09/21/2011
CPU:       Quad core Intel Core i7-2670QM (-HT-MCP-) cache: 6144 KB 
           clock speeds: max: 3100 MHz 1: 807 MHz 2: 814 MHz 3: 800 MHz
           4: 1078 MHz 5: 990 MHz 6: 811 MHz 7: 801 MHz 8: 844 MHz
Graphics:  Card-1: Intel 2nd Generation Core Processor Family Integrated Graphics Controller
           Card-2: NVIDIA GF108M [GeForce GT 550M]
           Display Server: X.org 1.18.4 drivers: intel (unloaded: fbdev,vesa) FAILED: nouveau
           tty size: 80x24 Advanced Data: N/A for root
Audio:     Card Intel 6 Series/C200 Series Family High Definition Audio Controller
           driver: snd_hda_intel
           Sound: Advanced Linux Sound Architecture v: k4.4.0-53-generic
Network:   Card-1: Qualcomm Atheros AR9285 Wireless Network Adapter (PCI-Express)
           driver: ath9k
           IF: wlp3s0 state: up mac: 7x:2t:61:d4:72:8a
           Card-2: Qualcomm Atheros AR8151 v2.0 Gigabit Ethernet driver: atl1c
           IF: enp4s0 state: down mac: 14:da:e1:ay:72:b5
Drives:    HDD Total Size: 507.9GB (0.4% used)
           ID-1: /dev/sda model: ST95005620AS size: 500.1GB
           ID-2: USB /dev/sdb model: DataTraveler_3.0 size: 7.8GB
RAID:      No RAID devices: /proc/mdstat, md_mod kernel module present
Sensors:   System Temperatures: cpu: 48.0C mobo: N/A gpu: 43.0
           Fan Speeds (in rpm): cpu: N/A
Info:      Processes: 223 Uptime: 40 min Memory: 531.1/3948.3MB
           Client: Shell (sudo) inxi: 2.2.35System:    Host: ict-vm Kernel: 4.4.0-53-generic i686 (32 bit)
           Desktop: MATE 1.16.1  Distro: Linux Mint 18.1 Serena
Machine:   System: ASUSTeK (portable) product: N43SN v: 1.0
           Mobo: ASUSTeK model: N43SN v: 1.0 serial: NB-1234567890
           Bios: American Megatrends v: N43SN.412 date: 09/21/2011
CPU:       Quad core Intel Core i7-2670QM (-HT-MCP-) cache: 6144 KB 
           clock speeds: max: 3100 MHz 1: 807 MHz 2: 814 MHz 3: 800 MHz
           4: 1078 MHz 5: 990 MHz 6: 811 MHz 7: 801 MHz 8: 844 MHz
Graphics:  Card-1: Intel 2nd Generation Core Processor Family Integrated Graphics Controller
           Card-2: NVIDIA GF108M [GeForce GT 550M]
           Display Server: X.org 1.18.4 drivers: intel (unloaded: fbdev,vesa) FAILED: nouveau
           tty size: 80x24 Advanced Data: N/A for root
Audio:     Card Intel 6 Series/C200 Series Family High Definition Audio Controller
           driver: snd_hda_intel
           Sound: Advanced Linux Sound Architecture v: k4.4.0-53-generic
Network:   Card-1: Qualcomm Atheros AR9285 Wireless Network Adapter (PCI-Express)
           driver: ath9k
           IF: wlp3s0 state: up mac: 7x:2t:61:d4:72:8a
           Card-2: Qualcomm Atheros AR8151 v2.0 Gigabit Ethernet driver: atl1c
           IF: enp4s0 state: down mac: 14:da:e1:ay:72:b5
Drives:    HDD Total Size: 507.9GB (0.4% used)
           ID-1: /dev/sda model: ST95005620AS size: 500.1GB
           ID-2: USB /dev/sdb model: DataTraveler_3.0 size: 7.8GB
RAID:      No RAID devices: /proc/mdstat, md_mod kernel module present
Sensors:   System Temperatures: cpu: 48.0C mobo: N/A gpu: 43.0
           Fan Speeds (in rpm): cpu: N/A
Info:      Processes: 223 Uptime: 40 min Memory: 531.1/3948.3MB
           Client: Shell (sudo) inxi: 2.2.35

json文件:

代码语言:javascript
复制
{
   System: {
      Host:'ict-vm',
      Kernel:'4.4.0-53-generic i686 (32 bit)',
       .....
      },
   Machine:{
      System: 'ASUSTeK (portable)'
    }
}

预期结果:我可以使用键获得值。

代码语言:javascript
复制
print(node['System'])

输出:

代码语言:javascript
复制
ASUSTeK (portable)

提前感谢

EN

回答 1

Stack Overflow用户

回答已采纳

发布于 2017-02-21 04:04:04

如果文件格式是固定的,并且没有嵌套的dicts,那么可以很容易地使用

代码语言:javascript
复制
import re
x="""System:       Host: ict-vm Kernel: 4.4.0-53-generic i686 (32 bit) Desktop: MATE 1.16.1  Distro: Linux Mint 18.1 Serena
Machine:      System: ASUSTeK (portable) product: N43SN v: 1.0 Mobo: ASUSTeK model: N43SN v: 1.0 serial: NB-1234567890 Bios: American Megatrends v: N43SN.412 date: 09/21/2011"""
y = re.findall(r"(\S+):\s*(.*?)(?=\s*\S+:|$)",x)
d={}
for i,j in y:
    if not j:
        d[i]={}
        k=d[i]
    else:
        k[i]=j
print d

输出:

{'Machine': {'product': 'N43SN', 'Mobo': 'ASUSTeK', 'System': 'ASUSTeK (portable)', 'Bios': 'American Megatrends', 'v': 'N43SN.412', 'date': '09/21/2011', 'model': 'N43SN', 'serial': 'NB-1234567890'}, 'System': {'Kernel': '4.4.0-53-generic i686 (32 bit)', 'Host': 'ict-vm', 'Distro': 'Linux Mint 18.1 Serena', 'Desktop': 'MATE 1.16.1'}}

您可以使用d['Machine']['System']访问

输出:ASUSTeK (portable)

编辑:

对于新的输入文件,我们需要调整regex.So的使用

代码语言:javascript
复制
\s*([^:\n]+):\s*((?:(?!: |\n)[\s\S])*)(?=\s+[^:\n]+:|$)

见演示。

https://regex101.com/r/EcIcPJ/1

票数 3
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/42358310

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档