Overview

Dataset statistics

Number of variables10
Number of observations6294
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory491.8 KiB
Average record size in memory80.0 B

Variable types

Text2
Categorical6
DateTime2

Dataset

Description한국가스안전공사의 긴급신고 전화 등을 통해 접수된 가스관련 사고의 접수 현황(사고일자, 진행상태, 주소, 가스종류) 자료로, 통계성자료로 활용가능하며, 국민들의 알권리를 보장하기 위해 제공하는 데이터입니다.
Author한국가스안전공사
URLhttps://www.data.go.kr/data/15001507/fileData.do

Alerts

접수지사 is highly imbalanced (86.5%)Imbalance
진행상태 is highly imbalanced (57.3%)Imbalance
사고가스 is highly imbalanced (52.7%)Imbalance
사고번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:01:00.436743
Analysis finished2023-12-12 18:01:02.130056
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사고번호
Text

UNIQUE 

Distinct6294
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
2023-12-13T03:01:02.409056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters69234
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6294 ?
Unique (%)100.0%

Sample

1st rowN-2021-0001
2nd rowN-2021-0002
3rd rowN-2021-0003
4th rowN-2021-0004
5th rowN-2021-0005
ValueCountFrequency (%)
n-2021-0001 1
 
< 0.1%
n-2022-1567 1
 
< 0.1%
n-2022-1565 1
 
< 0.1%
n-2022-1564 1
 
< 0.1%
n-2022-1563 1
 
< 0.1%
n-2022-1562 1
 
< 0.1%
n-2022-1561 1
 
< 0.1%
n-2022-1560 1
 
< 0.1%
n-2022-1559 1
 
< 0.1%
n-2022-1558 1
 
< 0.1%
Other values (6284) 6284
99.8%
2023-12-13T03:01:02.930952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 17492
25.3%
- 12588
18.2%
0 11342
16.4%
1 7189
10.4%
N 6294
 
9.1%
3 3480
 
5.0%
4 1954
 
2.8%
5 1869
 
2.7%
6 1794
 
2.6%
7 1747
 
2.5%
Other values (2) 3485
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 50352
72.7%
Dash Punctuation 12588
 
18.2%
Uppercase Letter 6294
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 17492
34.7%
0 11342
22.5%
1 7189
14.3%
3 3480
 
6.9%
4 1954
 
3.9%
5 1869
 
3.7%
6 1794
 
3.6%
7 1747
 
3.5%
9 1744
 
3.5%
8 1741
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 12588
100.0%
Uppercase Letter
ValueCountFrequency (%)
N 6294
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 62940
90.9%
Latin 6294
 
9.1%

Most frequent character per script

Common
ValueCountFrequency (%)
2 17492
27.8%
- 12588
20.0%
0 11342
18.0%
1 7189
11.4%
3 3480
 
5.5%
4 1954
 
3.1%
5 1869
 
3.0%
6 1794
 
2.9%
7 1747
 
2.8%
9 1744
 
2.8%
Latin
ValueCountFrequency (%)
N 6294
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 69234
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 17492
25.3%
- 12588
18.2%
0 11342
16.4%
1 7189
10.4%
N 6294
 
9.1%
3 3480
 
5.0%
4 1954
 
2.8%
5 1869
 
2.7%
6 1794
 
2.6%
7 1747
 
2.5%
Other values (2) 3485
 
5.0%

접수지사
Categorical

IMBALANCE 

Distinct30
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
본사
5862 
전남서부지사
 
43
대전광역본부
 
42
대구광역본부
 
37
부산광역본부
 
31
Other values (25)
 
279

Length

Max length6
Median length2
Mean length2.246584
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row본사
2nd row본사
3rd row본사
4th row본사
5th row본사

Common Values

ValueCountFrequency (%)
본사 5862
93.1%
전남서부지사 43
 
0.7%
대전광역본부 42
 
0.7%
대구광역본부 37
 
0.6%
부산광역본부 31
 
0.5%
광주광역본부 30
 
0.5%
전북본부 29
 
0.5%
충북본부 21
 
0.3%
충남본부 20
 
0.3%
강원광역본부 20
 
0.3%
Other values (20) 159
 
2.5%

Length

2023-12-13T03:01:03.123731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
본사 5862
93.1%
전남서부지사 43
 
0.7%
대전광역본부 42
 
0.7%
대구광역본부 37
 
0.6%
부산광역본부 31
 
0.5%
광주광역본부 30
 
0.5%
전북본부 29
 
0.5%
충북본부 21
 
0.3%
충남본부 20
 
0.3%
강원광역본부 20
 
0.3%
Other values (20) 159
 
2.5%
Distinct972
Distinct (%)15.4%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
Minimum2021-01-01 00:00:00
Maximum2023-08-31 00:00:00
2023-12-13T03:01:03.282312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:01:03.435580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

접수요일
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
월요일
1007 
목요일
969 
화요일
949 
금요일
901 
토요일
896 
Other values (2)
1572 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금요일
2nd row금요일
3rd row금요일
4th row금요일
5th row금요일

Common Values

ValueCountFrequency (%)
월요일 1007
16.0%
목요일 969
15.4%
화요일 949
15.1%
금요일 901
14.3%
토요일 896
14.2%
수요일 892
14.2%
일요일 680
10.8%

Length

2023-12-13T03:01:03.574975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:01:03.706582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월요일 1007
16.0%
목요일 969
15.4%
화요일 949
15.1%
금요일 901
14.3%
토요일 896
14.2%
수요일 892
14.2%
일요일 680
10.8%

처리지사
Categorical

Distinct30
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
대전광역본부
 
416
경기동부지사
 
398
서울광역본부
 
336
서울서부지사
 
310
대구광역본부
 
308
Other values (25)
4526 

Length

Max length6
Median length6
Mean length5.5449635
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전남서부지사
2nd row경남서부지사
3rd row경기동부지사
4th row서울남부지사
5th row부산북부지사

Common Values

ValueCountFrequency (%)
대전광역본부 416
 
6.6%
경기동부지사 398
 
6.3%
서울광역본부 336
 
5.3%
서울서부지사 310
 
4.9%
대구광역본부 308
 
4.9%
경기광역본부 307
 
4.9%
부산광역본부 299
 
4.8%
광주광역본부 285
 
4.5%
서울남부지사 267
 
4.2%
서울동부지사 267
 
4.2%
Other values (20) 3101
49.3%

Length

2023-12-13T03:01:03.879533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대전광역본부 416
 
6.6%
경기동부지사 398
 
6.3%
서울광역본부 336
 
5.3%
서울서부지사 310
 
4.9%
대구광역본부 308
 
4.9%
경기광역본부 307
 
4.9%
부산광역본부 299
 
4.8%
광주광역본부 285
 
4.5%
서울남부지사 267
 
4.2%
서울동부지사 267
 
4.2%
Other values (20) 3101
49.3%

진행상태
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
결과등록
5098 
완료
926 
보고서
 
265
접수
 
5

Length

Max length4
Median length4
Mean length3.6620591
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row완료
2nd row결과등록
3rd row결과등록
4th row결과등록
5th row결과등록

Common Values

ValueCountFrequency (%)
결과등록 5098
81.0%
완료 926
 
14.7%
보고서 265
 
4.2%
접수 5
 
0.1%

Length

2023-12-13T03:01:04.093705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:01:04.319482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
결과등록 5098
81.0%
완료 926
 
14.7%
보고서 265
 
4.2%
접수 5
 
0.1%
Distinct973
Distinct (%)15.5%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
Minimum2020-09-05 00:00:00
Maximum2023-08-31 00:00:00
2023-12-13T03:01:04.468429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:01:04.628733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

시고자소속
Categorical

Distinct10
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
사용자
3207 
소방서
1516 
경찰서
516 
행정관청
365 
지역주민
 
291
Other values (5)
399 

Length

Max length5
Median length3
Mean length3.0672069
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사용자
2nd row사용자
3rd row기타
4th row사용자
5th row소방서

Common Values

ValueCountFrequency (%)
사용자 3207
51.0%
소방서 1516
24.1%
경찰서 516
 
8.2%
행정관청 365
 
5.8%
지역주민 291
 
4.6%
기타 168
 
2.7%
인터넷 109
 
1.7%
행인 69
 
1.1%
공급자 51
 
0.8%
국민안전처 2
 
< 0.1%

Length

2023-12-13T03:01:04.817404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:01:04.990251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사용자 3207
51.0%
소방서 1516
24.1%
경찰서 516
 
8.2%
행정관청 365
 
5.8%
지역주민 291
 
4.6%
기타 168
 
2.7%
인터넷 109
 
1.7%
행인 69
 
1.1%
공급자 51
 
0.8%
국민안전처 2
 
< 0.1%

주소
Text

Distinct3199
Distinct (%)50.8%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
2023-12-13T03:01:05.425477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length11.161265
Min length2

Characters and Unicode

Total characters70249
Distinct characters367
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2041 ?
Unique (%)32.4%

Sample

1st row전남 완도군 완도읍 가용리
2nd row경남 함양군 병곡면 도천리
3rd row경기 광주시 쌍령동
4th row서울 영등포구 대림동
5th row부산 연제구 연산동
ValueCountFrequency (%)
서울 1013
 
5.1%
경기 946
 
4.8%
부산 370
 
1.9%
전남 283
 
1.4%
강원 281
 
1.4%
경남 266
 
1.3%
충남 261
 
1.3%
대전 225
 
1.1%
경북 225
 
1.1%
충북 225
 
1.1%
Other values (2780) 15793
79.4%
2023-12-13T03:01:06.104610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13630
 
19.4%
4947
 
7.0%
3955
 
5.6%
3455
 
4.9%
2014
 
2.9%
1864
 
2.7%
1848
 
2.6%
1574
 
2.2%
1380
 
2.0%
1310
 
1.9%
Other values (357) 34272
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56302
80.1%
Space Separator 13630
 
19.4%
Decimal Number 312
 
0.4%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 2
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4947
 
8.8%
3955
 
7.0%
3455
 
6.1%
2014
 
3.6%
1864
 
3.3%
1848
 
3.3%
1574
 
2.8%
1380
 
2.5%
1310
 
2.3%
1278
 
2.3%
Other values (343) 32677
58.0%
Decimal Number
ValueCountFrequency (%)
1 96
30.8%
2 88
28.2%
3 54
17.3%
4 24
 
7.7%
7 15
 
4.8%
5 11
 
3.5%
6 8
 
2.6%
8 6
 
1.9%
0 5
 
1.6%
9 5
 
1.6%
Space Separator
ValueCountFrequency (%)
13630
100.0%
Other Punctuation
ValueCountFrequency (%)
* 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56302
80.1%
Common 13946
 
19.9%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4947
 
8.8%
3955
 
7.0%
3455
 
6.1%
2014
 
3.6%
1864
 
3.3%
1848
 
3.3%
1574
 
2.8%
1380
 
2.5%
1310
 
2.3%
1278
 
2.3%
Other values (343) 32677
58.0%
Common
ValueCountFrequency (%)
13630
97.7%
1 96
 
0.7%
2 88
 
0.6%
3 54
 
0.4%
4 24
 
0.2%
7 15
 
0.1%
5 11
 
0.1%
6 8
 
0.1%
8 6
 
< 0.1%
0 5
 
< 0.1%
Other values (3) 9
 
0.1%
Latin
ValueCountFrequency (%)
e 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56302
80.1%
ASCII 13947
 
19.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13630
97.7%
1 96
 
0.7%
2 88
 
0.6%
3 54
 
0.4%
4 24
 
0.2%
7 15
 
0.1%
5 11
 
0.1%
6 8
 
0.1%
8 6
 
< 0.1%
0 5
 
< 0.1%
Other values (4) 10
 
0.1%
Hangul
ValueCountFrequency (%)
4947
 
8.8%
3955
 
7.0%
3455
 
6.1%
2014
 
3.6%
1864
 
3.3%
1848
 
3.3%
1574
 
2.8%
1380
 
2.5%
1310
 
2.3%
1278
 
2.3%
Other values (343) 32677
58.0%

사고가스
Categorical

IMBALANCE 

Distinct21
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size49.3 KiB
LNG
3129 
프로판
1231 
가스없음
1169 
기타
355 
부탄
 
196
Other values (16)
 
214

Length

Max length7
Median length3
Mean length3.1169368
Min length2

Unique

Unique5 ?
Unique (%)0.1%

Sample

1st row프로판
2nd row프로판
3rd row프로판
4th rowLNG
5th row가스없음

Common Values

ValueCountFrequency (%)
LNG 3129
49.7%
프로판 1231
 
19.6%
가스없음 1169
 
18.6%
기타 355
 
5.6%
부탄 196
 
3.1%
산소 46
 
0.7%
프로판+부탄 37
 
0.6%
암모니아 33
 
0.5%
질소 27
 
0.4%
수소 19
 
0.3%
Other values (11) 52
 
0.8%

Length

2023-12-13T03:01:06.274157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
lng 3129
49.7%
프로판 1231
 
19.6%
가스없음 1169
 
18.6%
기타 355
 
5.6%
부탄 196
 
3.1%
산소 46
 
0.7%
프로판+부탄 37
 
0.6%
암모니아 33
 
0.5%
질소 27
 
0.4%
수소 19
 
0.3%
Other values (11) 52
 
0.8%

Correlations

2023-12-13T03:01:06.377276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접수지사접수요일처리지사진행상태시고자소속사고가스
접수지사1.0000.1400.8320.0520.2800.223
접수요일0.1401.0000.1270.0840.0740.097
처리지사0.8320.1271.0000.1310.3690.428
진행상태0.0520.0840.1311.0000.2540.482
시고자소속0.2800.0740.3690.2541.0000.493
사고가스0.2230.0970.4280.4820.4931.000
2023-12-13T03:01:06.498808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리지사진행상태접수요일시고자소속사고가스접수지사
처리지사1.0000.0680.0530.1260.1200.264
진행상태0.0681.0000.0570.1540.2920.027
접수요일0.0530.0571.0000.0370.0420.059
시고자소속0.1260.1540.0371.0000.1770.092
사고가스0.1200.2920.0420.1771.0000.058
접수지사0.2640.0270.0590.0920.0581.000
2023-12-13T03:01:06.634423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접수지사접수요일처리지사진행상태시고자소속사고가스
접수지사1.0000.0590.2640.0270.0920.058
접수요일0.0591.0000.0530.0570.0370.042
처리지사0.2640.0531.0000.0680.1260.120
진행상태0.0270.0570.0681.0000.1540.292
시고자소속0.0920.0370.1260.1541.0000.177
사고가스0.0580.0420.1200.2920.1771.000

Missing values

2023-12-13T03:01:01.856979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:01:02.052090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사고번호접수지사접수일자접수요일처리지사진행상태사고일자시고자소속주소사고가스
0N-2021-0001본사2021-01-01금요일전남서부지사완료2020-12-31사용자전남 완도군 완도읍 가용리프로판
1N-2021-0002본사2021-01-01금요일경남서부지사결과등록2021-01-01사용자경남 함양군 병곡면 도천리프로판
2N-2021-0003본사2021-01-01금요일경기동부지사결과등록2021-01-01기타경기 광주시 쌍령동프로판
3N-2021-0004본사2021-01-01금요일서울남부지사결과등록2021-01-01사용자서울 영등포구 대림동LNG
4N-2021-0005본사2021-01-01금요일부산북부지사결과등록2021-01-01소방서부산 연제구 연산동가스없음
5N-2021-0006본사2021-01-02토요일대구광역본부결과등록2021-01-02사용자경북 김천시 평화동프로판
6N-2021-0007본사2021-01-02토요일울산본부결과등록2021-01-02사용자울산 울주군 상북면 궁근정리기타
7N-2021-0008본사2021-01-02토요일경기북부지사결과등록2021-01-02사용자경기 남양주시 오남읍 오남리 파라다이스빌LNG
8N-2021-0009본사2021-01-03일요일서울동부지사결과등록2021-01-03사용자서울 성북구 돈암동기타
9N-2021-0010본사2021-01-03일요일부산북부지사완료2021-01-03지역주민부산 연제구 거제동LNG
사고번호접수지사접수일자접수요일처리지사진행상태사고일자시고자소속주소사고가스
6284N-2023-1513본사2023-08-31목요일부산북부지사보고서2023-08-31사용자부산광역시 금정구 구서동LNG
6285N-2023-1514본사2023-08-31목요일서울남부지사결과등록2023-08-31사용자서울특별시 구로구 구로동LNG
6286N-2023-1515본사2023-08-31목요일경남본부결과등록2023-08-31사용자경상남도 김해시 주촌면가스없음
6287N-2023-1516본사2023-08-31목요일부산광역본부결과등록2023-08-31사용자부산광역시 사하구 다대동LNG
6288N-2023-1517본사2023-08-31목요일서울남부지사결과등록2023-08-31사용자서울특별시 영등포구 대림동LNG
6289N-2023-1518본사2023-08-31목요일대전광역본부결과등록2023-08-31사용자충청남도 보령시 주포면프로판
6290N-2023-1519본사2023-08-31목요일강원광역본부결과등록2023-08-31인터넷강원특별자치도 철원군 동송읍부탄
6291N-2023-1520본사2023-08-31목요일경기북부지사결과등록2023-08-31경찰서경기도 의정부시 의정부동LNG
6292N-2023-1521본사2023-08-31목요일충남본부결과등록2023-08-31사용자충청남도 천안시 서북구LNG
6293N-2023-1522본사2023-08-31목요일부산광역본부결과등록2023-08-31사용자부산광역시 강서구 명지동LNG