Overview

Dataset statistics

Number of variables: 3
Number of observations: 6735
Missing cells: 3
Missing cells (%): < 0.1%
Duplicate rows: 11
Duplicate rows (%): 0.2%
Total size in memory: 158.0 KiB
Average record size in memory: 24.0 B

Variable types

Categorical: 1
Text: 2

Alerts

Dataset has 11 (0.2%) duplicate rows [Duplicates]
소방용수종류 (firefighting water source type) is highly imbalanced (61.4%) [Imbalance]
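
The 61.4% figure can be reproduced from the category counts listed later in this report if the score is taken to be one minus the normalized Shannon entropy. That formula is an assumption checked only against this report's numbers, not a statement about the tool's internals. The split of the last two categories (8 rows in total) is not shown in the report, so the 4 + 4 split below is hypothetical.

```python
import math

# Category counts as listed in this report's Common Values table.
counts = [3433, 2944, 138, 53, 45, 38, 28, 25, 14, 9]
# The report lumps the last two categories together (8 rows in total);
# the 4 + 4 split is an assumption made only for illustration.
counts += [4, 4]


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score of 0 would mean all 12 categories are equally frequent, and a score near 1 would mean almost all rows fall in one category. Here two categories cover about 95% of the rows, which gives a score of roughly 61.4%.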

Reproduction

Analysis started: 2024-03-14 03:25:39.031026
Analysis finished: 2024-03-14 03:25:39.684628
Duration: 0.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

소방용수종류 (firefighting water source type)
Categorical

Alert: Imbalance

Distinct: 12
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 52.7 KiB

Value  Count
지상식  3433
지하식  2944
급수탑  138
비상소화장치  53
기타  45
Other values (7)  122

Length

Max length: 6
Median length: 3
Mean length: 3.0191537
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 지상식
2nd row: 지상식
3rd row: 지상식
4th row: 지하식
5th row: 지상식

Common Values

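Value  Count  Frequency (%)
지상식 (above-ground type)  3433  51.0%
지하식 (underground type)  2944  43.7%
급수탑 (water supply tower)  138  2.0%
비상소화장치 (emergency fire-extinguishing device)  53  0.8%
기타 (other)  45  0.7%
자연 (natural)  38  0.6%
저수지 (reservoir)  28  0.4%
저수조 (water storage tank)  25  0.4%
비상화장치  14  0.2%
자연수리 (natural water source)  9  0.1%
Other values (2)  8  0.1%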

Length

Figure: Histogram of lengths of the category

소화전 번호 (hydrant number)
Text

Distinct: 6683
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 52.7 KiB

Length

Max length: 20
Median length: 12
Mean length: 11.97075
Min length: 6

Characters and Unicode

Total characters: 80623
Distinct characters: 109
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
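
As a rough illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using two hydrant numbers copied from this report. The `LONG_NAMES` mapping is a helper introduced here for readability; it covers only the four categories this variable contains.

```python
import unicodedata
from collections import Counter

# Two sample values of 소화전 번호 copied from this report.
samples = ["덕진-팔복-지상-034", "덕진-전미-지하-075"]

# Short general-category codes mapped to the long names used in the report.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}

# Count the general category of every character in every sample value.
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```

Each sample is 12 characters long: six Hangul syllables, three hyphens, and three digits, which is the same mix of categories the tables below report for the whole column.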

Unique

Unique: 6631
Unique (%): 98.5%
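
"Distinct" and "Unique" are easy to confuse. Judging by the figures here, "Distinct" counts the different values that occur at all, and "Unique" counts the values that occur exactly once. The sketch below illustrates that reading on a small hypothetical list built from hydrant numbers in this report, with one value deliberately repeated.

```python
from collections import Counter

# Illustrative values only: hydrant numbers from this report, one repeated.
values = [
    "덕진-팔복-지상-034",
    "덕진-팔복-지상-026",
    "덕진-팔복-지상-028",
    "덕진-팔복-지상-028",
    "덕진-전미-지하-075",
]

freq = Counter(values)
n_distinct = len(freq)                                # different values seen
n_unique = sum(1 for c in freq.values() if c == 1)    # values seen exactly once

print(f"Distinct: {n_distinct} ({n_distinct / len(values):.1%})")
print(f"Unique: {n_unique} ({n_unique / len(values):.1%})")
```

Applied to this variable, the same arithmetic gives 6683 / 6735 ≈ 99.2% distinct and 6631 / 6735 ≈ 98.5% unique, matching the reported percentages.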

Sample

1st row: 덕진-팔복-지상-034
2nd row: 덕진-팔복-지상-026
3rd row: 덕진-팔복-지상-029
4th row: 덕진-전미-지하-075
5th row: 덕진-팔복-지상-037

Value  Count  Frequency (%)
무진장-마령  5  0.1%
남원-금지-지상-046  2  < 0.1%
남원-식정-지하-013  2  < 0.1%
고창-고창-지상-132  2  < 0.1%
무진장-진안-지하-04  2  < 0.1%
무진장-진안-지하-03  2  < 0.1%
무진장-진안-지하-02  2  < 0.1%
무진장-진안-지하-01  2  < 0.1%
무진장-장수-지하-12  2  < 0.1%
정읍-연지-지상-447  2  < 0.1%
Other values (6676)  6719  99.7%

Most occurring characters

Value  Count  Frequency (%)
-  20202  25.1%
(unrecovered)  6673  8.3%
0  5911  7.3%
(unrecovered)  3547  4.4%
(unrecovered)  3425  4.2%
(unrecovered)  2924  3.6%
1  2881  3.6%
2  2046  2.5%
3  1581  2.0%
(unrecovered)  1573  2.0%
Other values (99)  29860  37.0%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  41268  51.2%
Dash Punctuation  20202  25.1%
Decimal Number  19139  23.7%
Space Separator  14  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(unrecovered)  6673  16.2%
(unrecovered)  3547  8.6%
(unrecovered)  3425  8.3%
(unrecovered)  2924  7.1%
(unrecovered)  1573  3.8%
(unrecovered)  1120  2.7%
(unrecovered)  1120  2.7%
(unrecovered)  1036  2.5%
(unrecovered)  998  2.4%
(unrecovered)  979  2.4%
Other values (87)  17873  43.3%

Decimal Number
Value  Count  Frequency (%)
0  5911  30.9%
1  2881  15.1%
2  2046  10.7%
3  1581  8.3%
4  1391  7.3%
5  1203  6.3%
6  1118  5.8%
7  1078  5.6%
8  984  5.1%
9  946  4.9%

Dash Punctuation
Value  Count  Frequency (%)
-  20202  100.0%

Space Separator
Value  Count  Frequency (%)
(space)  14  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  41268  51.2%
Common  39355  48.8%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(unrecovered)  6673  16.2%
(unrecovered)  3547  8.6%
(unrecovered)  3425  8.3%
(unrecovered)  2924  7.1%
(unrecovered)  1573  3.8%
(unrecovered)  1120  2.7%
(unrecovered)  1120  2.7%
(unrecovered)  1036  2.5%
(unrecovered)  998  2.4%
(unrecovered)  979  2.4%
Other values (87)  17873  43.3%

Common
Value  Count  Frequency (%)
-  20202  51.3%
0  5911  15.0%
1  2881  7.3%
2  2046  5.2%
3  1581  4.0%
4  1391  3.5%
5  1203  3.1%
6  1118  2.8%
7  1078  2.7%
8  984  2.5%
Other values (2)  960  2.4%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  41268  51.2%
ASCII  39355  48.8%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
-  20202  51.3%
0  5911  15.0%
1  2881  7.3%
2  2046  5.2%
3  1581  4.0%
4  1391  3.5%
5  1203  3.1%
6  1118  2.8%
7  1078  2.7%
8  984  2.5%
Other values (2)  960  2.4%

Hangul
Value  Count  Frequency (%)
(unrecovered)  6673  16.2%
(unrecovered)  3547  8.6%
(unrecovered)  3425  8.3%
(unrecovered)  2924  7.1%
(unrecovered)  1573  3.8%
(unrecovered)  1120  2.7%
(unrecovered)  1120  2.7%
(unrecovered)  1036  2.5%
(unrecovered)  998  2.4%
(unrecovered)  979  2.4%
Other values (87)  17873  43.3%

위치 (location)
Text

Distinct: 6622
Distinct (%): 98.4%
Missing: 3
Missing (%): < 0.1%
Memory size: 52.7 KiB

Length

Max length: 73
Median length: 54
Mean length: 21.947861
Min length: 2

Characters and Unicode

Total characters: 147753
Distinct characters: 859
Distinct categories: 13
Distinct scripts: 4
Distinct blocks: 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6574
Unique (%): 97.7%

Sample

1st row: 기린대로881 효성 동문 좌측 20m
2nd row: 온고을로 376 (삼양화성 제2약품창고 건너편 인도)
3rd row: 기린대로881 효성 정문 우측 100m
4th row: 전미동 연봉마을 모정 앞
5th row: 덕진구 반월로 104(한국타이어 타이어마트점 옆)

Value  Count  Frequency (%)
(unrecovered)  2501  8.1%
(unrecovered)  678  2.2%
익산시  424  1.4%
전주시  319  1.0%
입구 (entrance)  276  0.9%
고창군  256  0.8%
김제시  223  0.7%
맞은편 (across from)  208  0.7%
인도 (sidewalk)  195  0.6%
정읍시  172  0.6%
Other values (13253)  25725  83.0%

Most occurring characters

Value  Count  Frequency (%)
(space)  24839  16.8%
1  4426  3.0%
(unrecovered)  3845  2.6%
(unrecovered)  3434  2.3%
(  3221  2.2%
)  3206  2.2%
2  2947  2.0%
(unrecovered)  2747  1.9%
(unrecovered)  2430  1.6%
(unrecovered)  2341  1.6%
Other values (849)  94317  63.8%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  93163  63.1%
Space Separator  24839  16.8%
Decimal Number  19653  13.3%
Open Punctuation  3271  2.2%
Close Punctuation  3254  2.2%
Dash Punctuation  2335  1.6%
Uppercase Letter  487  0.3%
Other Punctuation  327  0.2%
Lowercase Letter  322  0.2%
Math Symbol  54  < 0.1%
Other values (3)  48  < 0.1%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(unrecovered)  3845  4.1%
(unrecovered)  3434  3.7%
(unrecovered)  2747  2.9%
(unrecovered)  2430  2.6%
(unrecovered)  2341  2.5%
(unrecovered)  2340  2.5%
(unrecovered)  2106  2.3%
(unrecovered)  1835  2.0%
(unrecovered)  1587  1.7%
(unrecovered)  1466  1.6%
Other values (767)  69032  74.1%

Uppercase Letter
Value  Count  Frequency (%)
S  50  10.3%
G  46  9.4%
M  45  9.2%
T  40  8.2%
C  39  8.0%
A  38  7.8%
L  37  7.6%
K  34  7.0%
B  33  6.8%
P  25  5.1%
Other values (14)  100  20.5%

Lowercase Letter
Value  Count  Frequency (%)
m  212  65.8%
k  12  3.7%
s  11  3.4%
i  10  3.1%
e  9  2.8%
a  9  2.8%
c  9  2.8%
o  8  2.5%
t  6  1.9%
g  5  1.6%
Other values (12)  31  9.6%

Decimal Number
Value  Count  Frequency (%)
1  4426  22.5%
2  2947  15.0%
3  2307  11.7%
4  1772  9.0%
0  1752  8.9%
5  1700  8.7%
7  1288  6.6%
6  1248  6.4%
8  1156  5.9%
9  1057  5.4%

Other Punctuation
Value  Count  Frequency (%)
,  147  45.0%
@  83  25.4%
.  68  20.8%
?  15  4.6%
/  6  1.8%
&  5  1.5%
!  1  0.3%
:  1  0.3%
(unrecovered)  1  0.3%

Open Punctuation
Value  Count  Frequency (%)
(  3221  98.5%
[  42  1.3%
(unrecovered)  7  0.2%
{  1  < 0.1%

Close Punctuation
Value  Count  Frequency (%)
)  3206  98.5%
]  42  1.3%
(unrecovered)  5  0.2%
}  1  < 0.1%

Math Symbol
Value  Count  Frequency (%)
~  49  90.7%
(unrecovered)  5  9.3%

Other Symbol
Value  Count  Frequency (%)
(unrecovered)  21  95.5%
(unrecovered)  1  4.5%

Control
Value  Count  Frequency (%)
(unrecovered)  15  83.3%
(unrecovered)  3  16.7%

Space Separator
Value  Count  Frequency (%)
(space)  24839  100.0%

Dash Punctuation
Value  Count  Frequency (%)
-  2335  100.0%

Connector Punctuation
Value  Count  Frequency (%)
_  8  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  93183  63.1%
Common  53760  36.4%
Latin  809  0.5%
Han  1  < 0.1%

Most frequent character per script

Hangul
Value  Count  Frequency (%)
(unrecovered)  3845  4.1%
(unrecovered)  3434  3.7%
(unrecovered)  2747  2.9%
(unrecovered)  2430  2.6%
(unrecovered)  2341  2.5%
(unrecovered)  2340  2.5%
(unrecovered)  2106  2.3%
(unrecovered)  1835  2.0%
(unrecovered)  1587  1.7%
(unrecovered)  1466  1.6%
Other values (767)  69052  74.1%

Latin
Value  Count  Frequency (%)
m  212  26.2%
S  50  6.2%
G  46  5.7%
M  45  5.6%
T  40  4.9%
C  39  4.8%
A  38  4.7%
L  37  4.6%
K  34  4.2%
B  33  4.1%
Other values (36)  235  29.0%

Common
Value  Count  Frequency (%)
(space)  24839  46.2%
1  4426  8.2%
(  3221  6.0%
)  3206  6.0%
2  2947  5.5%
-  2335  4.3%
3  2307  4.3%
4  1772  3.3%
0  1752  3.3%
5  1700  3.2%
Other values (25)  5255  9.8%

Han
Value  Count  Frequency (%)
(unrecovered)  1  100.0%

Most occurring blocks

Value  Count  Frequency (%)
Hangul  93131  63.0%
ASCII  54550  36.9%
None  34  < 0.1%
Compat Jamo  31  < 0.1%
Arrows  5  < 0.1%
CJK  1  < 0.1%
Enclosed Alphanum  1  < 0.1%

Most frequent character per block

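ASCII
Value  Count  Frequency (%)
(space)  24839  45.5%
1  4426  8.1%
(  3221  5.9%
)  3206  5.9%
2  2947  5.4%
-  2335  4.3%
3  2307  4.2%
4  1772  3.2%
0  1752  3.2%
5  1700  3.1%
Other values (66)  6045  11.1%

Hangul
Value  Count  Frequency (%)
(unrecovered)  3845  4.1%
(unrecovered)  3434  3.7%
(unrecovered)  2747  2.9%
(unrecovered)  2430  2.6%
(unrecovered)  2341  2.5%
(unrecovered)  2340  2.5%
(unrecovered)  2106  2.3%
(unrecovered)  1835  2.0%
(unrecovered)  1587  1.7%
(unrecovered)  1466  1.6%
Other values (759)  69000  74.1%

None
Value  Count  Frequency (%)
(unrecovered)  21  61.8%
(unrecovered)  7  20.6%
(unrecovered)  5  14.7%
(unrecovered)  1  2.9%

Compat Jamo
Value  Count  Frequency (%)
(unrecovered)  16  51.6%
(unrecovered)  4  12.9%
(unrecovered)  4  12.9%
(unrecovered)  3  9.7%
(unrecovered)  2  6.5%
(unrecovered)  1  3.2%
(unrecovered)  1  3.2%

Arrows
Value  Count  Frequency (%)
(unrecovered)  5  100.0%

CJK
Value  Count  Frequency (%)
(unrecovered)  1  100.0%

Enclosed Alphanum
Value  Count  Frequency (%)
(unrecovered)  1  100.0%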

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
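
The nullity counts behind these plots are small enough to check by hand. The sketch below recomputes the overall missing-cell percentage from the per-variable figures in this report, and shows one way to count missing values per column. The `count_missing` helper and the `example_rows` list are hypothetical and only illustrate the idea, with `None` standing in for a missing cell.

```python
# Per-column missing counts as reported in the variable sections above.
missing_by_column = {"소방용수종류": 0, "소화전 번호": 0, "위치": 3}
n_rows = 6735

total_cells = n_rows * len(missing_by_column)
missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / total_cells
print(f"{missing_cells} of {total_cells} cells missing ({missing_pct:.3f}%)")


def count_missing(rows, columns):
    """Count None values per column in a list of dict rows (hypothetical helper)."""
    return {col: sum(1 for row in rows if row.get(col) is None) for col in columns}


# Hypothetical rows: the second one has no location recorded.
example_rows = [
    {"소방용수종류": "지상식", "소화전 번호": "덕진-팔복-지상-034", "위치": "기린대로881 효성 동문 좌측 20m"},
    {"소방용수종류": "지하식", "소화전 번호": "덕진-전미-지하-075", "위치": None},
]
example_missing = count_missing(example_rows, list(missing_by_column))
print(example_missing)
```

Three missing cells out of 20,205 is about 0.015%, which is why the overview shows "< 0.1%".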

Sample

First rows

Index  소방용수종류  소화전 번호  위치
0  지상식  덕진-팔복-지상-034  기린대로881 효성 동문 좌측 20m
1  지상식  덕진-팔복-지상-026  온고을로 376 (삼양화성 제2약품창고 건너편 인도)
2  지상식  덕진-팔복-지상-029  기린대로881 효성 정문 우측 100m
3  지하식  덕진-전미-지하-075  전미동 연봉마을 모정 앞
4  지상식  덕진-팔복-지상-037  덕진구 반월로 104(한국타이어 타이어마트점 옆)
5  지상식  덕진-팔복-지상-038  덕진구 팔복로 147 (휴비스 신공장 정문 우측앞)
6  지상식  덕진-팔복-지상-035  덕진구 동산동 766(효성C1공장 동문에서 우측150m)
7  지상식  덕진-팔복-지상-039  덕진구 만성동 621-2(OK셀프 주유소 맞은편 화단)
8  지상식  덕진-팔복-지상-030  덕진구 동산동 771 (효성C1공장 정문에서 우측 300m)
9  지상식  덕진-팔복-지상-028  덕진구 동산동 771 (효성C1공장 정문에서 좌측 100m)

Last rows

Index  소방용수종류  소화전 번호  위치
6725  지하식  무진장-진안-지하-32  군상리 유만봉씨댁앞 537
6726  지하식  무진장-진안-지하-33  군상리 375-3
6727  지하식  무진장-진안-지하-01  군상리 진안소방파출소 앞(폐쇄)
6728  지하식  무진장-진안-지하-35  단양리 원단양 박용주씨댁앞 41
6729  지하식  무진장-진안-지하-36  원단양리 원단양마을 김학용씨댁앞 32-26
6730  지하식  무진장-진안-지하-37  진안읍 단양리 원단양길 40(송석동씨댁앞)
6731  지하식  무진장-진안-지하-38  진안읍 단양리 단양길 38 (전승관씨댁 앞)
6732  지하식  무진장-진안-지하-02  신괴리 485번지 괴정마을회관 앞
6733  지하식  무진장-진안-지하-03  신괴리 456 소희섭씨댁앞
6734  지하식  무진장-진안-지하-04  신괴리 748번지 김광호씨댁 앞

Duplicate rows

Most frequently occurring

Index  소방용수종류  소화전 번호  위치  # duplicates
0  저수조  완산-효자-저수조-001  전주시 서진로(대한방직 내)  2
1  지상식  고창-고창-지상-132  고창군 고창읍 월곡뉴타운1길 60  2
2  지상식  남원-금지-지상-046  남원시 수지면 산정유암길 113(등동마을회관 앞)  2
3  지상식  남원-인월-지상-032  남원시 산내면 백일길 7(백일마을회관 앞)  2
4  지상식  남원-인월-지상-033  남원시 아영면 신지길 39(신지마을회관 앞)  2
5  지상식  덕진-팔복-지상-028  덕진구 동산동 771 (효성C1공장 정문에서 좌측 100m)  2
6  지상식  무진장-무주-지상-11  당산1길15번지(배원식씨댁 앞)  2
7  지상식  완산-교동-지상-029  상관면 수월길 11 (수월경로당 앞)  2
8  지상식  익산-남중-지상-109  익산시 인북로 307(남중동 전북은행신동지점 옆  2
9  지상식  익산-모현-지상-136  익산시 익산대로 54길 31(미니마트 앞)  2
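
A row counts as a duplicate only when all three columns match. As a minimal sketch of that check using only the standard library, the example below counts whole rows as tuples. The `rows` list is a hypothetical excerpt built from values in this report, with one row deliberately repeated.

```python
from collections import Counter

# Hypothetical excerpt: (소방용수종류, 소화전 번호, 위치) tuples, one row repeated.
rows = [
    ("지상식", "덕진-팔복-지상-028", "덕진구 동산동 771 (효성C1공장 정문에서 좌측 100m)"),
    ("지상식", "덕진-팔복-지상-030", "덕진구 동산동 771 (효성C1공장 정문에서 우측 300m)"),
    ("지상식", "덕진-팔복-지상-028", "덕진구 동산동 771 (효성C1공장 정문에서 좌측 100m)"),
]

# Count identical rows; any row seen more than once is a duplicate.
row_counts = Counter(rows)
duplicated = {row: n for row, n in row_counts.items() if n > 1}

for row, n in duplicated.items():
    print(n, row)
```

With the full dataset loaded in pandas, `df[df.duplicated(keep=False)]` would list the same rows. Whether to drop them is a separate decision, since a repeated hydrant number with the same location may be a data-entry repeat rather than two physical hydrants.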