Overview

Dataset statistics

Number of variables5
Number of observations676
Missing cells76
Missing cells (%)2.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.5 KiB
Average record size in memory40.2 B

Variable types

Text3
Categorical2

Alerts

시설주소상세 has 76 (11.2%) missing valuesMissing

Reproduction

Analysis started2024-01-09 22:51:48.420798
Analysis finished2024-01-09 22:51:48.921196
Duration0.5 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct119
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Memory size5.4 KiB
2024-01-10T07:51:49.070049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length13
Mean length8.8091716
Min length4

Characters and Unicode

Total characters5955
Distinct characters201
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row백제문화단지
2nd row백제문화단지
3rd row백제문화단지
4th row백제문화단지
5th row백제문화단지
ValueCountFrequency (%)
공영주차장 133
 
12.6%
제1공영주차장 49
 
4.7%
주차장 40
 
3.8%
금강자연휴양림 21
 
2.0%
화지중앙시장 15
 
1.4%
천안 14
 
1.3%
서산버드랜드 13
 
1.2%
국민여가캠핑장 12
 
1.1%
시민문화여성회관 11
 
1.0%
태조산공원 11
 
1.0%
Other values (129) 734
69.7%
2024-01-10T07:51:49.399849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
480
 
8.1%
392
 
6.6%
300
 
5.0%
289
 
4.9%
260
 
4.4%
255
 
4.3%
121
 
2.0%
121
 
2.0%
106
 
1.8%
1 104
 
1.7%
Other values (191) 3527
59.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5352
89.9%
Space Separator 392
 
6.6%
Decimal Number 142
 
2.4%
Open Punctuation 30
 
0.5%
Close Punctuation 30
 
0.5%
Math Symbol 5
 
0.1%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
480
 
9.0%
300
 
5.6%
289
 
5.4%
260
 
4.9%
255
 
4.8%
121
 
2.3%
121
 
2.3%
106
 
2.0%
100
 
1.9%
93
 
1.7%
Other values (182) 3227
60.3%
Decimal Number
ValueCountFrequency (%)
1 104
73.2%
2 24
 
16.9%
3 9
 
6.3%
4 5
 
3.5%
Space Separator
ValueCountFrequency (%)
392
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Other Punctuation
ValueCountFrequency (%)
· 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5352
89.9%
Common 603
 
10.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
480
 
9.0%
300
 
5.6%
289
 
5.4%
260
 
4.9%
255
 
4.8%
121
 
2.3%
121
 
2.3%
106
 
2.0%
100
 
1.9%
93
 
1.7%
Other values (182) 3227
60.3%
Common
ValueCountFrequency (%)
392
65.0%
1 104
 
17.2%
( 30
 
5.0%
) 30
 
5.0%
2 24
 
4.0%
3 9
 
1.5%
~ 5
 
0.8%
4 5
 
0.8%
· 4
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5352
89.9%
ASCII 599
 
10.1%
None 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
480
 
9.0%
300
 
5.6%
289
 
5.4%
260
 
4.9%
255
 
4.8%
121
 
2.3%
121
 
2.3%
106
 
2.0%
100
 
1.9%
93
 
1.7%
Other values (182) 3227
60.3%
ASCII
ValueCountFrequency (%)
392
65.4%
1 104
 
17.4%
( 30
 
5.0%
) 30
 
5.0%
2 24
 
4.0%
3 9
 
1.5%
~ 5
 
0.8%
4 5
 
0.8%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct24
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size5.4 KiB
장애인
105 
국가유공자
100 
환경친화적 자동차
47 
65세이상 고령자
44 
국민기초생활 수급자
43 
Other values (19)
337 

Length

Max length23
Median length10
Mean length6.9630178
Min length3

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row국가유공자
2nd row국민기초생활 수급자
3rd row장애인
4th row한부모가정
5th row자원봉사 시간

Common Values

ValueCountFrequency (%)
장애인 105
15.5%
국가유공자 100
14.8%
환경친화적 자동차 47
 
7.0%
65세이상 고령자 44
 
6.5%
국민기초생활 수급자 43
 
6.4%
자동차제원(경차) 42
 
6.2%
모범납세자 35
 
5.2%
병역명문가 28
 
4.1%
한부모가정 28
 
4.1%
의사상자 27
 
4.0%
Other values (14) 177
26.2%

Length

2024-01-10T07:51:49.521937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
장애인 119
 
12.3%
국가유공자 117
 
12.1%
국민기초생활 92
 
9.5%
수급자 61
 
6.3%
65세이상 49
 
5.1%
환경친화적 47
 
4.9%
자동차 47
 
4.9%
고령자 44
 
4.6%
자동차제원(경차 42
 
4.3%
모범납세자 35
 
3.6%
Other values (19) 314
32.5%

감면액
Categorical

Distinct11
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size5.4 KiB
50
339 
100
204 
70
64 
30
 
32
80
 
9
Other values (6)
 
28

Length

Max length7
Median length2
Mean length2.352071
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row100
2nd row100
3rd row100
4th row100
5th row100

Common Values

ValueCountFrequency (%)
50 339
50.1%
100 204
30.2%
70 64
 
9.5%
30 32
 
4.7%
80 9
 
1.3%
40 8
 
1.2%
20 7
 
1.0%
100(1년) 6
 
0.9%
10 3
 
0.4%
10~30 2
 
0.3%

Length

2024-01-10T07:51:49.625869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
50 339
50.1%
100 204
30.2%
70 64
 
9.5%
30 32
 
4.7%
80 9
 
1.3%
40 8
 
1.2%
20 7
 
1.0%
100(1년 6
 
0.9%
10 3
 
0.4%
10~30 2
 
0.3%
Distinct108
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size5.4 KiB
2024-01-10T07:51:49.832682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length17.943787
Min length4

Characters and Unicode

Total characters12130
Distinct characters157
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충남 부여군 규암면 백제문로 455
2nd row충남 부여군 규암면 백제문로 455
3rd row충남 부여군 규암면 백제문로 455
4th row충남 부여군 규암면 백제문로 455
5th row충남 부여군 규암면 백제문로 455
ValueCountFrequency (%)
충남 646
 
20.4%
천안시 152
 
4.8%
아산시 112
 
3.5%
서북구 82
 
2.6%
서산시 73
 
2.3%
동남구 70
 
2.2%
청양군 55
 
1.7%
보령시 47
 
1.5%
논산시 42
 
1.3%
부여군 36
 
1.1%
Other values (238) 1844
58.4%
2024-01-10T07:51:50.166382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2483
20.5%
811
 
6.7%
688
 
5.7%
485
 
4.0%
1 461
 
3.8%
408
 
3.4%
370
 
3.1%
282
 
2.3%
3 259
 
2.1%
2 248
 
2.0%
Other values (147) 5635
46.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7230
59.6%
Space Separator 2483
 
20.5%
Decimal Number 2232
 
18.4%
Dash Punctuation 185
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
811
 
11.2%
688
 
9.5%
485
 
6.7%
408
 
5.6%
370
 
5.1%
282
 
3.9%
217
 
3.0%
214
 
3.0%
200
 
2.8%
191
 
2.6%
Other values (135) 3364
46.5%
Decimal Number
ValueCountFrequency (%)
1 461
20.7%
3 259
11.6%
2 248
11.1%
7 222
9.9%
4 204
9.1%
0 200
9.0%
6 199
8.9%
5 189
8.5%
9 131
 
5.9%
8 119
 
5.3%
Space Separator
ValueCountFrequency (%)
2483
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 185
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7230
59.6%
Common 4900
40.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
811
 
11.2%
688
 
9.5%
485
 
6.7%
408
 
5.6%
370
 
5.1%
282
 
3.9%
217
 
3.0%
214
 
3.0%
200
 
2.8%
191
 
2.6%
Other values (135) 3364
46.5%
Common
ValueCountFrequency (%)
2483
50.7%
1 461
 
9.4%
3 259
 
5.3%
2 248
 
5.1%
7 222
 
4.5%
4 204
 
4.2%
0 200
 
4.1%
6 199
 
4.1%
5 189
 
3.9%
- 185
 
3.8%
Other values (2) 250
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7230
59.6%
ASCII 4900
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2483
50.7%
1 461
 
9.4%
3 259
 
5.3%
2 248
 
5.1%
7 222
 
4.5%
4 204
 
4.2%
0 200
 
4.1%
6 199
 
4.1%
5 189
 
3.9%
- 185
 
3.8%
Other values (2) 250
 
5.1%
Hangul
ValueCountFrequency (%)
811
 
11.2%
688
 
9.5%
485
 
6.7%
408
 
5.6%
370
 
5.1%
282
 
3.9%
217
 
3.0%
214
 
3.0%
200
 
2.8%
191
 
2.6%
Other values (135) 3364
46.5%

시설주소상세
Text

MISSING 

Distinct99
Distinct (%)16.5%
Missing76
Missing (%)11.2%
Memory size5.4 KiB
2024-01-10T07:51:50.346752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length7.6333333
Min length1

Characters and Unicode

Total characters4580
Distinct characters186
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row백제문화단지
2nd row백제문화단지
3rd row백제문화단지
4th row백제문화단지
5th row백제문화단지
ValueCountFrequency (%)
공영주차장 58
 
7.4%
제1공영주차장 40
 
5.1%
주차장 26
 
3.3%
금강자연휴양림 21
 
2.7%
보령시청(교통과 18
 
2.3%
서산버드랜드 13
 
1.7%
국민체육센터 13
 
1.7%
국민여가캠핑장 12
 
1.5%
시민문화여성회관 11
 
1.4%
스포츠센터 11
 
1.4%
Other values (99) 564
71.7%
2024-01-10T07:51:50.629487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
300
 
6.6%
216
 
4.7%
175
 
3.8%
168
 
3.7%
148
 
3.2%
129
 
2.8%
128
 
2.8%
99
 
2.2%
85
 
1.9%
83
 
1.8%
Other values (176) 3049
66.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4165
90.9%
Space Separator 216
 
4.7%
Decimal Number 116
 
2.5%
Close Punctuation 36
 
0.8%
Open Punctuation 36
 
0.8%
Uppercase Letter 7
 
0.2%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
300
 
7.2%
175
 
4.2%
168
 
4.0%
148
 
3.6%
129
 
3.1%
128
 
3.1%
99
 
2.4%
85
 
2.0%
83
 
2.0%
83
 
2.0%
Other values (166) 2767
66.4%
Decimal Number
ValueCountFrequency (%)
1 80
69.0%
2 31
 
26.7%
3 5
 
4.3%
Close Punctuation
ValueCountFrequency (%)
) 27
75.0%
] 9
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 27
75.0%
[ 9
 
25.0%
Space Separator
ValueCountFrequency (%)
216
100.0%
Uppercase Letter
ValueCountFrequency (%)
F 7
100.0%
Other Punctuation
ValueCountFrequency (%)
· 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4165
90.9%
Common 408
 
8.9%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
300
 
7.2%
175
 
4.2%
168
 
4.0%
148
 
3.6%
129
 
3.1%
128
 
3.1%
99
 
2.4%
85
 
2.0%
83
 
2.0%
83
 
2.0%
Other values (166) 2767
66.4%
Common
ValueCountFrequency (%)
216
52.9%
1 80
 
19.6%
2 31
 
7.6%
) 27
 
6.6%
( 27
 
6.6%
] 9
 
2.2%
[ 9
 
2.2%
3 5
 
1.2%
· 4
 
1.0%
Latin
ValueCountFrequency (%)
F 7
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4165
90.9%
ASCII 411
 
9.0%
None 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
300
 
7.2%
175
 
4.2%
168
 
4.0%
148
 
3.6%
129
 
3.1%
128
 
3.1%
99
 
2.4%
85
 
2.0%
83
 
2.0%
83
 
2.0%
Other values (166) 2767
66.4%
ASCII
ValueCountFrequency (%)
216
52.6%
1 80
 
19.5%
2 31
 
7.5%
) 27
 
6.6%
( 27
 
6.6%
] 9
 
2.2%
[ 9
 
2.2%
F 7
 
1.7%
3 5
 
1.2%
None
ValueCountFrequency (%)
· 4
100.0%

Correlations

2024-01-10T07:51:50.709700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
감면정책이름감면액시설주소상세
감면정책이름1.0000.6320.000
감면액0.6321.0000.967
시설주소상세0.0000.9671.000
2024-01-10T07:51:51.053153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
감면액감면정책이름
감면액1.0000.276
감면정책이름0.2761.000
2024-01-10T07:51:51.118157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
감면정책이름감면액
감면정책이름1.0000.276
감면액0.2761.000

Missing values

2024-01-10T07:51:48.807817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:51:48.887540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설명감면정책이름감면액시설주소시설주소상세
0백제문화단지국가유공자100충남 부여군 규암면 백제문로 455백제문화단지
1백제문화단지국민기초생활 수급자100충남 부여군 규암면 백제문로 455백제문화단지
2백제문화단지장애인100충남 부여군 규암면 백제문로 455백제문화단지
3백제문화단지한부모가정100충남 부여군 규암면 백제문로 455백제문화단지
4백제문화단지자원봉사 시간100충남 부여군 규암면 백제문로 455백제문화단지
5백제문화단지국가유공자 차량100충남 부여군 규암면 백제문로 455백제문화단지
6백제문화단지장애인 차량100충남 부여군 규암면 백제문로 455백제문화단지
7백제문화단지의사상자100충남 부여군 규암면 백제문로 455백제문화단지
8백제문화단지65세이상 고령자100충남 부여군 규암면 백제문로 455백제문화단지
9금강자연휴양림 숲속의집국가유공자10~30세종특별자치시 금남면 산림박물관길 110금강자연휴양림
시설명감면정책이름감면액시설주소시설주소상세
666아산시청 부설주차장모범납세자50충남 아산시 시민로 456
667아산시청 부설주차장장애인50충남 아산시 시민로 456
668아산시청 부설주차장환경친화적 자동차50충남 아산시 시민로 456
669아산시청 부설주차장자동차제원(경차)50충남 아산시 시민로 456
670북수리 제1공영주차장모범납세자50충남 아산시 배방읍 북수리 1122
671북수리 제1공영주차장장애인50충남 아산시 배방읍 북수리 1122
672북수리 제1공영주차장환경친화적 자동차50충남 아산시 배방읍 북수리 1122
673북수리 제1공영주차장자동차제원(경차)50충남 아산시 배방읍 북수리 1122
674금산군인삼골오토캠핑장국민기초생활 수급자30충남 금산군 제원면 용화리 225금산군인삼골오토캠핑장
675금산군인삼골오토캠핑장국가유공자30충남 금산군 제원면 용화리 225금산군인삼골오토캠핑장