Overview

Dataset statistics

Number of variables14
Number of observations7833
Missing cells22125
Missing cells (%)20.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory895.1 KiB
Average record size in memory117.0 B

Variable types

Categorical4
Text3
DateTime2
Unsupported2
Numeric3

Dataset

Description집단급식소(공공기관) 현황_인허가
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=H22M8PNJ5KXT7HNJQQBS13855813&infSeq=1

Alerts

시군명 is highly overall correlated with 소재지우편번호 and 3 other fieldsHigh correlation
위생업종명 is highly overall correlated with 소재지우편번호 and 5 other fieldsHigh correlation
영업상태명 is highly overall correlated with 위생업종명High correlation
위생업태명 is highly overall correlated with 위생업종명High correlation
소재지우편번호 is highly overall correlated with WGS84위도 and 2 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
위생업종명 is highly imbalanced (82.3%)Imbalance
폐업일자 has 6057 (77.3%) missing valuesMissing
다중이용업소여부 has 7833 (100.0%) missing valuesMissing
총시설규모(㎡) has 7833 (100.0%) missing valuesMissing
소재지도로명주소 has 79 (1.0%) missing valuesMissing
소재지우편번호 has 105 (1.3%) missing valuesMissing
WGS84위도 has 108 (1.4%) missing valuesMissing
WGS84경도 has 108 (1.4%) missing valuesMissing
다중이용업소여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported
총시설규모(㎡) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 22:25:40.388858
Analysis finished2023-12-10 22:25:43.634340
Duration3.25 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size61.3 KiB
화성시
739 
용인시
631 
안산시
619 
성남시
597 
시흥시
543 
Other values (26)
4704 

Length

Max length4
Median length3
Mean length3.0460871
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
화성시 739
 
9.4%
용인시 631
 
8.1%
안산시 619
 
7.9%
성남시 597
 
7.6%
시흥시 543
 
6.9%
평택시 525
 
6.7%
수원시 404
 
5.2%
고양시 403
 
5.1%
부천시 351
 
4.5%
파주시 343
 
4.4%
Other values (21) 2678
34.2%

Length

2023-12-11T07:25:43.697245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
화성시 739
 
9.4%
용인시 631
 
8.1%
안산시 619
 
7.9%
성남시 597
 
7.6%
시흥시 543
 
6.9%
평택시 525
 
6.7%
수원시 404
 
5.2%
고양시 403
 
5.1%
부천시 351
 
4.5%
파주시 343
 
4.4%
Other values (21) 2678
34.2%
Distinct7368
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size61.3 KiB
2023-12-11T07:25:43.941165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length10.095493
Min length2

Characters and Unicode

Total characters79078
Distinct characters786
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6984 ?
Unique (%)89.2%

Sample

1st row씨제이프레시웨이(주)아난티가평점클럽하우스
2nd row강변요양병원
3rd row(주)케이씨씨글라스 가평공장
4th row재단법인가평군복지재단(가평군노인복지관)
5th rowHJ매그놀리아국제병원,HJ매그놀리아국제요양병원
ValueCountFrequency (%)
주식회사 262
 
2.5%
어린이집 213
 
2.0%
주)아워홈 105
 
1.0%
구내식당 93
 
0.9%
주)현대그린푸드 47
 
0.4%
본우리집밥 46
 
0.4%
주)신세계푸드 41
 
0.4%
풀무원푸드앤컬처 41
 
0.4%
주)동원홈푸드 38
 
0.4%
씨제이프레시웨이(주 33
 
0.3%
Other values (7879) 9573
91.2%
2023-12-11T07:25:44.341619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2909
 
3.7%
2893
 
3.7%
2660
 
3.4%
) 2588
 
3.3%
( 2578
 
3.3%
2233
 
2.8%
1741
 
2.2%
1673
 
2.1%
1636
 
2.1%
1543
 
2.0%
Other values (776) 56624
71.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69664
88.1%
Space Separator 2660
 
3.4%
Close Punctuation 2590
 
3.3%
Open Punctuation 2580
 
3.3%
Uppercase Letter 986
 
1.2%
Decimal Number 335
 
0.4%
Lowercase Letter 138
 
0.2%
Other Punctuation 92
 
0.1%
Dash Punctuation 19
 
< 0.1%
Other Symbol 8
 
< 0.1%
Other values (2) 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2909
 
4.2%
2893
 
4.2%
2233
 
3.2%
1741
 
2.5%
1673
 
2.4%
1636
 
2.3%
1543
 
2.2%
1088
 
1.6%
1048
 
1.5%
1042
 
1.5%
Other values (701) 51858
74.4%
Uppercase Letter
ValueCountFrequency (%)
S 136
13.8%
C 107
 
10.9%
K 83
 
8.4%
D 59
 
6.0%
A 56
 
5.7%
F 56
 
5.7%
L 53
 
5.4%
T 46
 
4.7%
R 44
 
4.5%
M 44
 
4.5%
Other values (16) 302
30.6%
Lowercase Letter
ValueCountFrequency (%)
e 15
 
10.9%
a 12
 
8.7%
o 11
 
8.0%
c 11
 
8.0%
i 11
 
8.0%
s 10
 
7.2%
l 9
 
6.5%
b 7
 
5.1%
r 7
 
5.1%
t 7
 
5.1%
Other values (13) 38
27.5%
Decimal Number
ValueCountFrequency (%)
2 123
36.7%
1 102
30.4%
3 49
 
14.6%
5 16
 
4.8%
4 15
 
4.5%
6 10
 
3.0%
0 8
 
2.4%
7 7
 
2.1%
9 3
 
0.9%
8 2
 
0.6%
Other Punctuation
ValueCountFrequency (%)
& 39
42.4%
. 20
21.7%
, 13
 
14.1%
/ 11
 
12.0%
· 7
 
7.6%
: 1
 
1.1%
! 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 2588
99.9%
] 2
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 2578
99.9%
[ 2
 
0.1%
Space Separator
ValueCountFrequency (%)
2660
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Math Symbol
ValueCountFrequency (%)
+ 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69672
88.1%
Common 8282
 
10.5%
Latin 1124
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2909
 
4.2%
2893
 
4.2%
2233
 
3.2%
1741
 
2.5%
1673
 
2.4%
1636
 
2.3%
1543
 
2.2%
1088
 
1.6%
1048
 
1.5%
1042
 
1.5%
Other values (702) 51866
74.4%
Latin
ValueCountFrequency (%)
S 136
 
12.1%
C 107
 
9.5%
K 83
 
7.4%
D 59
 
5.2%
A 56
 
5.0%
F 56
 
5.0%
L 53
 
4.7%
T 46
 
4.1%
R 44
 
3.9%
M 44
 
3.9%
Other values (39) 440
39.1%
Common
ValueCountFrequency (%)
2660
32.1%
) 2588
31.2%
( 2578
31.1%
2 123
 
1.5%
1 102
 
1.2%
3 49
 
0.6%
& 39
 
0.5%
. 20
 
0.2%
- 19
 
0.2%
5 16
 
0.2%
Other values (15) 88
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69664
88.1%
ASCII 9399
 
11.9%
None 15
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2909
 
4.2%
2893
 
4.2%
2233
 
3.2%
1741
 
2.5%
1673
 
2.4%
1636
 
2.3%
1543
 
2.2%
1088
 
1.6%
1048
 
1.5%
1042
 
1.5%
Other values (701) 51858
74.4%
ASCII
ValueCountFrequency (%)
2660
28.3%
) 2588
27.5%
( 2578
27.4%
S 136
 
1.4%
2 123
 
1.3%
C 107
 
1.1%
1 102
 
1.1%
K 83
 
0.9%
D 59
 
0.6%
A 56
 
0.6%
Other values (63) 907
 
9.6%
None
ValueCountFrequency (%)
8
53.3%
· 7
46.7%
Distinct3895
Distinct (%)49.7%
Missing0
Missing (%)0.0%
Memory size61.3 KiB
Minimum1979-07-29 00:00:00
Maximum2023-12-05 00:00:00
2023-12-11T07:25:44.467571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:44.634752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size61.3 KiB
영업
5776 
폐업
1757 
운영중
 
281
폐업 등
 
19

Length

Max length4
Median length2
Mean length2.0407251
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
영업 5776
73.7%
폐업 1757
 
22.4%
운영중 281
 
3.6%
폐업 등 19
 
0.2%

Length

2023-12-11T07:25:44.781712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:25:45.143170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 5776
73.6%
폐업 1776
 
22.6%
운영중 281
 
3.6%
19
 
0.2%

폐업일자
Date

MISSING 

Distinct538
Distinct (%)30.3%
Missing6057
Missing (%)77.3%
Memory size61.3 KiB
Minimum2002-08-16 00:00:00
Maximum2023-12-05 00:00:00
2023-12-11T07:25:45.262341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:45.404022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

다중이용업소여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7833
Missing (%)100.0%
Memory size69.0 KiB

총시설규모(㎡)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7833
Missing (%)100.0%
Memory size69.0 KiB

위생업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size61.3 KiB
<NA>
7624 
집단급식소
 
209

Length

Max length5
Median length4
Mean length4.026682
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 7624
97.3%
집단급식소 209
 
2.7%

Length

2023-12-11T07:25:45.520617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:25:45.599915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 7624
97.3%
집단급식소 209
 
2.7%

위생업태명
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size61.3 KiB
위탁급식영업
2434 
어린이집
1817 
산업체
1474 
학교
549 
사회복지시설
541 
Other values (7)
1018 

Length

Max length8
Median length6
Mean length4.4166986
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row위탁급식영업
2nd row병원
3rd row산업체
4th row사회복지시설
5th row병원

Common Values

ValueCountFrequency (%)
위탁급식영업 2434
31.1%
어린이집 1817
23.2%
산업체 1474
18.8%
학교 549
 
7.0%
사회복지시설 541
 
6.9%
병원 365
 
4.7%
공공기관 348
 
4.4%
기타 집단급식소 165
 
2.1%
<NA> 92
 
1.2%
기숙사 27
 
0.3%
Other values (2) 21
 
0.3%

Length

2023-12-11T07:25:45.689895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
위탁급식영업 2434
30.4%
어린이집 1817
22.7%
산업체 1474
18.4%
학교 549
 
6.9%
사회복지시설 541
 
6.8%
병원 365
 
4.6%
공공기관 348
 
4.4%
집단급식소 167
 
2.1%
기타 165
 
2.1%
na 92
 
1.2%
Other values (2) 46
 
0.6%
Distinct6797
Distinct (%)87.7%
Missing79
Missing (%)1.0%
Memory size61.3 KiB
2023-12-11T07:25:45.973072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length55
Mean length32.626128
Min length13

Characters and Unicode

Total characters252983
Distinct characters703
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5934 ?
Unique (%)76.5%

Sample

1st row경기도 가평군 설악면 유명로 961-34, 클럽하우스동 지하2층
2nd row경기도 가평군 가평읍 북한강변로 160, 2층
3rd row경기도 가평군 가평읍 물안산길 42, 6동 2층
4th row경기도 가평군 가평읍 가화로 161, 노인복지관 1층
5th row경기도 가평군 설악면 미사리로 267-177
ValueCountFrequency (%)
경기도 7755
 
14.5%
1층 1549
 
2.9%
화성시 729
 
1.4%
일부호 711
 
1.3%
용인시 623
 
1.2%
안산시 619
 
1.2%
성남시 597
 
1.1%
지하1층 594
 
1.1%
2층 571
 
1.1%
시흥시 544
 
1.0%
Other values (9253) 39318
73.3%
2023-12-11T07:25:46.422302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45915
 
18.1%
1 9243
 
3.7%
8769
 
3.5%
8421
 
3.3%
8395
 
3.3%
8187
 
3.2%
8156
 
3.2%
6992
 
2.8%
( 6417
 
2.5%
) 6416
 
2.5%
Other values (693) 136072
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 149213
59.0%
Space Separator 45915
 
18.1%
Decimal Number 35647
 
14.1%
Open Punctuation 6422
 
2.5%
Close Punctuation 6421
 
2.5%
Other Punctuation 6267
 
2.5%
Dash Punctuation 1642
 
0.6%
Uppercase Letter 1161
 
0.5%
Lowercase Letter 251
 
0.1%
Math Symbol 35
 
< 0.1%
Other values (2) 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8769
 
5.9%
8421
 
5.6%
8395
 
5.6%
8187
 
5.5%
8156
 
5.5%
6992
 
4.7%
4750
 
3.2%
3250
 
2.2%
3199
 
2.1%
2554
 
1.7%
Other values (616) 86540
58.0%
Uppercase Letter
ValueCountFrequency (%)
A 233
20.1%
B 171
14.7%
C 114
9.8%
D 69
 
5.9%
S 64
 
5.5%
E 62
 
5.3%
T 51
 
4.4%
L 51
 
4.4%
K 50
 
4.3%
H 41
 
3.5%
Other values (16) 255
22.0%
Lowercase Letter
ValueCountFrequency (%)
e 53
21.1%
o 25
10.0%
a 21
 
8.4%
t 19
 
7.6%
r 16
 
6.4%
c 15
 
6.0%
s 13
 
5.2%
n 13
 
5.2%
p 11
 
4.4%
i 11
 
4.4%
Other values (13) 54
21.5%
Decimal Number
ValueCountFrequency (%)
1 9243
25.9%
2 5316
14.9%
3 3812
10.7%
4 3120
 
8.8%
5 2885
 
8.1%
0 2547
 
7.1%
7 2453
 
6.9%
6 2414
 
6.8%
8 1936
 
5.4%
9 1921
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 6203
99.0%
& 23
 
0.4%
. 19
 
0.3%
/ 8
 
0.1%
@ 5
 
0.1%
* 5
 
0.1%
· 3
 
< 0.1%
: 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 6417
99.9%
[ 5
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 6416
99.9%
] 5
 
0.1%
Letter Number
ValueCountFrequency (%)
3
60.0%
2
40.0%
Space Separator
ValueCountFrequency (%)
45915
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1642
100.0%
Math Symbol
ValueCountFrequency (%)
~ 35
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 149215
59.0%
Common 102349
40.5%
Latin 1417
 
0.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8769
 
5.9%
8421
 
5.6%
8395
 
5.6%
8187
 
5.5%
8156
 
5.5%
6992
 
4.7%
4750
 
3.2%
3250
 
2.2%
3199
 
2.1%
2554
 
1.7%
Other values (615) 86542
58.0%
Latin
ValueCountFrequency (%)
A 233
16.4%
B 171
 
12.1%
C 114
 
8.0%
D 69
 
4.9%
S 64
 
4.5%
E 62
 
4.4%
e 53
 
3.7%
T 51
 
3.6%
L 51
 
3.6%
K 50
 
3.5%
Other values (41) 499
35.2%
Common
ValueCountFrequency (%)
45915
44.9%
1 9243
 
9.0%
( 6417
 
6.3%
) 6416
 
6.3%
, 6203
 
6.1%
2 5316
 
5.2%
3 3812
 
3.7%
4 3120
 
3.0%
5 2885
 
2.8%
0 2547
 
2.5%
Other values (15) 10475
 
10.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 149211
59.0%
ASCII 103758
41.0%
None 7
 
< 0.1%
Number Forms 5
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45915
44.3%
1 9243
 
8.9%
( 6417
 
6.2%
) 6416
 
6.2%
, 6203
 
6.0%
2 5316
 
5.1%
3 3812
 
3.7%
4 3120
 
3.0%
5 2885
 
2.8%
0 2547
 
2.5%
Other values (63) 11884
 
11.5%
Hangul
ValueCountFrequency (%)
8769
 
5.9%
8421
 
5.6%
8395
 
5.6%
8187
 
5.5%
8156
 
5.5%
6992
 
4.7%
4750
 
3.2%
3250
 
2.2%
3199
 
2.1%
2554
 
1.7%
Other values (614) 86538
58.0%
None
ValueCountFrequency (%)
4
57.1%
· 3
42.9%
Number Forms
ValueCountFrequency (%)
3
60.0%
2
40.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct6863
Distinct (%)87.6%
Missing2
Missing (%)< 0.1%
Memory size61.3 KiB
2023-12-11T07:25:46.791503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length52
Mean length26.419487
Min length14

Characters and Unicode

Total characters206891
Distinct characters668
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6013 ?
Unique (%)76.8%

Sample

1st row경기도 가평군 설악면 방일리 산 90-2 클럽하우스동 지하2층
2nd row경기도 가평군 가평읍 금대리 585 외1필지, 2층
3rd row경기도 가평군 가평읍 개곡리 산 280 외 5필지, 6동 2층
4th row경기도 가평군 가평읍 읍내리 625-8 노인복지관
5th row경기도 가평군 설악면 송산리 426-10
ValueCountFrequency (%)
경기도 7832
 
17.1%
1층 818
 
1.8%
화성시 740
 
1.6%
용인시 631
 
1.4%
안산시 619
 
1.3%
성남시 597
 
1.3%
일부 550
 
1.2%
시흥시 544
 
1.2%
평택시 525
 
1.1%
지하1층 413
 
0.9%
Other values (9048) 32592
71.1%
2023-12-11T07:25:47.248559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45357
21.9%
8700
 
4.2%
8295
 
4.0%
8166
 
3.9%
1 8159
 
3.9%
7951
 
3.8%
7477
 
3.6%
- 4875
 
2.4%
2 4477
 
2.2%
3 3697
 
1.8%
Other values (658) 99737
48.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 119053
57.5%
Space Separator 45357
 
21.9%
Decimal Number 34743
 
16.8%
Dash Punctuation 4875
 
2.4%
Uppercase Letter 919
 
0.4%
Close Punctuation 649
 
0.3%
Open Punctuation 648
 
0.3%
Other Punctuation 404
 
0.2%
Lowercase Letter 218
 
0.1%
Math Symbol 16
 
< 0.1%
Other values (3) 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8700
 
7.3%
8295
 
7.0%
8166
 
6.9%
7951
 
6.7%
7477
 
6.3%
3227
 
2.7%
2762
 
2.3%
2639
 
2.2%
2300
 
1.9%
2080
 
1.7%
Other values (583) 65456
55.0%
Uppercase Letter
ValueCountFrequency (%)
A 167
18.2%
B 109
11.9%
C 86
 
9.4%
D 59
 
6.4%
S 53
 
5.8%
T 49
 
5.3%
E 49
 
5.3%
L 47
 
5.1%
K 44
 
4.8%
H 34
 
3.7%
Other values (16) 222
24.2%
Lowercase Letter
ValueCountFrequency (%)
e 47
21.6%
o 24
11.0%
t 21
9.6%
a 16
 
7.3%
r 15
 
6.9%
c 12
 
5.5%
s 11
 
5.0%
n 11
 
5.0%
p 8
 
3.7%
i 8
 
3.7%
Other values (13) 45
20.6%
Decimal Number
ValueCountFrequency (%)
1 8159
23.5%
2 4477
12.9%
3 3697
10.6%
4 3196
 
9.2%
5 3055
 
8.8%
6 2866
 
8.2%
0 2527
 
7.3%
7 2506
 
7.2%
8 2202
 
6.3%
9 2058
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 344
85.1%
. 21
 
5.2%
& 21
 
5.2%
@ 7
 
1.7%
/ 6
 
1.5%
· 3
 
0.7%
: 2
 
0.5%
Letter Number
ValueCountFrequency (%)
2
50.0%
2
50.0%
Space Separator
ValueCountFrequency (%)
45357
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4875
100.0%
Close Punctuation
ValueCountFrequency (%)
) 649
100.0%
Open Punctuation
ValueCountFrequency (%)
( 648
100.0%
Math Symbol
ValueCountFrequency (%)
~ 16
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 119055
57.5%
Common 86693
41.9%
Latin 1141
 
0.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8700
 
7.3%
8295
 
7.0%
8166
 
6.9%
7951
 
6.7%
7477
 
6.3%
3227
 
2.7%
2762
 
2.3%
2639
 
2.2%
2300
 
1.9%
2080
 
1.7%
Other values (582) 65458
55.0%
Latin
ValueCountFrequency (%)
A 167
 
14.6%
B 109
 
9.6%
C 86
 
7.5%
D 59
 
5.2%
S 53
 
4.6%
T 49
 
4.3%
E 49
 
4.3%
e 47
 
4.1%
L 47
 
4.1%
K 44
 
3.9%
Other values (41) 431
37.8%
Common
ValueCountFrequency (%)
45357
52.3%
1 8159
 
9.4%
- 4875
 
5.6%
2 4477
 
5.2%
3 3697
 
4.3%
4 3196
 
3.7%
5 3055
 
3.5%
6 2866
 
3.3%
0 2527
 
2.9%
7 2506
 
2.9%
Other values (13) 5978
 
6.9%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 119048
57.5%
ASCII 87827
42.5%
None 7
 
< 0.1%
Number Forms 4
 
< 0.1%
Compat Jamo 3
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45357
51.6%
1 8159
 
9.3%
- 4875
 
5.6%
2 4477
 
5.1%
3 3697
 
4.2%
4 3196
 
3.6%
5 3055
 
3.5%
6 2866
 
3.3%
0 2527
 
2.9%
7 2506
 
2.9%
Other values (61) 7112
 
8.1%
Hangul
ValueCountFrequency (%)
8700
 
7.3%
8295
 
7.0%
8166
 
6.9%
7951
 
6.7%
7477
 
6.3%
3227
 
2.7%
2762
 
2.3%
2639
 
2.2%
2300
 
1.9%
2080
 
1.7%
Other values (578) 65451
55.0%
None
ValueCountFrequency (%)
4
57.1%
· 3
42.9%
Number Forms
ValueCountFrequency (%)
2
50.0%
2
50.0%
Compat Jamo
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2829
Distinct (%)36.6%
Missing105
Missing (%)1.3%
Infinite0
Infinite (%)0.0%
Mean14770.427
Minimum10005
Maximum18635
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size69.0 KiB
2023-12-11T07:25:47.365722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10005
5-th percentile10333
Q112654.5
median15062
Q317118
95-th percentile18510
Maximum18635
Range8630
Interquartile range (IQR)4463.5

Descriptive statistics

Standard deviation2630.0452
Coefficient of variation (CV)0.17806156
Kurtosis-1.1631402
Mean14770.427
Median Absolute Deviation (MAD)2255
Skewness-0.24429707
Sum1.1414586 × 108
Variance6917137.6
MonotonicityNot monotonic
2023-12-11T07:25:47.473846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18623 54
 
0.7%
18487 44
 
0.6%
13487 34
 
0.4%
17118 32
 
0.4%
18622 29
 
0.4%
11521 28
 
0.4%
18103 26
 
0.3%
18449 25
 
0.3%
17746 21
 
0.3%
13486 20
 
0.3%
Other values (2819) 7415
94.7%
(Missing) 105
 
1.3%
ValueCountFrequency (%)
10005 1
 
< 0.1%
10009 1
 
< 0.1%
10011 8
0.1%
10012 1
 
< 0.1%
10013 2
 
< 0.1%
10016 3
 
< 0.1%
10017 3
 
< 0.1%
10019 2
 
< 0.1%
10020 5
0.1%
10021 3
 
< 0.1%
ValueCountFrequency (%)
18635 4
0.1%
18633 1
 
< 0.1%
18631 5
0.1%
18630 4
0.1%
18629 1
 
< 0.1%
18628 1
 
< 0.1%
18627 3
< 0.1%
18626 1
 
< 0.1%
18625 4
0.1%
18624 3
< 0.1%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct5572
Distinct (%)72.1%
Missing108
Missing (%)1.4%
Infinite0
Infinite (%)0.0%
Mean37.390488
Minimum36.93703
Maximum38.171177
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size69.0 KiB
2023-12-11T07:25:47.583182image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.93703
5-th percentile37.03134
Q137.244148
median37.348958
Q337.528209
95-th percentile37.813357
Maximum38.171177
Range1.2341477
Interquartile range (IQR)0.28406108

Descriptive statistics

Standard deviation0.2319061
Coefficient of variation (CV)0.0062022753
Kurtosis-0.41045167
Mean37.390488
Median Absolute Deviation (MAD)0.13771982
Skewness0.41132943
Sum288841.52
Variance0.053780439
MonotonicityNot monotonic
2023-12-11T07:25:47.701006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.2999636094 10
 
0.1%
37.6115294238 10
 
0.1%
37.6403798187 9
 
0.1%
37.4285880307 9
 
0.1%
37.2529797883 9
 
0.1%
37.2907221232 8
 
0.1%
37.8098859526 8
 
0.1%
37.4383097867 8
 
0.1%
37.7549839908 7
 
0.1%
37.3397369385 7
 
0.1%
Other values (5562) 7640
97.5%
(Missing) 108
 
1.4%
ValueCountFrequency (%)
36.9370295505 4
0.1%
36.9378160852 1
 
< 0.1%
36.9423163206 1
 
< 0.1%
36.9433626437 4
0.1%
36.9445487055 2
< 0.1%
36.9506021366 4
0.1%
36.9512029715 1
 
< 0.1%
36.9515354628 1
 
< 0.1%
36.9519013346 2
< 0.1%
36.9523591039 1
 
< 0.1%
ValueCountFrequency (%)
38.171177259 1
< 0.1%
38.0986800654 1
< 0.1%
38.07992318 1
< 0.1%
38.0733390604 2
< 0.1%
38.0609177862 1
< 0.1%
38.0598698206 1
< 0.1%
38.0491130965 2
< 0.1%
38.0483373926 1
< 0.1%
38.0320583126 1
< 0.1%
38.0301190484 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct5572
Distinct (%)72.1%
Missing108
Missing (%)1.4%
Infinite0
Infinite (%)0.0%
Mean126.99342
Minimum126.53893
Maximum127.78118
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size69.0 KiB
2023-12-11T07:25:47.817240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.53893
5-th percentile126.71669
Q1126.81038
median126.98749
Q3127.11782
95-th percentile127.38194
Maximum127.78118
Range1.2422423
Interquartile range (IQR)0.30743865

Descriptive statistics

Standard deviation0.20669135
Coefficient of variation (CV)0.0016275752
Kurtosis0.17249377
Mean126.99342
Median Absolute Deviation (MAD)0.15009742
Skewness0.54376552
Sum981024.19
Variance0.042721312
MonotonicityNot monotonic
2023-12-11T07:25:47.950187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.8356453998 10
 
0.1%
126.8345037964 10
 
0.1%
126.8737927852 9
 
0.1%
126.9874907386 9
 
0.1%
127.4806834787 9
 
0.1%
127.1967269237 8
 
0.1%
126.7740169561 8
 
0.1%
126.8846367245 8
 
0.1%
127.0334258147 7
 
0.1%
126.7335459329 7
 
0.1%
Other values (5562) 7640
97.5%
(Missing) 108
 
1.4%
ValueCountFrequency (%)
126.538933301 2
< 0.1%
126.540609067 1
< 0.1%
126.54660312 2
< 0.1%
126.5494942369 2
< 0.1%
126.5522087631 1
< 0.1%
126.5527033181 2
< 0.1%
126.5537612382 1
< 0.1%
126.5560224786 1
< 0.1%
126.5561823996 2
< 0.1%
126.5570145405 1
< 0.1%
ValueCountFrequency (%)
127.7811755617 1
 
< 0.1%
127.7761461418 1
 
< 0.1%
127.7265930476 3
< 0.1%
127.7114410955 1
 
< 0.1%
127.7021476459 1
 
< 0.1%
127.7014405714 2
< 0.1%
127.6981639156 2
< 0.1%
127.6959380688 2
< 0.1%
127.6918987923 1
 
< 0.1%
127.684004699 1
 
< 0.1%

Interactions

2023-12-11T07:25:42.886356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.312274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.591362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.984916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.396423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.677915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:43.082048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.483496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:25:42.771143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:25:48.039768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명위생업태명소재지우편번호WGS84위도WGS84경도
시군명1.0000.1920.4220.9900.9420.925
영업상태명0.1921.0000.6530.0970.0810.077
위생업태명0.4220.6531.0000.2810.2930.185
소재지우편번호0.9900.0970.2811.0000.9090.835
WGS84위도0.9420.0810.2930.9091.0000.521
WGS84경도0.9250.0770.1850.8350.5211.000
2023-12-11T07:25:48.155672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명위생업종명영업상태명위생업태명
시군명1.0001.0000.1010.157
위생업종명1.0001.0001.0001.000
영업상태명0.1011.0001.0000.459
위생업태명0.1571.0000.4591.000
2023-12-11T07:25:48.254925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지우편번호WGS84위도WGS84경도시군명영업상태명위생업종명위생업태명
소재지우편번호1.000-0.9290.1670.9180.0581.0000.123
WGS84위도-0.9291.000-0.2100.7060.0481.0000.129
WGS84경도0.167-0.2101.0000.6560.0461.0000.080
시군명0.9180.7060.6561.0000.1011.0000.157
영업상태명0.0580.0480.0460.1011.0001.0000.459
위생업종명1.0001.0001.0001.0001.0001.0001.000
위생업태명0.1230.1290.0800.1570.4591.0001.000

Missing values

2023-12-11T07:25:43.210695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:25:43.402591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:25:43.544619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
0가평군씨제이프레시웨이(주)아난티가평점클럽하우스2022-07-29영업<NA><NA><NA><NA>위탁급식영업경기도 가평군 설악면 유명로 961-34, 클럽하우스동 지하2층경기도 가평군 설악면 방일리 산 90-2 클럽하우스동 지하2층1247237.620183127.482277
1가평군강변요양병원20210513영업<NA><NA><NA><NA>병원경기도 가평군 가평읍 북한강변로 160, 2층경기도 가평군 가평읍 금대리 585 외1필지, 2층1242837.747333127.525983
2가평군(주)케이씨씨글라스 가평공장2009-05-13영업<NA><NA><NA><NA>산업체경기도 가평군 가평읍 물안산길 42, 6동 2층경기도 가평군 가평읍 개곡리 산 280 외 5필지, 6동 2층1241037.861024127.532241
3가평군재단법인가평군복지재단(가평군노인복지관)20210126영업<NA><NA><NA><NA>사회복지시설경기도 가평군 가평읍 가화로 161, 노인복지관 1층경기도 가평군 가평읍 읍내리 625-8 노인복지관1241337.833628127.511205
4가평군HJ매그놀리아국제병원,HJ매그놀리아국제요양병원2019-06-07영업<NA><NA><NA><NA>병원경기도 가평군 설악면 미사리로 267-177경기도 가평군 설악면 송산리 426-101246137.691127127.521381
5가평군신응수강변요양병원점20200318영업<NA><NA><NA><NA>위탁급식영업경기도 가평군 가평읍 북한강변로 160, 2층경기도 가평군 가평읍 금대리 585 외1필지(589) 2층1242837.747333127.525983
6가평군서울특별시교육청학생교육원축령산본원교육원20060710영업<NA><NA><NA><NA>공공기관경기도 가평군 상면 축령로45번길 40-135 (외 27필지)경기도 가평군 상면 행현리 산 136 외 27필지1244837.770447127.354775
7가평군(주)아워홈 교원가평비전센터점20210104영업<NA><NA><NA><NA>위탁급식영업경기도 가평군 설악면 유명로 2182, 교원비전센터 지하1층경기도 가평군 설악면 회곡리 313-1 외 2필지(322-2, 327-2)1245937.701531127.456575
8가평군(주)현대그린푸드 청평연수원지점2015-06-30영업<NA><NA><NA><NA>위탁급식영업경기도 가평군 청평면 고재길 33, 1층경기도 가평군 청평면 고성리 410-1 1층1245637.714401127.504255
9가평군가평현등연구소 집단급식소20081021영업<NA><NA><NA><NA>산업체경기도 가평군 조종면 운악청계로490번길 70-2 (외 33필지)경기도 가평군 조종면 운악리 64 외 33필지1243237.857792127.369269
시군명사업장명인허가일자영업상태명폐업일자다중이용업소여부총시설규모(㎡)위생업종명위생업태명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
7823화성시산길어린이집20110712폐업20221024<NA><NA><NA>어린이집경기도 화성시 병점1로 66-12 (병점동,(하이코아빌딩)105호)경기도 화성시 병점동 490-2 (하이코아빌딩)105호1841737.202138127.041527
7824화성시통통어린이집20061221폐업20221024<NA><NA><NA>어린이집경기도 화성시 정남면 덕절전원길 41경기도 화성시 정남면 덕절리 4431851537.138466127.026351
7825화성시나래어린이집20060804폐업20221024<NA><NA><NA>어린이집경기도 화성시 정남면 만년로 539-28경기도 화성시 정남면 괘랑리 1058-11851637.170381126.982391
7826화성시오누이 어린이집20060224폐업20221024<NA><NA><NA>어린이집경기도 화성시 병점중앙로155번길 15-14 (진안동)경기도 화성시 진안동 524-261840137.211092127.037652
7827화성시(주)자연애FNT 참유치원점20200120폐업20220211<NA><NA><NA>위탁급식영업경기도 화성시 병점2로 45, AA참교육문화센터 2층 일부호 (병점동)경기도 화성시 병점동 813 AA참교육문화센터 2층 일부호1841037.207024127.04152
7828화성시(주)원익아이피에스20190226폐업20220215<NA><NA><NA>산업체경기도 화성시 경기동로 267-24, B동 1층 (장지동)경기도 화성시 장지동 164-5 B동 1층1849937.159922127.102994
7829화성시(주)동일씨앤이2004-05-07폐업2023-08-01<NA><NA><NA>산업체경기도 화성시 팔탄면 칙골길 19경기도 화성시 팔탄면 월문리 55-11857737.123324126.866517
7830화성시대신기계공업2018-11-23폐업2023-11-10<NA><NA><NA>위탁급식영업경기도 화성시 정남면 만년로 711-78, 나동 1층경기도 화성시 정남면 괘랑리 480-1 1층 나동1851637.184761126.989042
7831화성시호텔푸르미르20160118폐업20221227<NA><NA><NA>산업체경기도 화성시 효행로 480 (안녕동)경기도 화성시 안녕동 188-21832437.209042126.984404
7832화성시한국건설기술연구원20150326폐업 등20180103<NA><NA>집단급식소공공기관경기도 화성시 마도면 마도로182번길 64 (건축물안전성능실험센터)경기도 화성시 마도면 백곡리 451-1번지1854437.178355126.725379