Overview

Dataset statistics

Number of variables18
Number of observations8227
Missing cells17813
Missing cells (%)12.0%
Duplicate rows313
Duplicate rows (%)3.8%
Total size in memory1.2 MiB
Average record size in memory151.0 B

Variable types

Categorical4
Numeric6
Text8

Dataset

Description시군구코드,처분일자,교부번호,업종명,업태명,업소명,소재지도로명,소재지지번,지도점검일자,행정처분상태,처분명,법적근거,위반일자,위반내용,처분내용,처분기간,영업장면적(㎡),운영형태
Author성북구
URLhttps://data.seoul.go.kr/dataList/OA-11143/S/1/datasetView.do

Alerts

시군구코드 has constant value ""Constant
행정처분상태 has constant value ""Constant
Dataset has 313 (3.8%) duplicate rowsDuplicates
업종명 is highly overall correlated with 운영형태High correlation
운영형태 is highly overall correlated with 지도점검일자 and 2 other fieldsHigh correlation
처분일자 is highly overall correlated with 교부번호 and 2 other fieldsHigh correlation
교부번호 is highly overall correlated with 처분일자 and 2 other fieldsHigh correlation
지도점검일자 is highly overall correlated with 처분일자 and 3 other fieldsHigh correlation
위반일자 is highly overall correlated with 처분일자 and 3 other fieldsHigh correlation
업종명 is highly imbalanced (57.8%)Imbalance
운영형태 is highly imbalanced (94.6%)Imbalance
소재지도로명 has 4600 (55.9%) missing valuesMissing
처분기간 has 7543 (91.7%) missing valuesMissing
영업장면적(㎡) has 5655 (68.7%) missing valuesMissing
지도점검일자 is highly skewed (γ1 = -78.93012893)Skewed
위반일자 is highly skewed (γ1 = -59.49116985)Skewed

Reproduction

Analysis started2024-04-29 15:23:07.318087
Analysis finished2024-04-29 15:23:15.996077
Duration8.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
3070000
8227 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3070000
2nd row3070000
3rd row3070000
4th row3070000
5th row3070000

Common Values

ValueCountFrequency (%)
3070000 8227
100.0%

Length

2024-04-30T00:23:16.061068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T00:23:16.154078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3070000 8227
100.0%

처분일자
Real number (ℝ)

HIGH CORRELATION 

Distinct2748
Distinct (%)33.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20121410
Minimum20001106
Maximum20800317
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size72.4 KiB
2024-04-30T00:23:16.278217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20001106
5-th percentile20030325
Q120070630
median20110309
Q320171208
95-th percentile20231120
Maximum20800317
Range799211
Interquartile range (IQR)100578.5

Descriptive statistics

Standard deviation65850.903
Coefficient of variation (CV)0.0032726784
Kurtosis0.21455482
Mean20121410
Median Absolute Deviation (MAD)50183
Skewness0.40848193
Sum1.6553884 × 1011
Variance4.3363415 × 109
MonotonicityDecreasing
2024-04-30T00:23:16.436973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20201216 66
 
0.8%
20230616 65
 
0.8%
20080414 58
 
0.7%
20230627 56
 
0.7%
20080305 54
 
0.7%
20240111 54
 
0.7%
20230623 53
 
0.6%
20080321 47
 
0.6%
20050704 47
 
0.6%
20080327 41
 
0.5%
Other values (2738) 7686
93.4%
ValueCountFrequency (%)
20001106 4
< 0.1%
20001208 1
 
< 0.1%
20010206 2
< 0.1%
20010217 2
< 0.1%
20010226 1
 
< 0.1%
20010329 2
< 0.1%
20010406 2
< 0.1%
20010409 4
< 0.1%
20010426 1
 
< 0.1%
20010507 1
 
< 0.1%
ValueCountFrequency (%)
20800317 1
 
< 0.1%
20240404 1
 
< 0.1%
20240319 1
 
< 0.1%
20240314 7
0.1%
20240228 4
< 0.1%
20240223 1
 
< 0.1%
20240222 1
 
< 0.1%
20240221 5
0.1%
20240220 1
 
< 0.1%
20240219 1
 
< 0.1%

교부번호
Real number (ℝ)

HIGH CORRELATION 

Distinct4248
Distinct (%)51.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0029842 × 1010
Minimum1.963005 × 1010
Maximum2.0230068 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size72.4 KiB
2024-04-30T00:23:16.596673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.963005 × 1010
5-th percentile1.988005 × 1010
Q11.9980051 × 1010
median2.0030051 × 1010
Q32.009005 × 1010
95-th percentile2.0170051 × 1010
Maximum2.0230068 × 1010
Range6.0001758 × 108
Interquartile range (IQR)1.0999939 × 108

Descriptive statistics

Standard deviation88232380
Coefficient of variation (CV)0.0044050462
Kurtosis1.4746014
Mean2.0029842 × 1010
Median Absolute Deviation (MAD)50000247
Skewness-0.64723222
Sum1.6478551 × 1014
Variance7.7849528 × 1015
MonotonicityNot monotonic
2024-04-30T00:23:16.768879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20010051169 95
 
1.2%
20050050498 48
 
0.6%
20100050359 28
 
0.3%
20200051044 26
 
0.3%
20070050615 25
 
0.3%
19950050486 25
 
0.3%
19990050677 24
 
0.3%
19950050320 22
 
0.3%
20180050303 22
 
0.3%
19990050117 21
 
0.3%
Other values (4238) 7891
95.9%
ValueCountFrequency (%)
19630050001 3
 
< 0.1%
19650050002 10
0.1%
19670050003 1
 
< 0.1%
19670050004 1
 
< 0.1%
19680050012 2
 
< 0.1%
19690050007 1
 
< 0.1%
19690050013 11
0.1%
19700050005 1
 
< 0.1%
19710050003 16
0.2%
19710050006 1
 
< 0.1%
ValueCountFrequency (%)
20230067585 1
 
< 0.1%
20230067428 3
< 0.1%
20230067365 1
 
< 0.1%
20230067256 4
< 0.1%
20230067208 1
 
< 0.1%
20220060110 3
< 0.1%
20220060108 1
 
< 0.1%
20220059988 1
 
< 0.1%
20220059428 1
 
< 0.1%
20220059101 1
 
< 0.1%

업종명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
일반음식점
5847 
식품제조가공업
 
449
즉석판매제조가공업
 
407
단란주점
 
377
휴게음식점
 
355
Other values (15)
792 

Length

Max length13
Median length5
Mean length5.498116
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 5847
71.1%
식품제조가공업 449
 
5.5%
즉석판매제조가공업 407
 
4.9%
단란주점 377
 
4.6%
휴게음식점 355
 
4.3%
건강기능식품일반판매업 183
 
2.2%
식품소분업 109
 
1.3%
제과점영업 109
 
1.3%
집단급식소 81
 
1.0%
기타식품판매업 81
 
1.0%
Other values (10) 229
 
2.8%

Length

2024-04-30T00:23:16.913099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반음식점 5847
70.4%
식품제조가공업 449
 
5.4%
즉석판매제조가공업 407
 
4.9%
단란주점 377
 
4.5%
휴게음식점 355
 
4.3%
건강기능식품일반판매업 183
 
2.2%
식품소분업 109
 
1.3%
제과점영업 109
 
1.3%
집단급식소 81
 
1.0%
기타식품판매업 81
 
1.0%
Other values (11) 303
 
3.7%
Distinct60
Distinct (%)0.7%
Missing4
Missing (%)< 0.1%
Memory size64.4 KiB
2024-04-30T00:23:17.108041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length4.3248206
Min length2

Characters and Unicode

Total characters35563
Distinct characters137
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row탕류(보신용)
2nd row한식
3rd row정종/대포집/소주방
4th row분식
5th row호프/통닭
ValueCountFrequency (%)
한식 2022
24.1%
호프/통닭 972
11.6%
기타 682
 
8.1%
식품제조가공업 449
 
5.4%
즉석판매제조가공업 407
 
4.9%
단란주점 377
 
4.5%
정종/대포집/소주방 367
 
4.4%
통닭(치킨 356
 
4.3%
까페 344
 
4.1%
경양식 267
 
3.2%
Other values (50) 2131
25.4%
2024-04-30T00:23:17.438270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3724
 
10.5%
2022
 
5.7%
/ 1706
 
4.8%
1504
 
4.2%
1430
 
4.0%
1328
 
3.7%
973
 
2.7%
972
 
2.7%
971
 
2.7%
947
 
2.7%
Other values (127) 19986
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32524
91.5%
Other Punctuation 1722
 
4.8%
Close Punctuation 583
 
1.6%
Open Punctuation 583
 
1.6%
Space Separator 151
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3724
 
11.5%
2022
 
6.2%
1504
 
4.6%
1430
 
4.4%
1328
 
4.1%
973
 
3.0%
972
 
3.0%
971
 
3.0%
947
 
2.9%
906
 
2.8%
Other values (121) 17747
54.6%
Other Punctuation
ValueCountFrequency (%)
/ 1706
99.1%
, 10
 
0.6%
. 6
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 583
100.0%
Open Punctuation
ValueCountFrequency (%)
( 583
100.0%
Space Separator
ValueCountFrequency (%)
151
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32524
91.5%
Common 3039
 
8.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3724
 
11.5%
2022
 
6.2%
1504
 
4.6%
1430
 
4.4%
1328
 
4.1%
973
 
3.0%
972
 
3.0%
971
 
3.0%
947
 
2.9%
906
 
2.8%
Other values (121) 17747
54.6%
Common
ValueCountFrequency (%)
/ 1706
56.1%
) 583
 
19.2%
( 583
 
19.2%
151
 
5.0%
, 10
 
0.3%
. 6
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32524
91.5%
ASCII 3039
 
8.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3724
 
11.5%
2022
 
6.2%
1504
 
4.6%
1430
 
4.4%
1328
 
4.1%
973
 
3.0%
972
 
3.0%
971
 
3.0%
947
 
2.9%
906
 
2.8%
Other values (121) 17747
54.6%
ASCII
ValueCountFrequency (%)
/ 1706
56.1%
) 583
 
19.2%
( 583
 
19.2%
151
 
5.0%
, 10
 
0.3%
. 6
 
0.2%
Distinct4244
Distinct (%)51.6%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
2024-04-30T00:23:17.707056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length29
Mean length5.3566306
Min length1

Characters and Unicode

Total characters44069
Distinct characters950
Distinct categories13 ?
Distinct scripts5 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2586 ?
Unique (%)31.4%

Sample

1st row소머리국밥집
2nd row큰대쪽갈비
3rd row펍피맥
4th row한갈비탕
5th row청춘퓨전포차
ValueCountFrequency (%)
성일식품 90
 
1.0%
국제유통 48
 
0.5%
주식회사 30
 
0.3%
성신실내포차 26
 
0.3%
성신여대점 25
 
0.3%
대우푸르지오함바 25
 
0.3%
쌍둥이식당 25
 
0.3%
소문난만두 25
 
0.3%
무학7080소주방 20
 
0.2%
태양유통 20
 
0.2%
Other values (4571) 8794
96.3%
2024-04-30T00:23:18.118782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
956
 
2.2%
901
 
2.0%
805
 
1.8%
) 736
 
1.7%
( 735
 
1.7%
678
 
1.5%
672
 
1.5%
624
 
1.4%
564
 
1.3%
509
 
1.2%
Other values (940) 36889
83.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 38894
88.3%
Lowercase Letter 1130
 
2.6%
Uppercase Letter 916
 
2.1%
Space Separator 901
 
2.0%
Close Punctuation 738
 
1.7%
Open Punctuation 737
 
1.7%
Decimal Number 543
 
1.2%
Other Punctuation 178
 
0.4%
Dash Punctuation 16
 
< 0.1%
Math Symbol 6
 
< 0.1%
Other values (3) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
956
 
2.5%
805
 
2.1%
678
 
1.7%
672
 
1.7%
624
 
1.6%
564
 
1.5%
509
 
1.3%
486
 
1.2%
475
 
1.2%
414
 
1.1%
Other values (858) 32711
84.1%
Uppercase Letter
ValueCountFrequency (%)
A 82
 
9.0%
O 77
 
8.4%
B 75
 
8.2%
S 65
 
7.1%
C 56
 
6.1%
N 50
 
5.5%
E 49
 
5.3%
L 47
 
5.1%
K 44
 
4.8%
H 38
 
4.1%
Other values (16) 333
36.4%
Lowercase Letter
ValueCountFrequency (%)
a 149
13.2%
e 149
13.2%
o 104
 
9.2%
n 64
 
5.7%
r 63
 
5.6%
i 60
 
5.3%
s 59
 
5.2%
c 50
 
4.4%
l 49
 
4.3%
f 47
 
4.2%
Other values (15) 336
29.7%
Other Punctuation
ValueCountFrequency (%)
& 44
24.7%
. 42
23.6%
' 35
19.7%
, 26
14.6%
; 10
 
5.6%
/ 5
 
2.8%
? 5
 
2.8%
! 4
 
2.2%
4
 
2.2%
2
 
1.1%
Decimal Number
ValueCountFrequency (%)
0 157
28.9%
8 82
15.1%
2 71
13.1%
7 67
12.3%
1 40
 
7.4%
9 38
 
7.0%
5 27
 
5.0%
3 25
 
4.6%
4 22
 
4.1%
6 14
 
2.6%
Close Punctuation
ValueCountFrequency (%)
) 736
99.7%
] 2
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 735
99.7%
[ 2
 
0.3%
Space Separator
ValueCountFrequency (%)
901
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%
Letter Number
ValueCountFrequency (%)
4
100.0%
Other Number
ValueCountFrequency (%)
4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 38848
88.2%
Common 3125
 
7.1%
Latin 2050
 
4.7%
Han 42
 
0.1%
Hiragana 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
956
 
2.5%
805
 
2.1%
678
 
1.7%
672
 
1.7%
624
 
1.6%
564
 
1.5%
509
 
1.3%
486
 
1.3%
475
 
1.2%
414
 
1.1%
Other values (821) 32665
84.1%
Latin
ValueCountFrequency (%)
a 149
 
7.3%
e 149
 
7.3%
o 104
 
5.1%
A 82
 
4.0%
O 77
 
3.8%
B 75
 
3.7%
S 65
 
3.2%
n 64
 
3.1%
r 63
 
3.1%
i 60
 
2.9%
Other values (42) 1162
56.7%
Han
ValueCountFrequency (%)
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (25) 25
59.5%
Common
ValueCountFrequency (%)
901
28.8%
) 736
23.6%
( 735
23.5%
0 157
 
5.0%
8 82
 
2.6%
2 71
 
2.3%
7 67
 
2.1%
& 44
 
1.4%
. 42
 
1.3%
1 40
 
1.3%
Other values (20) 250
 
8.0%
Hiragana
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 38848
88.2%
ASCII 5161
 
11.7%
CJK 36
 
0.1%
None 10
 
< 0.1%
CJK Compat Ideographs 6
 
< 0.1%
Number Forms 4
 
< 0.1%
Hiragana 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
956
 
2.5%
805
 
2.1%
678
 
1.7%
672
 
1.7%
624
 
1.6%
564
 
1.5%
509
 
1.3%
486
 
1.3%
475
 
1.2%
414
 
1.1%
Other values (821) 32665
84.1%
ASCII
ValueCountFrequency (%)
901
17.5%
) 736
 
14.3%
( 735
 
14.2%
0 157
 
3.0%
a 149
 
2.9%
e 149
 
2.9%
o 104
 
2.0%
8 82
 
1.6%
A 82
 
1.6%
O 77
 
1.5%
Other values (68) 1989
38.5%
Number Forms
ValueCountFrequency (%)
4
100.0%
None
ValueCountFrequency (%)
4
40.0%
4
40.0%
2
20.0%
CJK
ValueCountFrequency (%)
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
1
 
2.8%
1
 
2.8%
1
 
2.8%
Other values (20) 20
55.6%
CJK Compat Ideographs
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Hiragana
ValueCountFrequency (%)
2
50.0%
2
50.0%

소재지도로명
Text

MISSING 

Distinct1933
Distinct (%)53.3%
Missing4600
Missing (%)55.9%
Memory size64.4 KiB
2024-04-30T00:23:18.377763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length57
Mean length31.001103
Min length22

Characters and Unicode

Total characters112441
Distinct characters299
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1134 ?
Unique (%)31.3%

Sample

1st row서울특별시 성북구 보문로 58-1, 한주빌딩 1층 (보문동7가)
2nd row서울특별시 성북구 한천로78길 43, (석관동)
3rd row서울특별시 성북구 동소문로 227, 65,66호 (길음동)
4th row서울특별시 성북구 고려대로26길 42-1, (안암동5가)
5th row서울특별시 성북구 동소문로 227, 65,66호 (길음동)
ValueCountFrequency (%)
서울특별시 3627
 
17.6%
성북구 3627
 
17.6%
1층 656
 
3.2%
장위동 358
 
1.7%
정릉동 301
 
1.5%
하월곡동 280
 
1.4%
동선동1가 258
 
1.3%
길음동 256
 
1.2%
안암동5가 237
 
1.1%
석관동 226
 
1.1%
Other values (1565) 10800
52.4%
2024-04-30T00:23:18.792862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17005
 
15.1%
5168
 
4.6%
1 4966
 
4.4%
, 4955
 
4.4%
4028
 
3.6%
4007
 
3.6%
( 3930
 
3.5%
) 3930
 
3.5%
3664
 
3.3%
3654
 
3.2%
Other values (289) 57134
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 64261
57.2%
Decimal Number 17425
 
15.5%
Space Separator 17005
 
15.1%
Other Punctuation 4984
 
4.4%
Open Punctuation 3930
 
3.5%
Close Punctuation 3930
 
3.5%
Dash Punctuation 681
 
0.6%
Uppercase Letter 138
 
0.1%
Lowercase Letter 56
 
< 0.1%
Math Symbol 25
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5168
 
8.0%
4028
 
6.3%
4007
 
6.2%
3664
 
5.7%
3654
 
5.7%
3641
 
5.7%
3628
 
5.6%
3627
 
5.6%
3627
 
5.6%
3604
 
5.6%
Other values (246) 25613
39.9%
Uppercase Letter
ValueCountFrequency (%)
B 82
59.4%
A 16
 
11.6%
S 10
 
7.2%
V 7
 
5.1%
K 6
 
4.3%
L 4
 
2.9%
W 3
 
2.2%
E 3
 
2.2%
I 3
 
2.2%
T 2
 
1.4%
Other values (2) 2
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
w 18
32.1%
r 10
17.9%
k 6
 
10.7%
e 5
 
8.9%
o 4
 
7.1%
u 4
 
7.1%
b 2
 
3.6%
a 2
 
3.6%
m 2
 
3.6%
d 1
 
1.8%
Other values (2) 2
 
3.6%
Decimal Number
ValueCountFrequency (%)
1 4966
28.5%
2 3007
17.3%
3 1659
 
9.5%
5 1538
 
8.8%
4 1362
 
7.8%
0 1299
 
7.5%
6 1038
 
6.0%
7 977
 
5.6%
8 832
 
4.8%
9 747
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 4955
99.4%
. 17
 
0.3%
@ 12
 
0.2%
Space Separator
ValueCountFrequency (%)
17005
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3930
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3930
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 681
100.0%
Math Symbol
ValueCountFrequency (%)
~ 25
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 64261
57.2%
Common 47986
42.7%
Latin 194
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5168
 
8.0%
4028
 
6.3%
4007
 
6.2%
3664
 
5.7%
3654
 
5.7%
3641
 
5.7%
3628
 
5.6%
3627
 
5.6%
3627
 
5.6%
3604
 
5.6%
Other values (246) 25613
39.9%
Latin
ValueCountFrequency (%)
B 82
42.3%
w 18
 
9.3%
A 16
 
8.2%
r 10
 
5.2%
S 10
 
5.2%
V 7
 
3.6%
K 6
 
3.1%
k 6
 
3.1%
e 5
 
2.6%
o 4
 
2.1%
Other values (14) 30
 
15.5%
Common
ValueCountFrequency (%)
17005
35.4%
1 4966
 
10.3%
, 4955
 
10.3%
( 3930
 
8.2%
) 3930
 
8.2%
2 3007
 
6.3%
3 1659
 
3.5%
5 1538
 
3.2%
4 1362
 
2.8%
0 1299
 
2.7%
Other values (9) 4335
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 64261
57.2%
ASCII 48174
42.8%
CJK Compat 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17005
35.3%
1 4966
 
10.3%
, 4955
 
10.3%
( 3930
 
8.2%
) 3930
 
8.2%
2 3007
 
6.2%
3 1659
 
3.4%
5 1538
 
3.2%
4 1362
 
2.8%
0 1299
 
2.7%
Other values (32) 4523
 
9.4%
Hangul
ValueCountFrequency (%)
5168
 
8.0%
4028
 
6.3%
4007
 
6.2%
3664
 
5.7%
3654
 
5.7%
3641
 
5.7%
3628
 
5.6%
3627
 
5.6%
3627
 
5.6%
3604
 
5.6%
Other values (246) 25613
39.9%
CJK Compat
ValueCountFrequency (%)
6
100.0%
Distinct3568
Distinct (%)43.4%
Missing4
Missing (%)< 0.1%
Memory size64.4 KiB
2024-04-30T00:23:19.071902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length62
Mean length27.928372
Min length21

Characters and Unicode

Total characters229655
Distinct characters333
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1873 ?
Unique (%)22.8%

Sample

1st row서울특별시 성북구 석관동 132번지 44호
2nd row서울특별시 성북구 보문동7가 22번지 5호 한주빌딩
3rd row서울특별시 성북구 석관동 127번지 73호
4th row서울특별시 성북구 길음동 535번지 8호 길음시장-65,66
5th row서울특별시 성북구 안암동5가 104번지 28호
ValueCountFrequency (%)
서울특별시 8223
 
19.0%
성북구 8223
 
19.0%
장위동 1037
 
2.4%
정릉동 872
 
2.0%
하월곡동 810
 
1.9%
1호 777
 
1.8%
길음동 755
 
1.7%
동선동1가 706
 
1.6%
석관동 672
 
1.6%
종암동 624
 
1.4%
Other values (1834) 20632
47.6%
2024-04-30T00:23:19.538577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56952
24.8%
1 10100
 
4.4%
9950
 
4.3%
9302
 
4.1%
8593
 
3.7%
8543
 
3.7%
8258
 
3.6%
8253
 
3.6%
8241
 
3.6%
8238
 
3.6%
Other values (323) 93225
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 129109
56.2%
Space Separator 56952
24.8%
Decimal Number 40664
 
17.7%
Close Punctuation 684
 
0.3%
Open Punctuation 684
 
0.3%
Other Punctuation 578
 
0.3%
Dash Punctuation 483
 
0.2%
Lowercase Letter 296
 
0.1%
Uppercase Letter 179
 
0.1%
Math Symbol 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9950
 
7.7%
9302
 
7.2%
8593
 
6.7%
8543
 
6.6%
8258
 
6.4%
8253
 
6.4%
8241
 
6.4%
8238
 
6.4%
8224
 
6.4%
8224
 
6.4%
Other values (262) 43283
33.5%
Lowercase Letter
ValueCountFrequency (%)
w 78
26.4%
e 25
 
8.4%
o 24
 
8.1%
r 23
 
7.8%
k 15
 
5.1%
c 15
 
5.1%
n 14
 
4.7%
m 14
 
4.7%
l 14
 
4.7%
t 13
 
4.4%
Other values (12) 61
20.6%
Uppercase Letter
ValueCountFrequency (%)
B 69
38.5%
A 31
17.3%
S 18
 
10.1%
K 9
 
5.0%
V 8
 
4.5%
L 7
 
3.9%
T 7
 
3.9%
C 6
 
3.4%
D 6
 
3.4%
G 5
 
2.8%
Other values (7) 13
 
7.3%
Decimal Number
ValueCountFrequency (%)
1 10100
24.8%
2 6190
15.2%
3 4610
11.3%
5 3583
 
8.8%
4 3447
 
8.5%
0 3288
 
8.1%
6 2531
 
6.2%
8 2518
 
6.2%
7 2456
 
6.0%
9 1941
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 508
87.9%
. 61
 
10.6%
/ 7
 
1.2%
@ 2
 
0.3%
Math Symbol
ValueCountFrequency (%)
~ 21
91.3%
< 1
 
4.3%
> 1
 
4.3%
Space Separator
ValueCountFrequency (%)
56952
100.0%
Close Punctuation
ValueCountFrequency (%)
) 684
100.0%
Open Punctuation
ValueCountFrequency (%)
( 684
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 483
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129109
56.2%
Common 100071
43.6%
Latin 475
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9950
 
7.7%
9302
 
7.2%
8593
 
6.7%
8543
 
6.6%
8258
 
6.4%
8253
 
6.4%
8241
 
6.4%
8238
 
6.4%
8224
 
6.4%
8224
 
6.4%
Other values (262) 43283
33.5%
Latin
ValueCountFrequency (%)
w 78
16.4%
B 69
14.5%
A 31
 
6.5%
e 25
 
5.3%
o 24
 
5.1%
r 23
 
4.8%
S 18
 
3.8%
k 15
 
3.2%
c 15
 
3.2%
n 14
 
2.9%
Other values (29) 163
34.3%
Common
ValueCountFrequency (%)
56952
56.9%
1 10100
 
10.1%
2 6190
 
6.2%
3 4610
 
4.6%
5 3583
 
3.6%
4 3447
 
3.4%
0 3288
 
3.3%
6 2531
 
2.5%
8 2518
 
2.5%
7 2456
 
2.5%
Other values (12) 4396
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129109
56.2%
ASCII 100543
43.8%
CJK Compat 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
56952
56.6%
1 10100
 
10.0%
2 6190
 
6.2%
3 4610
 
4.6%
5 3583
 
3.6%
4 3447
 
3.4%
0 3288
 
3.3%
6 2531
 
2.5%
8 2518
 
2.5%
7 2456
 
2.4%
Other values (50) 4868
 
4.8%
Hangul
ValueCountFrequency (%)
9950
 
7.7%
9302
 
7.2%
8593
 
6.7%
8543
 
6.6%
8258
 
6.4%
8253
 
6.4%
8241
 
6.4%
8238
 
6.4%
8224
 
6.4%
8224
 
6.4%
Other values (262) 43283
33.5%
CJK Compat
ValueCountFrequency (%)
3
100.0%

지도점검일자
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct3000
Distinct (%)36.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20117633
Minimum1110421
Maximum20240313
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size72.4 KiB
2024-04-30T00:23:19.689949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1110421
5-th percentile20030217
Q120070522
median20110127
Q320171024
95-th percentile20231017
Maximum20240313
Range19129892
Interquartile range (IQR)100502

Descriptive statistics

Standard deviation219511.72
Coefficient of variation (CV)0.010911409
Kurtosis6835.6719
Mean20117633
Median Absolute Deviation (MAD)50185
Skewness-78.930129
Sum1.6550777 × 1011
Variance4.8185395 × 1010
MonotonicityNot monotonic
2024-04-30T00:23:19.848147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20231110 85
 
1.0%
20071031 84
 
1.0%
20230527 75
 
0.9%
20231204 74
 
0.9%
20200423 66
 
0.8%
20230605 66
 
0.8%
20230608 54
 
0.7%
20080227 49
 
0.6%
20080220 45
 
0.5%
20081219 45
 
0.5%
Other values (2990) 7584
92.2%
ValueCountFrequency (%)
1110421 1
 
< 0.1%
20000925 4
< 0.1%
20001109 1
 
< 0.1%
20001120 4
< 0.1%
20001201 2
< 0.1%
20010119 1
 
< 0.1%
20010130 2
< 0.1%
20010222 1
 
< 0.1%
20010223 1
 
< 0.1%
20010308 1
 
< 0.1%
ValueCountFrequency (%)
20240313 1
 
< 0.1%
20240207 3
< 0.1%
20240131 4
< 0.1%
20240130 1
 
< 0.1%
20240126 1
 
< 0.1%
20240115 1
 
< 0.1%
20240112 1
 
< 0.1%
20240104 1
 
< 0.1%
20231230 1
 
< 0.1%
20231228 1
 
< 0.1%

행정처분상태
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
처분확정
8227 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row처분확정
2nd row처분확정
3rd row처분확정
4th row처분확정
5th row처분확정

Common Values

ValueCountFrequency (%)
처분확정 8227
100.0%

Length

2024-04-30T00:23:19.988107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T00:23:20.100578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
처분확정 8227
100.0%
Distinct596
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
2024-04-30T00:23:20.315922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length69
Mean length6.7350188
Min length2

Characters and Unicode

Total characters55409
Distinct characters209
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique338 ?
Unique (%)4.1%

Sample

1st row영업소폐쇄
2nd row과태료부과
3rd row영업정지
4th row과태료부과
5th row과태료부과
ValueCountFrequency (%)
과태료부과 2553
24.7%
시정명령 1223
11.8%
영업소폐쇄 1151
 
11.1%
영업정지 1094
 
10.6%
시설개수명령 338
 
3.3%
과태료 279
 
2.7%
부과 276
 
2.7%
과징금부과 240
 
2.3%
136
 
1.3%
시정명령(즉시 125
 
1.2%
Other values (663) 2935
28.4%
2024-04-30T00:23:20.733376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7404
 
13.4%
3713
 
6.7%
3286
 
5.9%
3276
 
5.9%
2989
 
5.4%
2801
 
5.1%
2776
 
5.0%
2127
 
3.8%
1987
 
3.6%
1796
 
3.2%
Other values (199) 23254
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46007
83.0%
Decimal Number 5054
 
9.1%
Space Separator 2127
 
3.8%
Other Punctuation 714
 
1.3%
Open Punctuation 705
 
1.3%
Close Punctuation 702
 
1.3%
Math Symbol 76
 
0.1%
Dash Punctuation 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7404
16.1%
3713
 
8.1%
3286
 
7.1%
3276
 
7.1%
2989
 
6.5%
2801
 
6.1%
2776
 
6.0%
1987
 
4.3%
1796
 
3.9%
1795
 
3.9%
Other values (169) 14184
30.8%
Decimal Number
ValueCountFrequency (%)
0 1493
29.5%
1 1023
20.2%
2 1002
19.8%
5 313
 
6.2%
3 284
 
5.6%
4 265
 
5.2%
6 256
 
5.1%
8 192
 
3.8%
7 153
 
3.0%
9 73
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 533
74.6%
, 123
 
17.2%
% 31
 
4.3%
/ 17
 
2.4%
: 5
 
0.7%
? 4
 
0.6%
* 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 58
76.3%
12
 
15.8%
3
 
3.9%
+ 2
 
2.6%
= 1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 703
99.7%
[ 1
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 700
99.7%
1
 
0.1%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
2127
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46007
83.0%
Common 9402
 
17.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7404
16.1%
3713
 
8.1%
3286
 
7.1%
3276
 
7.1%
2989
 
6.5%
2801
 
6.1%
2776
 
6.0%
1987
 
4.3%
1796
 
3.9%
1795
 
3.9%
Other values (169) 14184
30.8%
Common
ValueCountFrequency (%)
2127
22.6%
0 1493
15.9%
1 1023
10.9%
2 1002
10.7%
( 703
 
7.5%
) 700
 
7.4%
. 533
 
5.7%
5 313
 
3.3%
3 284
 
3.0%
4 265
 
2.8%
Other values (20) 959
10.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45995
83.0%
ASCII 9385
 
16.9%
Arrows 12
 
< 0.1%
Compat Jamo 12
 
< 0.1%
Geometric Shapes 3
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7404
16.1%
3713
 
8.1%
3286
 
7.1%
3276
 
7.1%
2989
 
6.5%
2801
 
6.1%
2776
 
6.0%
1987
 
4.3%
1796
 
3.9%
1795
 
3.9%
Other values (168) 14172
30.8%
ASCII
ValueCountFrequency (%)
2127
22.7%
0 1493
15.9%
1 1023
10.9%
2 1002
10.7%
( 703
 
7.5%
) 700
 
7.5%
. 533
 
5.7%
5 313
 
3.3%
3 284
 
3.0%
4 265
 
2.8%
Other values (16) 942
10.0%
Arrows
ValueCountFrequency (%)
12
100.0%
Compat Jamo
ValueCountFrequency (%)
12
100.0%
Geometric Shapes
ValueCountFrequency (%)
3
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct805
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
2024-04-30T00:23:20.996173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length46
Mean length13.15911
Min length2

Characters and Unicode

Total characters108260
Distinct characters185
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique381 ?
Unique (%)4.6%

Sample

1st row식품위생법제22조제5항 및 동법제58조
2nd row법 제82조제2항
3rd row법 제75조
4th row법 제101조제4항1호
5th row법 제101조제4항1호
ValueCountFrequency (%)
3673
18.0%
식품위생법 2975
 
14.5%
1379
 
6.7%
제75조 981
 
4.8%
제101조제4항1호 797
 
3.9%
제71조 708
 
3.5%
식품위생법제26조 545
 
2.7%
제101조제2항제1호 515
 
2.5%
식품위생법제31조 333
 
1.6%
제37조 303
 
1.5%
Other values (600) 8240
40.3%
2024-04-30T00:23:21.447383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14296
13.2%
12240
11.3%
10180
 
9.4%
9730
 
9.0%
1 8434
 
7.8%
5527
 
5.1%
5520
 
5.1%
5427
 
5.0%
5371
 
5.0%
2 4669
 
4.3%
Other values (175) 26866
24.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67401
62.3%
Decimal Number 27575
25.5%
Space Separator 12240
 
11.3%
Other Punctuation 747
 
0.7%
Close Punctuation 146
 
0.1%
Open Punctuation 143
 
0.1%
Dash Punctuation 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14296
21.2%
10180
15.1%
9730
14.4%
5527
 
8.2%
5520
 
8.2%
5427
 
8.1%
5371
 
8.0%
4040
 
6.0%
1757
 
2.6%
1497
 
2.2%
Other values (154) 4056
 
6.0%
Decimal Number
ValueCountFrequency (%)
1 8434
30.6%
2 4669
16.9%
7 3345
 
12.1%
3 2430
 
8.8%
4 2368
 
8.6%
0 2130
 
7.7%
5 2091
 
7.6%
6 1373
 
5.0%
8 560
 
2.0%
9 175
 
0.6%
Other Punctuation
ValueCountFrequency (%)
, 739
98.9%
. 3
 
0.4%
: 2
 
0.3%
? 2
 
0.3%
/ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 143
97.9%
] 3
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 140
97.9%
[ 3
 
2.1%
Space Separator
ValueCountFrequency (%)
12240
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67401
62.3%
Common 40859
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14296
21.2%
10180
15.1%
9730
14.4%
5527
 
8.2%
5520
 
8.2%
5427
 
8.1%
5371
 
8.0%
4040
 
6.0%
1757
 
2.6%
1497
 
2.2%
Other values (154) 4056
 
6.0%
Common
ValueCountFrequency (%)
12240
30.0%
1 8434
20.6%
2 4669
 
11.4%
7 3345
 
8.2%
3 2430
 
5.9%
4 2368
 
5.8%
0 2130
 
5.2%
5 2091
 
5.1%
6 1373
 
3.4%
, 739
 
1.8%
Other values (11) 1040
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67399
62.3%
ASCII 40859
37.7%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14296
21.2%
10180
15.1%
9730
14.4%
5527
 
8.2%
5520
 
8.2%
5427
 
8.1%
5371
 
8.0%
4040
 
6.0%
1757
 
2.6%
1497
 
2.2%
Other values (152) 4054
 
6.0%
ASCII
ValueCountFrequency (%)
12240
30.0%
1 8434
20.6%
2 4669
 
11.4%
7 3345
 
8.2%
3 2430
 
5.9%
4 2368
 
5.8%
0 2130
 
5.2%
5 2091
 
5.1%
6 1373
 
3.4%
, 739
 
1.8%
Other values (11) 1040
 
2.5%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

위반일자
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct3052
Distinct (%)37.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20114270
Minimum2001112
Maximum20240305
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size72.4 KiB
2024-04-30T00:23:21.911324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2001112
5-th percentile20030214
Q120070522
median20110118
Q320170829
95-th percentile20230602
Maximum20240305
Range18239193
Interquartile range (IQR)100307

Descriptive statistics

Standard deviation289290.44
Coefficient of variation (CV)0.014382348
Kurtosis3719.9081
Mean20114270
Median Absolute Deviation (MAD)50107
Skewness-59.49117
Sum1.654801 × 1011
Variance8.3688957 × 1010
MonotonicityNot monotonic
2024-04-30T00:23:22.070934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20220301 153
 
1.9%
20210401 149
 
1.8%
20071031 138
 
1.7%
20231110 85
 
1.0%
20200101 81
 
1.0%
20231204 73
 
0.9%
20230527 69
 
0.8%
20230605 61
 
0.7%
20080227 49
 
0.6%
20050419 45
 
0.5%
Other values (3042) 7324
89.0%
ValueCountFrequency (%)
2001112 1
 
< 0.1%
2041123 1
 
< 0.1%
20000531 1
 
< 0.1%
20000925 4
< 0.1%
20001109 1
 
< 0.1%
20001201 2
< 0.1%
20010119 1
 
< 0.1%
20010130 2
< 0.1%
20010222 1
 
< 0.1%
20010223 1
 
< 0.1%
ValueCountFrequency (%)
20240305 1
 
< 0.1%
20240207 1
 
< 0.1%
20240131 4
< 0.1%
20240130 1
 
< 0.1%
20240118 2
< 0.1%
20240112 1
 
< 0.1%
20231230 1
 
< 0.1%
20231229 1
 
< 0.1%
20231224 1
 
< 0.1%
20231218 2
< 0.1%
Distinct3114
Distinct (%)37.9%
Missing7
Missing (%)0.1%
Memory size64.4 KiB
2024-04-30T00:23:22.369968image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length383
Median length165
Mean length23.119951
Min length1

Characters and Unicode

Total characters190046
Distinct characters744
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks11 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1991 ?
Unique (%)24.2%

Sample

1st row폐업신고 미이행(시설물 멸실)
2nd row재난배상책임보험을 기한(2024.3.5.)내에 가입하지 아니함(가입일 : 2024.3.6.)
3rd row2023.12.30 20:42경 청소년에게 주류 판매
4th row식품위생법 위반(2021년 위생교육 미수료)
5th row식품위생법 위반(2021년 위생교육 미수료)
ValueCountFrequency (%)
위생교육 831
 
2.2%
미수료 761
 
2.0%
건강진단 590
 
1.6%
586
 
1.6%
기존영업자 479
 
1.3%
472
 
1.3%
받지 422
 
1.1%
382
 
1.0%
폐업신고 347
 
0.9%
아니함 334
 
0.9%
Other values (5379) 31999
86.0%
2024-04-30T00:23:22.866354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30024
 
15.8%
5395
 
2.8%
2 4632
 
2.4%
1 4266
 
2.2%
0 4046
 
2.1%
4010
 
2.1%
3580
 
1.9%
3579
 
1.9%
) 3473
 
1.8%
( 3464
 
1.8%
Other values (734) 123577
65.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 129028
67.9%
Space Separator 30024
 
15.8%
Decimal Number 17050
 
9.0%
Other Punctuation 5811
 
3.1%
Close Punctuation 3506
 
1.8%
Open Punctuation 3497
 
1.8%
Dash Punctuation 698
 
0.4%
Lowercase Letter 183
 
0.1%
Uppercase Letter 89
 
< 0.1%
Other Symbol 46
 
< 0.1%
Other values (4) 114
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5395
 
4.2%
4010
 
3.1%
3580
 
2.8%
3579
 
2.8%
2659
 
2.1%
2538
 
2.0%
2231
 
1.7%
2223
 
1.7%
2140
 
1.7%
2071
 
1.6%
Other values (644) 98602
76.4%
Lowercase Letter
ValueCountFrequency (%)
g 56
30.6%
m 16
 
8.7%
k 13
 
7.1%
c 11
 
6.0%
a 11
 
6.0%
r 9
 
4.9%
o 9
 
4.9%
e 7
 
3.8%
l 6
 
3.3%
n 6
 
3.3%
Other values (12) 39
21.3%
Uppercase Letter
ValueCountFrequency (%)
O 16
18.0%
G 13
14.6%
L 7
 
7.9%
A 6
 
6.7%
E 6
 
6.7%
J 5
 
5.6%
B 5
 
5.6%
D 5
 
5.6%
P 4
 
4.5%
H 4
 
4.5%
Other values (8) 18
20.2%
Other Punctuation
ValueCountFrequency (%)
. 3300
56.8%
, 891
 
15.3%
: 608
 
10.5%
* 507
 
8.7%
/ 413
 
7.1%
% 35
 
0.6%
? 25
 
0.4%
11
 
0.2%
9
 
0.2%
; 7
 
0.1%
Other values (3) 5
 
0.1%
Decimal Number
ValueCountFrequency (%)
2 4632
27.2%
1 4266
25.0%
0 4046
23.7%
6 792
 
4.6%
3 785
 
4.6%
5 551
 
3.2%
4 547
 
3.2%
7 500
 
2.9%
9 484
 
2.8%
8 447
 
2.6%
Other Symbol
ValueCountFrequency (%)
7
15.2%
7
15.2%
7
15.2%
6
13.0%
6
13.0%
6
13.0%
3
6.5%
3
6.5%
1
 
2.2%
Close Punctuation
ValueCountFrequency (%)
) 3473
99.1%
] 29
 
0.8%
4
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 3464
99.1%
[ 29
 
0.8%
4
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 33
80.5%
+ 6
 
14.6%
2
 
4.9%
Other Number
ValueCountFrequency (%)
18
48.6%
15
40.5%
4
 
10.8%
Initial Punctuation
ValueCountFrequency (%)
11
61.1%
7
38.9%
Final Punctuation
ValueCountFrequency (%)
11
61.1%
7
38.9%
Space Separator
ValueCountFrequency (%)
30024
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 698
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 129049
67.9%
Common 60725
32.0%
Latin 272
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5395
 
4.2%
4010
 
3.1%
3580
 
2.8%
3579
 
2.8%
2659
 
2.1%
2538
 
2.0%
2231
 
1.7%
2223
 
1.7%
2140
 
1.7%
2071
 
1.6%
Other values (647) 98623
76.4%
Common
ValueCountFrequency (%)
30024
49.4%
2 4632
 
7.6%
1 4266
 
7.0%
0 4046
 
6.7%
) 3473
 
5.7%
( 3464
 
5.7%
. 3300
 
5.4%
, 891
 
1.5%
6 792
 
1.3%
3 785
 
1.3%
Other values (37) 5052
 
8.3%
Latin
ValueCountFrequency (%)
g 56
20.6%
O 16
 
5.9%
m 16
 
5.9%
G 13
 
4.8%
k 13
 
4.8%
c 11
 
4.0%
a 11
 
4.0%
r 9
 
3.3%
o 9
 
3.3%
e 7
 
2.6%
Other values (30) 111
40.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 129009
67.9%
ASCII 60868
32.0%
Punctuation 45
 
< 0.1%
None 41
 
< 0.1%
Enclosed Alphanum 37
 
< 0.1%
Compat Jamo 19
 
< 0.1%
Geometric Shapes 15
 
< 0.1%
CJK Compat 6
 
< 0.1%
Letterlike Symbols 3
 
< 0.1%
Arrows 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30024
49.3%
2 4632
 
7.6%
1 4266
 
7.0%
0 4046
 
6.6%
) 3473
 
5.7%
( 3464
 
5.7%
. 3300
 
5.4%
, 891
 
1.5%
6 792
 
1.3%
3 785
 
1.3%
Other values (58) 5195
 
8.5%
Hangul
ValueCountFrequency (%)
5395
 
4.2%
4010
 
3.1%
3580
 
2.8%
3579
 
2.8%
2659
 
2.1%
2538
 
2.0%
2231
 
1.7%
2223
 
1.7%
2140
 
1.7%
2071
 
1.6%
Other values (637) 98583
76.4%
Enclosed Alphanum
ValueCountFrequency (%)
18
48.6%
15
40.5%
4
 
10.8%
None
ValueCountFrequency (%)
11
26.8%
7
17.1%
7
17.1%
7
17.1%
4
 
9.8%
4
 
9.8%
1
 
2.4%
Punctuation
ValueCountFrequency (%)
11
24.4%
11
24.4%
9
20.0%
7
15.6%
7
15.6%
CJK Compat
ValueCountFrequency (%)
6
100.0%
Compat Jamo
ValueCountFrequency (%)
6
31.6%
4
21.1%
2
 
10.5%
2
 
10.5%
2
 
10.5%
2
 
10.5%
1
 
5.3%
Geometric Shapes
ValueCountFrequency (%)
6
40.0%
6
40.0%
3
20.0%
Letterlike Symbols
ValueCountFrequency (%)
3
100.0%
Arrows
ValueCountFrequency (%)
2
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
Distinct596
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
2024-04-30T00:23:23.142326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length69
Mean length6.7350188
Min length2

Characters and Unicode

Total characters55409
Distinct characters209
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique338 ?
Unique (%)4.1%

Sample

1st row영업소폐쇄
2nd row과태료부과
3rd row영업정지
4th row과태료부과
5th row과태료부과
ValueCountFrequency (%)
과태료부과 2553
24.7%
시정명령 1223
11.8%
영업소폐쇄 1151
 
11.1%
영업정지 1094
 
10.6%
시설개수명령 338
 
3.3%
과태료 279
 
2.7%
부과 276
 
2.7%
과징금부과 240
 
2.3%
136
 
1.3%
시정명령(즉시 125
 
1.2%
Other values (663) 2935
28.4%
2024-04-30T00:23:23.769490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7404
 
13.4%
3713
 
6.7%
3286
 
5.9%
3276
 
5.9%
2989
 
5.4%
2801
 
5.1%
2776
 
5.0%
2127
 
3.8%
1987
 
3.6%
1796
 
3.2%
Other values (199) 23254
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46007
83.0%
Decimal Number 5054
 
9.1%
Space Separator 2127
 
3.8%
Other Punctuation 714
 
1.3%
Open Punctuation 705
 
1.3%
Close Punctuation 702
 
1.3%
Math Symbol 76
 
0.1%
Dash Punctuation 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7404
16.1%
3713
 
8.1%
3286
 
7.1%
3276
 
7.1%
2989
 
6.5%
2801
 
6.1%
2776
 
6.0%
1987
 
4.3%
1796
 
3.9%
1795
 
3.9%
Other values (169) 14184
30.8%
Decimal Number
ValueCountFrequency (%)
0 1493
29.5%
1 1023
20.2%
2 1002
19.8%
5 313
 
6.2%
3 284
 
5.6%
4 265
 
5.2%
6 256
 
5.1%
8 192
 
3.8%
7 153
 
3.0%
9 73
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 533
74.6%
, 123
 
17.2%
% 31
 
4.3%
/ 17
 
2.4%
: 5
 
0.7%
? 4
 
0.6%
* 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 58
76.3%
12
 
15.8%
3
 
3.9%
+ 2
 
2.6%
= 1
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 703
99.7%
[ 1
 
0.1%
1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 700
99.7%
1
 
0.1%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
2127
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46007
83.0%
Common 9402
 
17.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7404
16.1%
3713
 
8.1%
3286
 
7.1%
3276
 
7.1%
2989
 
6.5%
2801
 
6.1%
2776
 
6.0%
1987
 
4.3%
1796
 
3.9%
1795
 
3.9%
Other values (169) 14184
30.8%
Common
ValueCountFrequency (%)
2127
22.6%
0 1493
15.9%
1 1023
10.9%
2 1002
10.7%
( 703
 
7.5%
) 700
 
7.4%
. 533
 
5.7%
5 313
 
3.3%
3 284
 
3.0%
4 265
 
2.8%
Other values (20) 959
10.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45995
83.0%
ASCII 9385
 
16.9%
Arrows 12
 
< 0.1%
Compat Jamo 12
 
< 0.1%
Geometric Shapes 3
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7404
16.1%
3713
 
8.1%
3286
 
7.1%
3276
 
7.1%
2989
 
6.5%
2801
 
6.1%
2776
 
6.0%
1987
 
4.3%
1796
 
3.9%
1795
 
3.9%
Other values (168) 14172
30.8%
ASCII
ValueCountFrequency (%)
2127
22.7%
0 1493
15.9%
1 1023
10.9%
2 1002
10.7%
( 703
 
7.5%
) 700
 
7.5%
. 533
 
5.7%
5 313
 
3.3%
3 284
 
3.0%
4 265
 
2.8%
Other values (16) 942
10.0%
Arrows
ValueCountFrequency (%)
12
100.0%
Compat Jamo
ValueCountFrequency (%)
12
100.0%
Geometric Shapes
ValueCountFrequency (%)
3
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

처분기간
Real number (ℝ)

MISSING 

Distinct24
Distinct (%)3.5%
Missing7543
Missing (%)91.7%
Infinite0
Infinite (%)0.0%
Mean11.942982
Minimum0
Maximum30
Zeros24
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size72.4 KiB
2024-04-30T00:23:23.944276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4
Q17
median15
Q315
95-th percentile20
Maximum30
Range30
Interquartile range (IQR)8

Descriptive statistics

Standard deviation5.4222545
Coefficient of variation (CV)0.45401176
Kurtosis0.19289994
Mean11.942982
Median Absolute Deviation (MAD)5
Skewness0.041892034
Sum8169
Variance29.400844
MonotonicityNot monotonic
2024-04-30T00:23:24.080706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
15 306
 
3.7%
7 163
 
2.0%
10 44
 
0.5%
5 26
 
0.3%
17 25
 
0.3%
20 25
 
0.3%
0 24
 
0.3%
8 12
 
0.1%
4 9
 
0.1%
23 7
 
0.1%
Other values (14) 43
 
0.5%
(Missing) 7543
91.7%
ValueCountFrequency (%)
0 24
 
0.3%
1 2
 
< 0.1%
2 1
 
< 0.1%
3 5
 
0.1%
4 9
 
0.1%
5 26
 
0.3%
6 6
 
0.1%
7 163
2.0%
8 12
 
0.1%
9 4
 
< 0.1%
ValueCountFrequency (%)
30 5
 
0.1%
29 1
 
< 0.1%
28 1
 
< 0.1%
27 2
 
< 0.1%
23 7
 
0.1%
22 6
 
0.1%
20 25
0.3%
19 3
 
< 0.1%
18 4
 
< 0.1%
17 25
0.3%

영업장면적(㎡)
Real number (ℝ)

MISSING 

Distinct891
Distinct (%)34.6%
Missing5655
Missing (%)68.7%
Infinite0
Infinite (%)0.0%
Mean97.319732
Minimum0
Maximum1490.18
Zeros5
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size72.4 KiB
2024-04-30T00:23:24.347953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile15.26
Q129.7875
median59.4
Q3106.23
95-th percentile297
Maximum1490.18
Range1490.18
Interquartile range (IQR)76.4425

Descriptive statistics

Standard deviation137.8145
Coefficient of variation (CV)1.4161003
Kurtosis34.063973
Mean97.319732
Median Absolute Deviation (MAD)33
Skewness4.9728994
Sum250306.35
Variance18992.836
MonotonicityNot monotonic
2024-04-30T00:23:24.638006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
26.4 71
 
0.9%
23.1 69
 
0.8%
19.8 49
 
0.6%
33.0 46
 
0.6%
49.5 42
 
0.5%
16.5 40
 
0.5%
82.5 36
 
0.4%
59.4 36
 
0.4%
13.2 35
 
0.4%
39.6 32
 
0.4%
Other values (881) 2116
 
25.7%
(Missing) 5655
68.7%
ValueCountFrequency (%)
0.0 5
0.1%
3.6 1
 
< 0.1%
4.95 6
0.1%
6.0 2
 
< 0.1%
6.6 9
0.1%
7.0 1
 
< 0.1%
8.25 3
 
< 0.1%
8.5 1
 
< 0.1%
9.12 2
 
< 0.1%
9.25 1
 
< 0.1%
ValueCountFrequency (%)
1490.18 2
< 0.1%
1440.23 2
< 0.1%
1390.09 2
< 0.1%
1320.0 1
 
< 0.1%
1047.0 4
< 0.1%
951.31 4
< 0.1%
921.94 4
< 0.1%
905.32 2
< 0.1%
879.57 1
 
< 0.1%
860.0 1
 
< 0.1%

운영형태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size64.4 KiB
<NA>
8146 
직영
 
69
(조합)위탁
 
12

Length

Max length6
Median length4
Mean length3.9861432
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8146
99.0%
직영 69
 
0.8%
(조합)위탁 12
 
0.1%

Length

2024-04-30T00:23:24.803292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T00:23:24.990551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8146
99.0%
직영 69
 
0.8%
조합)위탁 12
 
0.1%

Interactions

2024-04-30T00:23:14.239202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.110772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.804947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.382118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.026879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.649345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:14.357225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.282348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.912578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.491414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.144568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.732517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:14.458490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.391350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.002339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.593429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.256064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.835761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:14.552773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.501107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.099158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.704453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.362358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.932970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:14.932191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.609332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.193476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.813305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.472620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:14.030560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:15.058614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:11.697373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.285038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:12.923166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:13.556884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T00:23:14.136081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T00:23:25.157391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처분일자교부번호업종명업태명지도점검일자위반일자처분기간영업장면적(㎡)운영형태
처분일자1.0000.5600.3400.481NaN0.0000.5060.1030.148
교부번호0.5601.0000.5330.658NaN0.0000.4620.1940.646
업종명0.3400.5331.0001.000NaN0.0000.5720.633NaN
업태명0.4810.6581.0001.000NaN0.0000.6460.7540.574
지도점검일자NaNNaNNaNNaN1.000NaNNaNNaNNaN
위반일자0.0000.0000.0000.000NaN1.000NaNNaNNaN
처분기간0.5060.4620.5720.646NaNNaN1.0000.463NaN
영업장면적(㎡)0.1030.1940.6330.754NaNNaN0.4631.0000.000
운영형태0.1480.646NaN0.574NaNNaNNaN0.0001.000
2024-04-30T00:23:25.518135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명운영형태
업종명1.0001.000
운영형태1.0001.000
2024-04-30T00:23:25.659557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처분일자교부번호지도점검일자위반일자처분기간영업장면적(㎡)업종명운영형태
처분일자1.0000.5160.9990.9950.019-0.0260.1660.242
교부번호0.5161.0000.5170.519-0.077-0.0690.1960.445
지도점검일자0.9990.5171.0000.9950.013-0.0250.0001.000
위반일자0.9950.5190.9951.0000.015-0.0200.0001.000
처분기간0.019-0.0770.0130.0151.000-0.0860.3140.000
영업장면적(㎡)-0.026-0.069-0.025-0.020-0.0861.0000.3080.000
업종명0.1660.1960.0000.0000.3140.3081.0001.000
운영형태0.2420.4451.0001.0000.0000.0001.0001.000

Missing values

2024-04-30T00:23:15.302105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T00:23:15.665670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T00:23:15.882539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군구코드처분일자교부번호업종명업태명업소명소재지도로명소재지지번지도점검일자행정처분상태처분명법적근거위반일자위반내용처분내용처분기간영업장면적(㎡)운영형태
030700002080031720040051251일반음식점탕류(보신용)소머리국밥집<NA>서울특별시 성북구 석관동 132번지 44호20080220처분확정영업소폐쇄식품위생법제22조제5항 및 동법제58조20071031폐업신고 미이행(시설물 멸실)영업소폐쇄<NA>37.95<NA>
130700002024040420230067365일반음식점한식큰대쪽갈비서울특별시 성북구 보문로 58-1, 한주빌딩 1층 (보문동7가)서울특별시 성북구 보문동7가 22번지 5호 한주빌딩20240313처분확정과태료부과법 제82조제2항20240305재난배상책임보험을 기한(2024.3.5.)내에 가입하지 아니함(가입일 : 2024.3.6.)과태료부과<NA><NA><NA>
230700002024031920020050596일반음식점정종/대포집/소주방펍피맥서울특별시 성북구 한천로78길 43, (석관동)서울특별시 성북구 석관동 127번지 73호20231230처분확정영업정지법 제75조202312302023.12.30 20:42경 청소년에게 주류 판매영업정지<NA><NA><NA>
330700002024031420030050506일반음식점분식한갈비탕서울특별시 성북구 동소문로 227, 65,66호 (길음동)서울특별시 성북구 길음동 535번지 8호 길음시장-65,6620231201처분확정과태료부과법 제101조제4항1호20231201식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA><NA><NA>
430700002024031419980050312일반음식점호프/통닭청춘퓨전포차서울특별시 성북구 고려대로26길 42-1, (안암동5가)서울특별시 성북구 안암동5가 104번지 28호20231128처분확정과태료부과법 제101조제4항1호20231128식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA><NA><NA>
530700002024031420030050506일반음식점분식한갈비탕서울특별시 성북구 동소문로 227, 65,66호 (길음동)서울특별시 성북구 길음동 535번지 8호 길음시장-65,6620231201처분확정과태료부과법 제101조제4항1호20231201식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA>39.6<NA>
630700002024031419950050925일반음식점호프/통닭호우양꼬치서울특별시 성북구 동소문로20다길 11, (동선동1가)서울특별시 성북구 동선동1가 3번지 6호20231127처분확정과태료부과법 제101조제4항1호20231127식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA><NA><NA>
730700002024031419930050505일반음식점호프/통닭오술로서울특별시 성북구 동소문로6길 4-11, (동소문동2가)서울특별시 성북구 동소문동2가 39번지20231129처분확정과태료부과법 제101조제4항1호20231129식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA><NA><NA>
830700002024031420030050506일반음식점분식한갈비탕서울특별시 성북구 동소문로 227, 65,66호 (길음동)서울특별시 성북구 길음동 535번지 8호 길음시장-65,6620231201처분확정과태료부과법 제101조제4항1호20231201식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA><NA><NA>
930700002024031419990050715일반음식점분식손칼국수서울특별시 성북구 동소문로40길 2, (하월곡동)서울특별시 성북구 하월곡동 104번지 85호20231128처분확정과태료부과법 제101조제4항1호20231128식품위생법 위반(2021년 위생교육 미수료)과태료부과<NA><NA><NA>
시군구코드처분일자교부번호업종명업태명업소명소재지도로명소재지지번지도점검일자행정처분상태처분명법적근거위반일자위반내용처분내용처분기간영업장면적(㎡)운영형태
821730700002001022620000050330식품제조가공업식품제조가공업오향식품<NA>서울특별시 성북구 성북동 107번지 1호20010119처분확정품목제조정지1월식품위생법제10조20010119일괄표시사항누락(양념소스)품목제조정지1월<NA><NA><NA>
821830700002001021720000050458기타식품판매업기타식품판매업참조은마트<NA>서울특별시 성북구 돈암동 624번지 0호 현대상가지하2층20001201처분확정영업정지7일 갈음 과징금 371만원식품위생법제31조20001201유통기한 경과제품 진열판매영업정지7일 갈음 과징금 371만원7452.0<NA>
821930700002001021720000050458기타식품판매업기타식품판매업참조은마트<NA>서울특별시 성북구 돈암동 624번지 0호 현대상가지하2층20001201처분확정영업정지7일 갈음 과징금 371만원식품위생법제31조20001201유통기한 경과제품 진열판매영업정지7일 갈음 과징금 371만원7<NA><NA>
822030700002001020619950050320식품제조가공업식품제조가공업수정식품<NA>서울특별시 성북구 종암동 9번지 54호20010130처분확정시정명령(2001.2.21까지)식품위생법제10조20010130제품명표시기준위반(동부콩원료의 묵을 청포묵으로 표기)시정명령(2001.2.21까지)<NA><NA><NA>
822130700002001020619950050320식품제조가공업식품제조가공업수정식품<NA>서울특별시 성북구 종암동 9번지 54호20010130처분확정시정명령(2001.2.21까지)식품위생법제10조20010130제품명표시기준위반(동부콩원료의 묵을 청포묵으로 표기)시정명령(2001.2.21까지)<NA><NA><NA>
822230700002000120819980050952식품제조가공업식품제조가공업산촌한과<NA>서울특별시 성북구 성북동 131번지 51호20001109처분확정품목제조정지15일, 시정명령(즉시)식품위생법제10조20001109유통기한 미표시 및 제품명 불분명품목제조정지15일, 시정명령(즉시)15<NA><NA>
822330700002000110619950050320식품제조가공업식품제조가공업수정식품<NA>서울특별시 성북구 종암동 9번지 54호20000925처분확정품목제조정지1월갈음 과징금 360만원식품위생법제19조20000925자가품질검사 미실시(청포묵, 도토리묵)품목제조정지1월갈음 과징금 360만원<NA><NA><NA>
822430700002000110619950050320식품제조가공업식품제조가공업수정식품<NA>서울특별시 성북구 종암동 9번지 54호20000925처분확정건강진단미필(2/4),과태료50만원식품위생법 제26조20000925건강진단미필(2/4)건강진단미필(2/4),과태료50만원<NA><NA><NA>
822530700002000110619950050320식품제조가공업식품제조가공업수정식품<NA>서울특별시 성북구 종암동 9번지 54호20000925처분확정품목제조정지1월갈음 과징금 360만원식품위생법제19조20000925자가품질검사 미실시(청포묵, 도토리묵)품목제조정지1월갈음 과징금 360만원<NA><NA><NA>
822630700002000110619950050320식품제조가공업식품제조가공업수정식품<NA>서울특별시 성북구 종암동 9번지 54호20000925처분확정건강진단미필(2/4),과태료50만원식품위생법 제26조20000925건강진단미필(2/4)건강진단미필(2/4),과태료50만원<NA><NA><NA>

Duplicate rows

Most frequently occurring

시군구코드처분일자교부번호업종명업태명업소명소재지도로명소재지지번지도점검일자행정처분상태처분명법적근거위반일자위반내용처분내용처분기간영업장면적(㎡)운영형태# duplicates
5430700002005070420010051169식품제조가공업식품제조가공업성일식품<NA>서울특별시 성북구 정릉동 210번지 상가 푸른마을동아아파트-지하1층20050419처분확정과태료부과20만원식품위생법 제27조200504192004년도 영업자 위생교육 미수료과태료부과20만원<NA><NA><NA>8
5630700002005070420010051169식품제조가공업식품제조가공업성일식품<NA>서울특별시 성북구 정릉동 210번지 상가 푸른마을동아아파트-지하1층20050419처분확정시정명령식품위생법 제27조200504192004년도 영업자 위생교육 미수료시정명령<NA><NA><NA>8
18030700002011071319950050486일반음식점한식쌍둥이식당<NA>서울특별시 성북구 상월곡동 64번지 3호20110706처분확정과태료부과식품위생법 제3조 및 동법 시행규칙 제100조20110706위생모 미착용과태료부과<NA><NA><NA>5
18130700002011071319950050486일반음식점한식쌍둥이식당<NA>서울특별시 성북구 상월곡동 64번지 3호20110706처분확정과태료부과식품위생법 제40조제1항20110706건강진단을 받지 아니한 종업원(조리장 종사, 윤영은)과태료부과<NA><NA><NA>5
18230700002011071319950050486일반음식점한식쌍둥이식당<NA>서울특별시 성북구 상월곡동 64번지 3호20110706처분확정과태료부과식품위생법 제40조제1항20110706건강진단을 받지 아니한 종업원(조리장 종사, 윤용진)과태료부과<NA><NA><NA>5
18330700002011071319950050486일반음식점한식쌍둥이식당<NA>서울특별시 성북구 상월곡동 64번지 3호20110706처분확정과태료부과식품위생법 제40조제1항20110706건강진단을 받지 아니한 종업원(조리장 종사, 윤정은)과태료부과<NA><NA><NA>5
18430700002011071319950050486일반음식점한식쌍둥이식당<NA>서울특별시 성북구 상월곡동 64번지 3호20110706처분확정과태료부과식품위생법 제40조제3항20110706건강진단을 받지 아니한 종업원을 영업에 종사시킨 영업자(3/3)과태료부과<NA><NA><NA>5
1830700002003050920020050806유통전문판매업유통전문판매업주식회사 지웰라이프<NA>서울특별시 성북구 보문동7가 118번지 서광빌딩 5층20030421처분확정영업정지식품위생법제11조20030421허위과대광고영업정지15<NA><NA>4
3130700002004051320010051169식품제조가공업식품제조가공업성일식품<NA>서울특별시 성북구 정릉동 170번지 16호20040408처분확정품목류제조정지식품위생법제19조20040408자가품질검사미필품목류제조정지<NA><NA><NA>4
4330700002005050620010051169식품제조가공업식품제조가공업성일식품<NA>서울특별시 성북구 정릉동 210번지 상가 푸른마을동아아파트-지하1층20050418처분확정시정명령식품위생법제7조20050418식품의 원료사용기준 위반 -유통기한(가공일자) 미표시식품 원료사용,보관시정명령<NA><NA><NA>4