Overview

Dataset statistics

Number of variables47
Number of observations10000
Missing cells145060
Missing cells (%)30.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 MiB
Average record size in memory414.0 B

Variable types

Numeric15
Categorical14
Text7
Unsupported9
DateTime1
Boolean1

Dataset

Description22년09월_6270000_대구광역시_07_22_03_P_건강기능식품일반판매업
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=DMI_0000096630&dataSetDetailId=DDI_0000096655&provdMethod=FILE

Alerts

개방서비스명 has constant value ""Constant
개방서비스아이디 has constant value ""Constant
다중이용업소여부 has constant value ""Constant
급수시설구분명 is highly imbalanced (69.9%)Imbalance
공장생산직직원수 is highly imbalanced (58.7%)Imbalance
인허가취소일자 has 10000 (100.0%) missing valuesMissing
폐업일자 has 2961 (29.6%) missing valuesMissing
휴업시작일자 has 10000 (100.0%) missing valuesMissing
휴업종료일자 has 10000 (100.0%) missing valuesMissing
재개업일자 has 10000 (100.0%) missing valuesMissing
소재지전화 has 5411 (54.1%) missing valuesMissing
소재지면적 has 4304 (43.0%) missing valuesMissing
소재지우편번호 has 131 (1.3%) missing valuesMissing
도로명전체주소 has 1877 (18.8%) missing valuesMissing
도로명우편번호 has 1942 (19.4%) missing valuesMissing
업태구분명 has 10000 (100.0%) missing valuesMissing
좌표정보(X) has 231 (2.3%) missing valuesMissing
좌표정보(Y) has 231 (2.3%) missing valuesMissing
영업장주변구분명 has 10000 (100.0%) missing valuesMissing
등급구분명 has 10000 (100.0%) missing valuesMissing
본사직원수 has 3700 (37.0%) missing valuesMissing
공장사무직직원수 has 3701 (37.0%) missing valuesMissing
공장판매직직원수 has 3700 (37.0%) missing valuesMissing
보증액 has 8444 (84.4%) missing valuesMissing
월세액 has 8444 (84.4%) missing valuesMissing
전통업소지정번호 has 10000 (100.0%) missing valuesMissing
전통업소주된음식 has 10000 (100.0%) missing valuesMissing
홈페이지 has 9977 (99.8%) missing valuesMissing
소재지우편번호 is highly skewed (γ1 = -74.31405975)Skewed
본사직원수 is highly skewed (γ1 = 28.32954065)Skewed
공장사무직직원수 is highly skewed (γ1 = 26.81200534)Skewed
공장판매직직원수 is highly skewed (γ1 = 22.32326867)Skewed
보증액 is highly skewed (γ1 = 37.8684918)Skewed
시설총규모 is highly skewed (γ1 = 24.21205549)Skewed
번호 has unique valuesUnique
관리번호 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업시작일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업종료일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
영업장주변구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
등급구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전통업소지정번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전통업소주된음식 is an unsupported type, check if it needs cleaning or further analysisUnsupported
본사직원수 has 6275 (62.7%) zerosZeros
공장사무직직원수 has 6081 (60.8%) zerosZeros
공장판매직직원수 has 5631 (56.3%) zerosZeros
보증액 has 1548 (15.5%) zerosZeros
월세액 has 1548 (15.5%) zerosZeros
시설총규모 has 9852 (98.5%) zerosZeros

Reproduction

Analysis started2024-04-19 06:58:50.501453
Analysis finished2024-04-19 06:58:52.580878
Duration2.08 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8235.2332
Minimum1
Maximum16369
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:52.647002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile831.95
Q14142.75
median8223.5
Q312342.25
95-th percentile15592.05
Maximum16369
Range16368
Interquartile range (IQR)8199.5

Descriptive statistics

Standard deviation4727.3036
Coefficient of variation (CV)0.57403397
Kurtosis-1.1963325
Mean8235.2332
Median Absolute Deviation (MAD)4101
Skewness-0.0073183419
Sum82352332
Variance22347399
MonotonicityNot monotonic
2024-04-19T15:58:52.817335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1012 1
 
< 0.1%
7627 1
 
< 0.1%
10464 1
 
< 0.1%
11786 1
 
< 0.1%
8574 1
 
< 0.1%
15395 1
 
< 0.1%
14615 1
 
< 0.1%
2242 1
 
< 0.1%
12506 1
 
< 0.1%
4315 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
4 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
18 1
< 0.1%
19 1
< 0.1%
20 1
< 0.1%
ValueCountFrequency (%)
16369 1
< 0.1%
16368 1
< 0.1%
16367 1
< 0.1%
16366 1
< 0.1%
16365 1
< 0.1%
16364 1
< 0.1%
16363 1
< 0.1%
16361 1
< 0.1%
16359 1
< 0.1%
16357 1
< 0.1%

개방서비스명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
건강기능식품일반판매업
10000 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건강기능식품일반판매업
2nd row건강기능식품일반판매업
3rd row건강기능식품일반판매업
4th row건강기능식품일반판매업
5th row건강기능식품일반판매업

Common Values

ValueCountFrequency (%)
건강기능식품일반판매업 10000
100.0%

Length

2024-04-19T15:58:52.937869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:58:53.037283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건강기능식품일반판매업 10000
100.0%

개방서비스아이디
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
07_22_03_P
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row07_22_03_P
2nd row07_22_03_P
3rd row07_22_03_P
4th row07_22_03_P
5th row07_22_03_P

Common Values

ValueCountFrequency (%)
07_22_03_P 10000
100.0%

Length

2024-04-19T15:58:53.127808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:58:53.390805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
07_22_03_p 10000
100.0%

개방자치단체코드
Real number (ℝ)

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3448589
Minimum3410000
Maximum3480000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:53.467294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3410000
5-th percentile3410000
Q13430000
median3450000
Q33470000
95-th percentile3480000
Maximum3480000
Range70000
Interquartile range (IQR)40000

Descriptive statistics

Standard deviation21269.798
Coefficient of variation (CV)0.0061676814
Kurtosis-1.0734271
Mean3448589
Median Absolute Deviation (MAD)20000
Skewness-0.41353374
Sum3.448589 × 1010
Variance4.5240432 × 108
MonotonicityNot monotonic
2024-04-19T15:58:53.573712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
3470000 2161
21.6%
3460000 1823
18.2%
3450000 1746
17.5%
3420000 1412
14.1%
3410000 806
 
8.1%
3440000 713
 
7.1%
3430000 680
 
6.8%
3480000 659
 
6.6%
ValueCountFrequency (%)
3410000 806
 
8.1%
3420000 1412
14.1%
3430000 680
 
6.8%
3440000 713
 
7.1%
3450000 1746
17.5%
3460000 1823
18.2%
3470000 2161
21.6%
3480000 659
 
6.6%
ValueCountFrequency (%)
3480000 659
 
6.6%
3470000 2161
21.6%
3460000 1823
18.2%
3450000 1746
17.5%
3440000 713
 
7.1%
3430000 680
 
6.8%
3420000 1412
14.1%
3410000 806
 
8.1%

관리번호
Text

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-19T15:58:53.775943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length22
Mean length22
Min length22

Characters and Unicode

Total characters220000
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10000 ?
Unique (%)100.0%

Sample

1st row3410000-134-2022-00045
2nd row3470000-134-2020-00185
3rd row3440000-134-2016-00033
4th row3480000-134-2015-00010
5th row3430000-134-2014-00011
ValueCountFrequency (%)
3410000-134-2022-00045 1
 
< 0.1%
3420000-134-2011-00051 1
 
< 0.1%
3440000-134-2018-00052 1
 
< 0.1%
3440000-134-2010-00034 1
 
< 0.1%
3460000-134-2011-00092 1
 
< 0.1%
3460000-134-2019-00038 1
 
< 0.1%
3450000-134-2004-00265 1
 
< 0.1%
3480000-134-2015-00015 1
 
< 0.1%
3470000-134-2015-00092 1
 
< 0.1%
3450000-134-2012-00122 1
 
< 0.1%
Other values (9990) 9990
99.9%
2024-04-19T15:58:54.080102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 83662
38.0%
- 30000
 
13.6%
4 24742
 
11.2%
3 23709
 
10.8%
1 21631
 
9.8%
2 17386
 
7.9%
7 4462
 
2.0%
5 4459
 
2.0%
6 4249
 
1.9%
8 2951
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 190000
86.4%
Dash Punctuation 30000
 
13.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 83662
44.0%
4 24742
 
13.0%
3 23709
 
12.5%
1 21631
 
11.4%
2 17386
 
9.2%
7 4462
 
2.3%
5 4459
 
2.3%
6 4249
 
2.2%
8 2951
 
1.6%
9 2749
 
1.4%
Dash Punctuation
ValueCountFrequency (%)
- 30000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 220000
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 83662
38.0%
- 30000
 
13.6%
4 24742
 
11.2%
3 23709
 
10.8%
1 21631
 
9.8%
2 17386
 
7.9%
7 4462
 
2.0%
5 4459
 
2.0%
6 4249
 
1.9%
8 2951
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 220000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 83662
38.0%
- 30000
 
13.6%
4 24742
 
11.2%
3 23709
 
10.8%
1 21631
 
9.8%
2 17386
 
7.9%
7 4462
 
2.0%
5 4459
 
2.0%
6 4249
 
1.9%
8 2951
 
1.3%

인허가일자
Real number (ℝ)

Distinct3551
Distinct (%)35.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20128942
Minimum20040204
Maximum20220930
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:54.219464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20040204
5-th percentile20040618
Q120090324
median20121126
Q320181119
95-th percentile20220401
Maximum20220930
Range180726
Interquartile range (IQR)90795.75

Descriptive statistics

Standard deviation58607.293
Coefficient of variation (CV)0.0029115933
Kurtosis-1.2089861
Mean20128942
Median Absolute Deviation (MAD)50022
Skewness0.021336816
Sum2.0128942 × 1011
Variance3.4348147 × 109
MonotonicityNot monotonic
2024-04-19T15:58:54.353056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20040618 225
 
2.2%
20040617 73
 
0.7%
20040616 36
 
0.4%
20040614 29
 
0.3%
20150423 27
 
0.3%
20040621 27
 
0.3%
20040615 27
 
0.3%
20131204 27
 
0.3%
20040917 25
 
0.2%
20081219 22
 
0.2%
Other values (3541) 9482
94.8%
ValueCountFrequency (%)
20040204 1
< 0.1%
20040214 1
< 0.1%
20040320 1
< 0.1%
20040326 1
< 0.1%
20040401 1
< 0.1%
20040408 2
< 0.1%
20040409 1
< 0.1%
20040414 1
< 0.1%
20040419 2
< 0.1%
20040420 1
< 0.1%
ValueCountFrequency (%)
20220930 3
< 0.1%
20220929 3
< 0.1%
20220928 3
< 0.1%
20220927 4
< 0.1%
20220926 4
< 0.1%
20220923 3
< 0.1%
20220922 2
< 0.1%
20220921 2
< 0.1%
20220920 4
< 0.1%
20220919 2
< 0.1%

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3
7039 
1
2961 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 7039
70.4%
1 2961
29.6%

Length

2024-04-19T15:58:54.478488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:58:54.560910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 7039
70.4%
1 2961
29.6%

영업상태명
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
폐업
7039 
영업/정상
2961 

Length

Max length5
Median length2
Mean length2.8883
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row폐업
3rd row폐업
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
폐업 7039
70.4%
영업/정상 2961
29.6%

Length

2024-04-19T15:58:54.662729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:58:54.755728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 7039
70.4%
영업/정상 2961
29.6%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
7039 
1
2961 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 7039
70.4%
1 2961
29.6%

Length

2024-04-19T15:58:54.878731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:58:54.982377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 7039
70.4%
1 2961
29.6%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
폐업
7039 
영업
2961 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row폐업
3rd row폐업
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
폐업 7039
70.4%
영업 2961
29.6%

Length

2024-04-19T15:58:55.077578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:58:55.175584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 7039
70.4%
영업 2961
29.6%

폐업일자
Real number (ℝ)

MISSING 

Distinct2993
Distinct (%)42.5%
Missing2961
Missing (%)29.6%
Infinite0
Infinite (%)0.0%
Mean20149268
Minimum20040419
Maximum20220929
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:55.281509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20040419
5-th percentile20061226
Q120111226
median20160224
Q320190130
95-th percentile20211203
Maximum20220929
Range180510
Interquartile range (IQR)78904

Descriptive statistics

Standard deviation45333.445
Coefficient of variation (CV)0.0022498805
Kurtosis-0.75291527
Mean20149268
Median Absolute Deviation (MAD)30700
Skewness-0.46020852
Sum1.418307 × 1011
Variance2.0551212 × 109
MonotonicityNot monotonic
2024-04-19T15:58:55.417577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20171229 51
 
0.5%
20171228 36
 
0.4%
20171205 35
 
0.4%
20171222 26
 
0.3%
20191231 26
 
0.3%
20171211 25
 
0.2%
20171206 24
 
0.2%
20211231 23
 
0.2%
20171220 22
 
0.2%
20171227 22
 
0.2%
Other values (2983) 6749
67.5%
(Missing) 2961
29.6%
ValueCountFrequency (%)
20040419 1
< 0.1%
20040503 1
< 0.1%
20040610 1
< 0.1%
20040624 1
< 0.1%
20040701 1
< 0.1%
20040713 1
< 0.1%
20040714 1
< 0.1%
20040715 1
< 0.1%
20040719 1
< 0.1%
20040722 1
< 0.1%
ValueCountFrequency (%)
20220929 3
< 0.1%
20220927 3
< 0.1%
20220926 1
 
< 0.1%
20220923 2
< 0.1%
20220921 1
 
< 0.1%
20220920 2
< 0.1%
20220919 1
 
< 0.1%
20220915 2
< 0.1%
20220914 3
< 0.1%
20220913 3
< 0.1%

휴업시작일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

휴업종료일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

소재지전화
Text

MISSING 

Distinct4281
Distinct (%)93.3%
Missing5411
Missing (%)54.1%
Memory size156.2 KiB
2024-04-19T15:58:55.758558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.179124
Min length3

Characters and Unicode

Total characters51301
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4027 ?
Unique (%)87.8%

Sample

1st row053 584 5709
2nd row053 621 6219
3rd row053 5231320
4th row053 586 7589
5th row474 6500
ValueCountFrequency (%)
053 3720
34.4%
070 149
 
1.4%
02 42
 
0.4%
323 34
 
0.3%
741 30
 
0.3%
621 30
 
0.3%
753 26
 
0.2%
791 25
 
0.2%
324 23
 
0.2%
588 23
 
0.2%
Other values (4268) 6712
62.1%
2024-04-19T15:58:56.204123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 8294
16.2%
3 7232
14.1%
0 7199
14.0%
6280
12.2%
2 3996
7.8%
7 3660
7.1%
6 3532
6.9%
1 2909
 
5.7%
4 2902
 
5.7%
8 2807
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 45021
87.8%
Space Separator 6280
 
12.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 8294
18.4%
3 7232
16.1%
0 7199
16.0%
2 3996
8.9%
7 3660
8.1%
6 3532
7.8%
1 2909
 
6.5%
4 2902
 
6.4%
8 2807
 
6.2%
9 2490
 
5.5%
Space Separator
ValueCountFrequency (%)
6280
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 51301
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 8294
16.2%
3 7232
14.1%
0 7199
14.0%
6280
12.2%
2 3996
7.8%
7 3660
7.1%
6 3532
6.9%
1 2909
 
5.7%
4 2902
 
5.7%
8 2807
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 51301
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 8294
16.2%
3 7232
14.1%
0 7199
14.0%
6280
12.2%
2 3996
7.8%
7 3660
7.1%
6 3532
6.9%
1 2909
 
5.7%
4 2902
 
5.7%
8 2807
 
5.5%

소재지면적
Text

MISSING 

Distinct1698
Distinct (%)29.8%
Missing4304
Missing (%)43.0%
Memory size156.2 KiB
2024-04-19T15:58:56.560290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length4.4529494
Min length3

Characters and Unicode

Total characters25364
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1285 ?
Unique (%)22.6%

Sample

1st row.00
2nd row67.39
3rd row.00
4th row1.50
5th row5.00
ValueCountFrequency (%)
3.30 719
 
12.6%
00 501
 
8.8%
3.00 382
 
6.7%
1.00 292
 
5.1%
2.00 194
 
3.4%
6.00 119
 
2.1%
6.60 88
 
1.5%
4.00 85
 
1.5%
33.00 62
 
1.1%
30.00 55
 
1.0%
Other values (1688) 3199
56.2%
2024-04-19T15:58:57.014081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 7585
29.9%
. 5696
22.5%
3 2940
 
11.6%
1 1846
 
7.3%
2 1696
 
6.7%
6 1224
 
4.8%
5 1166
 
4.6%
4 1051
 
4.1%
8 783
 
3.1%
9 709
 
2.8%
Other values (2) 668
 
2.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 19666
77.5%
Other Punctuation 5698
 
22.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 7585
38.6%
3 2940
 
14.9%
1 1846
 
9.4%
2 1696
 
8.6%
6 1224
 
6.2%
5 1166
 
5.9%
4 1051
 
5.3%
8 783
 
4.0%
9 709
 
3.6%
7 666
 
3.4%
Other Punctuation
ValueCountFrequency (%)
. 5696
> 99.9%
, 2
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 25364
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 7585
29.9%
. 5696
22.5%
3 2940
 
11.6%
1 1846
 
7.3%
2 1696
 
6.7%
6 1224
 
4.8%
5 1166
 
4.6%
4 1051
 
4.1%
8 783
 
3.1%
9 709
 
2.8%
Other values (2) 668
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25364
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 7585
29.9%
. 5696
22.5%
3 2940
 
11.6%
1 1846
 
7.3%
2 1696
 
6.7%
6 1224
 
4.8%
5 1166
 
4.6%
4 1051
 
4.1%
8 783
 
3.1%
9 709
 
2.8%
Other values (2) 668
 
2.6%

소재지우편번호
Real number (ℝ)

MISSING  SKEWED 

Distinct920
Distinct (%)9.3%
Missing131
Missing (%)1.3%
Infinite0
Infinite (%)0.0%
Mean704309.72
Minimum136821
Maximum718801
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:57.154960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum136821
5-th percentile700805
Q1702746
median704400
Q3705830
95-th percentile711777
Maximum718801
Range581980
Interquartile range (IQR)3084

Descriptive statistics

Standard deviation6291.3553
Coefficient of variation (CV)0.0089326543
Kurtosis6709.6746
Mean704309.72
Median Absolute Deviation (MAD)1610
Skewness-74.31406
Sum6.9508327 × 109
Variance39581152
MonotonicityNot monotonic
2024-04-19T15:58:57.328740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
706170 153
 
1.5%
704080 102
 
1.0%
702886 102
 
1.0%
704060 83
 
0.8%
704834 80
 
0.8%
701847 70
 
0.7%
706220 70
 
0.7%
702807 70
 
0.7%
711812 66
 
0.7%
702040 64
 
0.6%
Other values (910) 9009
90.1%
(Missing) 131
 
1.3%
ValueCountFrequency (%)
136821 1
 
< 0.1%
700010 14
 
0.1%
700020 4
 
< 0.1%
700040 7
 
0.1%
700050 1
 
< 0.1%
700060 6
 
0.1%
700070 29
0.3%
700081 3
 
< 0.1%
700082 37
0.4%
700092 25
0.2%
ValueCountFrequency (%)
718801 1
 
< 0.1%
712110 2
 
< 0.1%
711893 2
 
< 0.1%
711892 1
 
< 0.1%
711891 36
0.4%
711874 7
 
0.1%
711873 5
 
0.1%
711872 6
 
0.1%
711871 1
 
< 0.1%
711864 7
 
0.1%
Distinct4827
Distinct (%)48.3%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-19T15:58:57.621037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length52
Mean length27.219444
Min length16

Characters and Unicode

Total characters272140
Distinct characters497
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3580 ?
Unique (%)35.8%

Sample

1st row대구광역시 중구 종로*가 ****-****
2nd row대구광역시 달서구 월성동 **-* 월성청구코아
3rd row대구광역시 남구 이천동 ***-**번지
4th row대구광역시 달성군 다사읍 죽곡리 ***-*번지
5th row대구광역시 서구 평리동 ****-**번지
ValueCountFrequency (%)
대구광역시 9994
19.6%
번지 7294
 
14.3%
2928
 
5.7%
달서구 2158
 
4.2%
2073
 
4.1%
수성구 1823
 
3.6%
북구 1742
 
3.4%
동구 1412
 
2.8%
1364
 
2.7%
1128
 
2.2%
Other values (2497) 19025
37.3%
2024-04-19T15:58:58.049664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 60788
22.3%
49284
18.1%
19839
 
7.3%
12913
 
4.7%
11380
 
4.2%
10325
 
3.8%
10100
 
3.7%
10050
 
3.7%
9255
 
3.4%
- 7456
 
2.7%
Other values (487) 70750
26.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 152099
55.9%
Other Punctuation 61018
22.4%
Space Separator 49284
 
18.1%
Dash Punctuation 7456
 
2.7%
Open Punctuation 878
 
0.3%
Close Punctuation 877
 
0.3%
Uppercase Letter 451
 
0.2%
Lowercase Letter 65
 
< 0.1%
Math Symbol 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19839
 
13.0%
12913
 
8.5%
11380
 
7.5%
10325
 
6.8%
10100
 
6.6%
10050
 
6.6%
9255
 
6.1%
7297
 
4.8%
3887
 
2.6%
3343
 
2.2%
Other values (440) 53710
35.3%
Uppercase Letter
ValueCountFrequency (%)
A 144
31.9%
B 52
 
11.5%
P 37
 
8.2%
T 37
 
8.2%
S 24
 
5.3%
C 24
 
5.3%
M 18
 
4.0%
L 16
 
3.5%
K 16
 
3.5%
D 15
 
3.3%
Other values (14) 68
15.1%
Lowercase Letter
ValueCountFrequency (%)
e 52
80.0%
s 2
 
3.1%
c 2
 
3.1%
a 2
 
3.1%
w 1
 
1.5%
h 1
 
1.5%
u 1
 
1.5%
l 1
 
1.5%
p 1
 
1.5%
m 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
* 60788
99.6%
, 149
 
0.2%
. 47
 
0.1%
/ 31
 
0.1%
@ 2
 
< 0.1%
· 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 10
83.3%
+ 2
 
16.7%
Space Separator
ValueCountFrequency (%)
49284
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7456
100.0%
Open Punctuation
ValueCountFrequency (%)
( 878
100.0%
Close Punctuation
ValueCountFrequency (%)
) 877
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 152099
55.9%
Common 119525
43.9%
Latin 516
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19839
 
13.0%
12913
 
8.5%
11380
 
7.5%
10325
 
6.8%
10100
 
6.6%
10050
 
6.6%
9255
 
6.1%
7297
 
4.8%
3887
 
2.6%
3343
 
2.2%
Other values (440) 53710
35.3%
Latin
ValueCountFrequency (%)
A 144
27.9%
e 52
 
10.1%
B 52
 
10.1%
P 37
 
7.2%
T 37
 
7.2%
S 24
 
4.7%
C 24
 
4.7%
M 18
 
3.5%
L 16
 
3.1%
K 16
 
3.1%
Other values (25) 96
18.6%
Common
ValueCountFrequency (%)
* 60788
50.9%
49284
41.2%
- 7456
 
6.2%
( 878
 
0.7%
) 877
 
0.7%
, 149
 
0.1%
. 47
 
< 0.1%
/ 31
 
< 0.1%
~ 10
 
< 0.1%
@ 2
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 152099
55.9%
ASCII 120040
44.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 60788
50.6%
49284
41.1%
- 7456
 
6.2%
( 878
 
0.7%
) 877
 
0.7%
, 149
 
0.1%
A 144
 
0.1%
e 52
 
< 0.1%
B 52
 
< 0.1%
. 47
 
< 0.1%
Other values (36) 313
 
0.3%
Hangul
ValueCountFrequency (%)
19839
 
13.0%
12913
 
8.5%
11380
 
7.5%
10325
 
6.8%
10100
 
6.6%
10050
 
6.6%
9255
 
6.1%
7297
 
4.8%
3887
 
2.6%
3343
 
2.2%
Other values (440) 53710
35.3%
None
ValueCountFrequency (%)
· 1
100.0%

도로명전체주소
Text

MISSING 

Distinct5969
Distinct (%)73.5%
Missing1877
Missing (%)18.8%
Memory size156.2 KiB
2024-04-19T15:58:58.366410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length56
Mean length33.286717
Min length19

Characters and Unicode

Total characters270388
Distinct characters516
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4815 ?
Unique (%)59.3%

Sample

1st row대구광역시 중구 종로 **-*, *층 ****호 (종로*가)
2nd row대구광역시 달서구 학산로 **, 월성청구코아 *층 ***호 (월성동)
3rd row대구광역시 남구 희망로*길 **, ***동 ***호 (이천동, 대성유니드아파트)
4th row대구광역시 달성군 다사읍 죽곡*길 *-*, *층
5th row대구광역시 서구 통학로**길 **-** (평리동)
ValueCountFrequency (%)
8178
 
15.5%
대구광역시 8119
 
15.4%
3101
 
5.9%
2966
 
5.6%
1872
 
3.5%
달서구 1746
 
3.3%
수성구 1490
 
2.8%
북구 1476
 
2.8%
동구 1188
 
2.2%
중구 615
 
1.2%
Other values (3346) 22051
41.8%
2024-04-19T15:58:58.826104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 48549
18.0%
44683
16.5%
17171
 
6.4%
12528
 
4.6%
10831
 
4.0%
, 8577
 
3.2%
8497
 
3.1%
8305
 
3.1%
( 8231
 
3.0%
) 8230
 
3.0%
Other values (506) 94786
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 150091
55.5%
Other Punctuation 57165
 
21.1%
Space Separator 44683
 
16.5%
Open Punctuation 8231
 
3.0%
Close Punctuation 8230
 
3.0%
Dash Punctuation 1473
 
0.5%
Uppercase Letter 420
 
0.2%
Lowercase Letter 81
 
< 0.1%
Math Symbol 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17171
 
11.4%
12528
 
8.3%
10831
 
7.2%
8497
 
5.7%
8305
 
5.5%
8164
 
5.4%
8041
 
5.4%
4119
 
2.7%
3712
 
2.5%
3585
 
2.4%
Other values (454) 65138
43.4%
Uppercase Letter
ValueCountFrequency (%)
A 115
27.4%
B 58
13.8%
C 31
 
7.4%
S 28
 
6.7%
T 23
 
5.5%
P 21
 
5.0%
D 17
 
4.0%
M 17
 
4.0%
K 16
 
3.8%
L 16
 
3.8%
Other values (14) 78
18.6%
Lowercase Letter
ValueCountFrequency (%)
e 59
72.8%
p 3
 
3.7%
s 3
 
3.7%
l 2
 
2.5%
t 2
 
2.5%
a 2
 
2.5%
c 2
 
2.5%
b 1
 
1.2%
i 1
 
1.2%
y 1
 
1.2%
Other values (5) 5
 
6.2%
Other Punctuation
ValueCountFrequency (%)
* 48549
84.9%
, 8577
 
15.0%
. 19
 
< 0.1%
/ 15
 
< 0.1%
· 2
 
< 0.1%
& 2
 
< 0.1%
: 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 12
85.7%
+ 2
 
14.3%
Space Separator
ValueCountFrequency (%)
44683
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8231
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8230
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1473
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 150091
55.5%
Common 119796
44.3%
Latin 501
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17171
 
11.4%
12528
 
8.3%
10831
 
7.2%
8497
 
5.7%
8305
 
5.5%
8164
 
5.4%
8041
 
5.4%
4119
 
2.7%
3712
 
2.5%
3585
 
2.4%
Other values (454) 65138
43.4%
Latin
ValueCountFrequency (%)
A 115
23.0%
e 59
11.8%
B 58
11.6%
C 31
 
6.2%
S 28
 
5.6%
T 23
 
4.6%
P 21
 
4.2%
D 17
 
3.4%
M 17
 
3.4%
K 16
 
3.2%
Other values (29) 116
23.2%
Common
ValueCountFrequency (%)
* 48549
40.5%
44683
37.3%
, 8577
 
7.2%
( 8231
 
6.9%
) 8230
 
6.9%
- 1473
 
1.2%
. 19
 
< 0.1%
/ 15
 
< 0.1%
~ 12
 
< 0.1%
· 2
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 150091
55.5%
ASCII 120295
44.5%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 48549
40.4%
44683
37.1%
, 8577
 
7.1%
( 8231
 
6.8%
) 8230
 
6.8%
- 1473
 
1.2%
A 115
 
0.1%
e 59
 
< 0.1%
B 58
 
< 0.1%
C 31
 
< 0.1%
Other values (41) 289
 
0.2%
Hangul
ValueCountFrequency (%)
17171
 
11.4%
12528
 
8.3%
10831
 
7.2%
8497
 
5.7%
8305
 
5.5%
8164
 
5.4%
8041
 
5.4%
4119
 
2.7%
3712
 
2.5%
3585
 
2.4%
Other values (454) 65138
43.4%
None
ValueCountFrequency (%)
· 2
100.0%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct1314
Distinct (%)16.3%
Missing1942
Missing (%)19.4%
Infinite0
Infinite (%)0.0%
Mean42041.334
Minimum2836
Maximum43024
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:58.957325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2836
5-th percentile41108
Q141516
median42062
Q342631
95-th percentile42927
Maximum43024
Range40188
Interquartile range (IQR)1115

Descriptive statistics

Standard deviation736.80236
Coefficient of variation (CV)0.017525665
Kurtosis993.53833
Mean42041.334
Median Absolute Deviation (MAD)563.5
Skewness-18.751659
Sum3.3876907 × 108
Variance542877.72
MonotonicityNot monotonic
2024-04-19T15:58:59.111456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
41423 47
 
0.5%
41936 39
 
0.4%
41940 38
 
0.4%
41581 35
 
0.4%
41229 33
 
0.3%
41438 32
 
0.3%
42760 32
 
0.3%
42914 29
 
0.3%
41519 29
 
0.3%
42274 29
 
0.3%
Other values (1304) 7715
77.1%
(Missing) 1942
 
19.4%
ValueCountFrequency (%)
2836 1
 
< 0.1%
38655 2
 
< 0.1%
39894 1
 
< 0.1%
41000 1
 
< 0.1%
41001 4
 
< 0.1%
41002 11
0.1%
41003 6
 
0.1%
41004 1
 
< 0.1%
41005 17
0.2%
41006 1
 
< 0.1%
ValueCountFrequency (%)
43024 2
 
< 0.1%
43023 1
 
< 0.1%
43019 12
0.1%
43018 11
0.1%
43017 14
0.1%
43016 17
0.2%
43015 5
 
0.1%
43014 4
 
< 0.1%
43013 4
 
< 0.1%
43012 1
 
< 0.1%
Distinct8507
Distinct (%)85.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-19T15:58:59.371425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length28
Mean length6.899
Min length1

Characters and Unicode

Total characters68990
Distinct characters925
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7859 ?
Unique (%)78.6%

Sample

1st row자유팜
2nd row허브&해독나라
3rd row시너지코리아
4th row동인비 다사지사
5th row콩지몰
ValueCountFrequency (%)
허브다이어트 117
 
1.0%
주식회사 79
 
0.7%
한국암웨이 64
 
0.6%
애터미 57
 
0.5%
아모레퍼시픽 50
 
0.4%
지에스(gs)25 45
 
0.4%
아모레 42
 
0.4%
에이다넷 40
 
0.4%
아리따움 35
 
0.3%
세븐일레븐 32
 
0.3%
Other values (8793) 10755
95.0%
2024-04-19T15:58:59.721546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2698
 
3.9%
1699
 
2.5%
1531
 
2.2%
) 1498
 
2.2%
( 1483
 
2.1%
1319
 
1.9%
1237
 
1.8%
1170
 
1.7%
1052
 
1.5%
1024
 
1.5%
Other values (915) 54279
78.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60348
87.5%
Uppercase Letter 1716
 
2.5%
Close Punctuation 1499
 
2.2%
Open Punctuation 1484
 
2.2%
Space Separator 1319
 
1.9%
Decimal Number 1286
 
1.9%
Lowercase Letter 1171
 
1.7%
Other Punctuation 121
 
0.2%
Dash Punctuation 41
 
0.1%
Connector Punctuation 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2698
 
4.5%
1699
 
2.8%
1531
 
2.5%
1237
 
2.0%
1170
 
1.9%
1052
 
1.7%
1024
 
1.7%
996
 
1.7%
904
 
1.5%
882
 
1.5%
Other values (833) 47155
78.1%
Lowercase Letter
ValueCountFrequency (%)
e 148
12.6%
o 107
 
9.1%
a 85
 
7.3%
n 83
 
7.1%
i 80
 
6.8%
l 78
 
6.7%
t 71
 
6.1%
r 63
 
5.4%
s 46
 
3.9%
g 46
 
3.9%
Other values (16) 364
31.1%
Uppercase Letter
ValueCountFrequency (%)
S 262
15.3%
G 218
12.7%
O 127
 
7.4%
I 107
 
6.2%
C 105
 
6.1%
B 102
 
5.9%
N 90
 
5.2%
H 73
 
4.3%
E 73
 
4.3%
A 72
 
4.2%
Other values (15) 487
28.4%
Other Punctuation
ValueCountFrequency (%)
& 46
38.0%
. 38
31.4%
, 16
 
13.2%
/ 7
 
5.8%
! 4
 
3.3%
' 3
 
2.5%
: 2
 
1.7%
; 2
 
1.7%
1
 
0.8%
# 1
 
0.8%
Decimal Number
ValueCountFrequency (%)
4 527
41.0%
2 265
20.6%
5 245
19.1%
1 81
 
6.3%
3 57
 
4.4%
8 32
 
2.5%
0 30
 
2.3%
6 21
 
1.6%
9 16
 
1.2%
7 12
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 1498
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1483
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
1319
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60347
87.5%
Common 5753
 
8.3%
Latin 2888
 
4.2%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2698
 
4.5%
1699
 
2.8%
1531
 
2.5%
1237
 
2.0%
1170
 
1.9%
1052
 
1.7%
1024
 
1.7%
996
 
1.7%
904
 
1.5%
882
 
1.5%
Other values (833) 47154
78.1%
Latin
ValueCountFrequency (%)
S 262
 
9.1%
G 218
 
7.5%
e 148
 
5.1%
O 127
 
4.4%
I 107
 
3.7%
o 107
 
3.7%
C 105
 
3.6%
B 102
 
3.5%
N 90
 
3.1%
a 85
 
2.9%
Other values (42) 1537
53.2%
Common
ValueCountFrequency (%)
) 1498
26.0%
( 1483
25.8%
1319
22.9%
4 527
 
9.2%
2 265
 
4.6%
5 245
 
4.3%
1 81
 
1.4%
3 57
 
1.0%
& 46
 
0.8%
- 41
 
0.7%
Other values (19) 191
 
3.3%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60346
87.5%
ASCII 8638
 
12.5%
None 3
 
< 0.1%
CJK 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2698
 
4.5%
1699
 
2.8%
1531
 
2.5%
1237
 
2.0%
1170
 
1.9%
1052
 
1.7%
1024
 
1.7%
996
 
1.7%
904
 
1.5%
882
 
1.5%
Other values (832) 47153
78.1%
ASCII
ValueCountFrequency (%)
) 1498
17.3%
( 1483
17.2%
1319
15.3%
4 527
 
6.1%
2 265
 
3.1%
S 262
 
3.0%
5 245
 
2.8%
G 218
 
2.5%
e 148
 
1.7%
O 127
 
1.5%
Other values (68) 2546
29.5%
CJK
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
· 1
33.3%
Number Forms
ValueCountFrequency (%)
1
100.0%

최종수정시점
Real number (ℝ)

Distinct9298
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0155469 × 1013
Minimum2.0040324 × 1013
Maximum2.022093 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:58:59.859608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0040324 × 1013
5-th percentile2.0041008 × 1013
Q12.0111123 × 1013
median2.0171219 × 1013
Q32.0201021 × 1013
95-th percentile2.0220519 × 1013
Maximum2.022093 × 1013
Range1.8060617 × 1011
Interquartile range (IQR)8.9897991 × 1010

Descriptive statistics

Standard deviation5.5716371 × 1010
Coefficient of variation (CV)0.0027643302
Kurtosis-0.75947769
Mean2.0155469 × 1013
Median Absolute Deviation (MAD)3.9688027 × 1010
Skewness-0.67867381
Sum2.0155469 × 1017
Variance3.104314 × 1021
MonotonicityNot monotonic
2024-04-19T15:58:59.999674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20041008000000 73
 
0.7%
20040618000000 30
 
0.3%
20040917000000 24
 
0.2%
20040715000000 18
 
0.2%
20040916000000 16
 
0.2%
20040617000000 16
 
0.2%
20040915000000 14
 
0.1%
20040805000000 13
 
0.1%
20040918000000 13
 
0.1%
20041103000000 12
 
0.1%
Other values (9288) 9771
97.7%
ValueCountFrequency (%)
20040324000000 1
 
< 0.1%
20040401000000 2
< 0.1%
20040407000000 1
 
< 0.1%
20040408000000 1
 
< 0.1%
20040409000000 1
 
< 0.1%
20040419000000 1
 
< 0.1%
20040426000000 2
< 0.1%
20040429000000 3
< 0.1%
20040430000000 1
 
< 0.1%
20040503000000 2
< 0.1%
ValueCountFrequency (%)
20220930174526 1
< 0.1%
20220930162644 1
< 0.1%
20220930143225 1
< 0.1%
20220930111927 1
< 0.1%
20220930091839 1
< 0.1%
20220930041509 1
< 0.1%
20220929151304 1
< 0.1%
20220929135143 1
< 0.1%
20220929130546 1
< 0.1%
20220929114401 1
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
I
7067 
U
2933 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowU
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 7067
70.7%
U 2933
29.3%

Length

2024-04-19T15:59:00.160823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:00.251088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 7067
70.7%
u 2933
29.3%
Distinct1487
Distinct (%)14.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-08-31 23:59:59
Maximum2022-10-02 02:40:00
2024-04-19T15:59:00.350275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:59:00.772622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업태구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct6612
Distinct (%)67.7%
Missing231
Missing (%)2.3%
Infinite0
Infinite (%)0.0%
Mean343089.04
Minimum200018.86
Maximum358213.06
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:00.922368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum200018.86
5-th percentile334363.5
Q1339601.47
median343191.57
Q3346516.67
95-th percentile353900.23
Maximum358213.06
Range158194.2
Interquartile range (IQR)6915.1934

Descriptive statistics

Standard deviation5603.4558
Coefficient of variation (CV)0.016332366
Kurtosis43.298925
Mean343089.04
Median Absolute Deviation (MAD)3487.8011
Skewness-1.6071249
Sum3.3516368 × 109
Variance31398717
MonotonicityNot monotonic
2024-04-19T15:59:01.053819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
344047.164924 40
 
0.4%
344686.259338 32
 
0.3%
343588.735555 28
 
0.3%
347037.24197 27
 
0.3%
339047.793379 26
 
0.3%
347950.194975 21
 
0.2%
346916.265263 21
 
0.2%
340548.96997 20
 
0.2%
345032.238221 19
 
0.2%
345383.649328 17
 
0.2%
Other values (6602) 9518
95.2%
(Missing) 231
 
2.3%
ValueCountFrequency (%)
200018.855604638 1
< 0.1%
325733.855686 1
< 0.1%
325937.566452 1
< 0.1%
326030.831594512 1
< 0.1%
326284.270352 1
< 0.1%
326534.608033 1
< 0.1%
326874.570932 1
< 0.1%
327183.454587 1
< 0.1%
327220.136878 1
< 0.1%
327251.925896 2
< 0.1%
ValueCountFrequency (%)
358213.056494 2
< 0.1%
358060.647419 1
 
< 0.1%
358053.700823 1
 
< 0.1%
357934.27702 1
 
< 0.1%
357835.663303 1
 
< 0.1%
357672.59192 1
 
< 0.1%
357646.727978 3
< 0.1%
357494.773215 1
 
< 0.1%
357073.974572 1
 
< 0.1%
356923.487472 1
 
< 0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct6612
Distinct (%)67.7%
Missing231
Missing (%)2.3%
Infinite0
Infinite (%)0.0%
Mean263323.35
Minimum237059.21
Maximum454580.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:01.188875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum237059.21
5-th percentile257530.38
Q1261176.6
median263315.04
Q3265407.4
95-th percentile271586.56
Maximum454580.8
Range217521.59
Interquartile range (IQR)4230.808

Descriptive statistics

Standard deviation4969.0853
Coefficient of variation (CV)0.018870659
Kurtosis227.43205
Mean263323.35
Median Absolute Deviation (MAD)2110.5999
Skewness5.0749245
Sum2.5724058 × 109
Variance24691809
MonotonicityNot monotonic
2024-04-19T15:59:01.320652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
265132.987974 40
 
0.4%
263958.880864 32
 
0.3%
264119.01075 28
 
0.3%
265407.404337 27
 
0.3%
258741.90218 26
 
0.3%
265687.62574 21
 
0.2%
263443.713906 21
 
0.2%
272470.830195 20
 
0.2%
262949.871621 19
 
0.2%
267901.294823 17
 
0.2%
Other values (6602) 9518
95.2%
(Missing) 231
 
2.3%
ValueCountFrequency (%)
237059.208299 1
 
< 0.1%
238029.007127 1
 
< 0.1%
238126.933814 1
 
< 0.1%
238381.91462 1
 
< 0.1%
239311.580853 1
 
< 0.1%
240182.187266 1
 
< 0.1%
240270.873923 2
 
< 0.1%
240358.722944 5
0.1%
240365.71605 1
 
< 0.1%
240473.571874 5
0.1%
ValueCountFrequency (%)
454580.797415934 1
< 0.1%
278120.260025 1
< 0.1%
278042.64925 1
< 0.1%
277923.610317 1
< 0.1%
277799.708684 2
< 0.1%
277798.822544 1
< 0.1%
277784.134758 1
< 0.1%
277755.206408 1
< 0.1%
277718.159532176 1
< 0.1%
276945.373485 1
< 0.1%

위생업태명
Categorical

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
영업장판매
4748 
전자상거래(통신판매업)
2265 
방문판매
1231 
통신판매
1053 
다단계판매
608 
Other values (5)
 
95

Length

Max length14
Median length5
Mean length6.3989
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전자상거래(통신판매업)
2nd row전자상거래(통신판매업)
3rd row방문판매
4th row방문판매
5th row전자상거래(통신판매업)

Common Values

ValueCountFrequency (%)
영업장판매 4748
47.5%
전자상거래(통신판매업) 2265
22.7%
방문판매 1231
 
12.3%
통신판매 1053
 
10.5%
다단계판매 608
 
6.1%
기타(복합 등) 34
 
0.3%
기타 건강기능식품일반판매업 30
 
0.3%
도매업(유통) 19
 
0.2%
전화권유판매 10
 
0.1%
<NA> 2
 
< 0.1%

Length

2024-04-19T15:59:01.436850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:01.548802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업장판매 4748
47.2%
전자상거래(통신판매업 2265
22.5%
방문판매 1231
 
12.2%
통신판매 1053
 
10.5%
다단계판매 608
 
6.0%
기타(복합 34
 
0.3%
34
 
0.3%
기타 30
 
0.3%
건강기능식품일반판매업 30
 
0.3%
도매업(유통 19
 
0.2%
Other values (2) 12
 
0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8476 
0
1524 

Length

Max length4
Median length4
Mean length3.5428
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8476
84.8%
0 1524
 
15.2%

Length

2024-04-19T15:59:01.692441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:01.786138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8476
84.8%
0 1524
 
15.2%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8476 
0
1524 

Length

Max length4
Median length4
Mean length3.5428
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8476
84.8%
0 1524
 
15.2%

Length

2024-04-19T15:59:01.928124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:02.024137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8476
84.8%
0 1524
 
15.2%

영업장주변구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

등급구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

급수시설구분명
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8550 
상수도전용
1445 
간이상수도
 
3
전용상수도(특정시설의 자가용 수도)
 
2

Length

Max length19
Median length4
Mean length4.1478
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8550
85.5%
상수도전용 1445
 
14.4%
간이상수도 3
 
< 0.1%
전용상수도(특정시설의 자가용 수도) 2
 
< 0.1%

Length

2024-04-19T15:59:02.137522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:02.230919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8550
85.5%
상수도전용 1445
 
14.4%
간이상수도 3
 
< 0.1%
전용상수도(특정시설의 2
 
< 0.1%
자가용 2
 
< 0.1%
수도 2
 
< 0.1%

총직원수
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8486 
0
1514 

Length

Max length4
Median length4
Mean length3.5458
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8486
84.9%
0 1514
 
15.1%

Length

2024-04-19T15:59:02.341327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:02.449117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8486
84.9%
0 1514
 
15.1%

본사직원수
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct6
Distinct (%)0.1%
Missing3700
Missing (%)37.0%
Infinite0
Infinite (%)0.0%
Mean0.0066666667
Minimum0
Maximum6
Zeros6275
Zeros (%)62.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:02.527376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.1295516
Coefficient of variation (CV)19.43274
Kurtosis1012.2036
Mean0.0066666667
Median Absolute Deviation (MAD)0
Skewness28.329541
Sum42
Variance0.016783616
MonotonicityNot monotonic
2024-04-19T15:59:02.621317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 6275
62.7%
1 16
 
0.2%
2 5
 
0.1%
3 2
 
< 0.1%
4 1
 
< 0.1%
6 1
 
< 0.1%
(Missing) 3700
37.0%
ValueCountFrequency (%)
0 6275
62.7%
1 16
 
0.2%
2 5
 
0.1%
3 2
 
< 0.1%
4 1
 
< 0.1%
6 1
 
< 0.1%
ValueCountFrequency (%)
6 1
 
< 0.1%
4 1
 
< 0.1%
3 2
 
< 0.1%
2 5
 
0.1%
1 16
 
0.2%
0 6275
62.7%

공장사무직직원수
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct11
Distinct (%)0.2%
Missing3701
Missing (%)37.0%
Infinite0
Infinite (%)0.0%
Mean0.057151929
Minimum0
Maximum23
Zeros6081
Zeros (%)60.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:02.713322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum23
Range23
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.53094541
Coefficient of variation (CV)9.2900699
Kurtosis974.46322
Mean0.057151929
Median Absolute Deviation (MAD)0
Skewness26.812005
Sum360
Variance0.28190303
MonotonicityNot monotonic
2024-04-19T15:59:02.807680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
0 6081
60.8%
1 172
 
1.7%
2 23
 
0.2%
4 11
 
0.1%
3 5
 
0.1%
5 2
 
< 0.1%
23 1
 
< 0.1%
6 1
 
< 0.1%
14 1
 
< 0.1%
20 1
 
< 0.1%
(Missing) 3701
37.0%
ValueCountFrequency (%)
0 6081
60.8%
1 172
 
1.7%
2 23
 
0.2%
3 5
 
0.1%
4 11
 
0.1%
5 2
 
< 0.1%
6 1
 
< 0.1%
10 1
 
< 0.1%
14 1
 
< 0.1%
20 1
 
< 0.1%
ValueCountFrequency (%)
23 1
 
< 0.1%
20 1
 
< 0.1%
14 1
 
< 0.1%
10 1
 
< 0.1%
6 1
 
< 0.1%
5 2
 
< 0.1%
4 11
 
0.1%
3 5
 
0.1%
2 23
 
0.2%
1 172
1.7%

공장판매직직원수
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct28
Distinct (%)0.4%
Missing3700
Missing (%)37.0%
Infinite0
Infinite (%)0.0%
Mean0.33126984
Minimum0
Maximum102
Zeros5631
Zeros (%)56.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:02.910602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum102
Range102
Interquartile range (IQR)0

Descriptive statistics

Standard deviation3.4598287
Coefficient of variation (CV)10.44414
Kurtosis565.58106
Mean0.33126984
Median Absolute Deviation (MAD)0
Skewness22.323269
Sum2087
Variance11.970414
MonotonicityNot monotonic
2024-04-19T15:59:03.024929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
0 5631
56.3%
1 520
 
5.2%
2 61
 
0.6%
3 30
 
0.3%
4 9
 
0.1%
10 6
 
0.1%
5 6
 
0.1%
6 5
 
0.1%
20 4
 
< 0.1%
94 3
 
< 0.1%
Other values (18) 25
 
0.2%
(Missing) 3700
37.0%
ValueCountFrequency (%)
0 5631
56.3%
1 520
 
5.2%
2 61
 
0.6%
3 30
 
0.3%
4 9
 
0.1%
5 6
 
0.1%
6 5
 
0.1%
7 3
 
< 0.1%
9 1
 
< 0.1%
10 6
 
0.1%
ValueCountFrequency (%)
102 1
 
< 0.1%
100 1
 
< 0.1%
94 3
< 0.1%
80 1
 
< 0.1%
58 1
 
< 0.1%
52 1
 
< 0.1%
50 1
 
< 0.1%
45 1
 
< 0.1%
40 1
 
< 0.1%
36 1
 
< 0.1%

공장생산직직원수
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
6294 
<NA>
3700 
1
 
4
9
 
1
6
 
1

Length

Max length4
Median length1
Mean length2.11
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
0 6294
62.9%
<NA> 3700
37.0%
1 4
 
< 0.1%
9 1
 
< 0.1%
6 1
 
< 0.1%

Length

2024-04-19T15:59:03.144798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:03.252339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 6294
62.9%
na 3700
37.0%
1 4
 
< 0.1%
9 1
 
< 0.1%
6 1
 
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8306 
임대
860 
자가
834 

Length

Max length4
Median length4
Mean length3.6612
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8306
83.1%
임대 860
 
8.6%
자가 834
 
8.3%

Length

2024-04-19T15:59:03.371422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T15:59:03.509863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8306
83.1%
임대 860
 
8.6%
자가 834
 
8.3%

보증액
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct8
Distinct (%)0.5%
Missing8444
Missing (%)84.4%
Infinite0
Infinite (%)0.0%
Mean87403.92
Minimum0
Maximum1 × 108
Zeros1548
Zeros (%)15.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:03.595763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum1 × 108
Range1 × 108
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2571437.6
Coefficient of variation (CV)29.420163
Kurtosis1468.7176
Mean87403.92
Median Absolute Deviation (MAD)0
Skewness37.868492
Sum1.360005 × 108
Variance6.6122913 × 1012
MonotonicityNot monotonic
2024-04-19T15:59:03.703328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 1548
 
15.5%
10000000 2
 
< 0.1%
500 1
 
< 0.1%
1000000 1
 
< 0.1%
2000000 1
 
< 0.1%
100000000 1
 
< 0.1%
5000000 1
 
< 0.1%
8000000 1
 
< 0.1%
(Missing) 8444
84.4%
ValueCountFrequency (%)
0 1548
15.5%
500 1
 
< 0.1%
1000000 1
 
< 0.1%
2000000 1
 
< 0.1%
5000000 1
 
< 0.1%
8000000 1
 
< 0.1%
10000000 2
 
< 0.1%
100000000 1
 
< 0.1%
ValueCountFrequency (%)
100000000 1
 
< 0.1%
10000000 2
 
< 0.1%
8000000 1
 
< 0.1%
5000000 1
 
< 0.1%
2000000 1
 
< 0.1%
1000000 1
 
< 0.1%
500 1
 
< 0.1%
0 1548
15.5%

월세액
Real number (ℝ)

MISSING  ZEROS 

Distinct8
Distinct (%)0.5%
Missing8444
Missing (%)84.4%
Infinite0
Infinite (%)0.0%
Mean2461.4653
Minimum0
Maximum1000000
Zeros1548
Zeros (%)15.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:03.804456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum1000000
Range1000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation41942.178
Coefficient of variation (CV)17.039516
Kurtosis393.2729
Mean2461.4653
Median Absolute Deviation (MAD)0
Skewness19.295714
Sum3830040
Variance1.7591463 × 109
MonotonicityNot monotonic
2024-04-19T15:59:03.901200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0 1548
 
15.5%
800000 2
 
< 0.1%
40 1
 
< 0.1%
300000 1
 
< 0.1%
180000 1
 
< 0.1%
1000000 1
 
< 0.1%
200000 1
 
< 0.1%
550000 1
 
< 0.1%
(Missing) 8444
84.4%
ValueCountFrequency (%)
0 1548
15.5%
40 1
 
< 0.1%
180000 1
 
< 0.1%
200000 1
 
< 0.1%
300000 1
 
< 0.1%
550000 1
 
< 0.1%
800000 2
 
< 0.1%
1000000 1
 
< 0.1%
ValueCountFrequency (%)
1000000 1
 
< 0.1%
800000 2
 
< 0.1%
550000 1
 
< 0.1%
300000 1
 
< 0.1%
200000 1
 
< 0.1%
180000 1
 
< 0.1%
40 1
 
< 0.1%
0 1548
15.5%

다중이용업소여부
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size97.7 KiB
False
9998 
(Missing)
 
2
ValueCountFrequency (%)
False 9998
> 99.9%
(Missing) 2
 
< 0.1%
2024-04-19T15:59:03.990905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

시설총규모
Real number (ℝ)

SKEWED  ZEROS 

Distinct114
Distinct (%)1.1%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean0.60758952
Minimum0
Maximum407.45
Zeros9852
Zeros (%)98.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:59:04.091057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum407.45
Range407.45
Interquartile range (IQR)0

Descriptive statistics

Standard deviation9.1246828
Coefficient of variation (CV)15.017841
Kurtosis755.92632
Mean0.60758952
Median Absolute Deviation (MAD)0
Skewness24.212055
Sum6074.68
Variance83.259836
MonotonicityNot monotonic
2024-04-19T15:59:04.243042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 9852
98.5%
3.3 9
 
0.1%
1.0 4
 
< 0.1%
10.0 4
 
< 0.1%
15.0 4
 
< 0.1%
20.0 3
 
< 0.1%
3.0 3
 
< 0.1%
30.0 3
 
< 0.1%
8.64 3
 
< 0.1%
5.6 2
 
< 0.1%
Other values (104) 111
 
1.1%
ValueCountFrequency (%)
0.0 9852
98.5%
0.4 1
 
< 0.1%
0.55 1
 
< 0.1%
0.9 1
 
< 0.1%
0.96 1
 
< 0.1%
1.0 4
 
< 0.1%
1.5 1
 
< 0.1%
2.0 1
 
< 0.1%
2.18 1
 
< 0.1%
2.4 1
 
< 0.1%
ValueCountFrequency (%)
407.45 1
< 0.1%
312.0 1
< 0.1%
250.98 1
< 0.1%
243.2 1
< 0.1%
232.51 1
< 0.1%
162.48 1
< 0.1%
160.0 1
< 0.1%
155.5 1
< 0.1%
149.98 1
< 0.1%
148.17 1
< 0.1%

전통업소지정번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

전통업소주된음식
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

홈페이지
Text

MISSING 

Distinct21
Distinct (%)91.3%
Missing9977
Missing (%)99.8%
Memory size156.2 KiB
2024-04-19T15:59:04.451070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length24
Mean length17.130435
Min length4

Characters and Unicode

Total characters394
Distinct characters61
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)87.0%

Sample

1st rowG-market, Auction, 11번가에서 판매중(판매자명:더코넬)
2nd rowhttps://smartstore.naver.com/beautysmile
3rd row11번가, 옥션 등
4th rowwww.healthocean-korea.com
5th row오픈마켓
ValueCountFrequency (%)
오픈마켓 4
 
12.1%
옥션 2
 
6.1%
2
 
6.1%
판매중(판매자명:더코넬 1
 
3.0%
www.44nallary.com 1
 
3.0%
www.makelifebetter.co.kr 1
 
3.0%
www.oyeoldudal.co.kr 1
 
3.0%
www.tocxtoc.com 1
 
3.0%
지마켓 1
 
3.0%
오픈마켓(옥션 1
 
3.0%
Other values (18) 18
54.5%
2024-04-19T15:59:04.789179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
w 39
 
9.9%
. 34
 
8.6%
o 29
 
7.4%
c 20
 
5.1%
e 20
 
5.1%
m 19
 
4.8%
t 16
 
4.1%
r 16
 
4.1%
l 14
 
3.6%
a 14
 
3.6%
Other values (51) 173
43.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 254
64.5%
Other Letter 57
 
14.5%
Other Punctuation 48
 
12.2%
Decimal Number 14
 
3.6%
Space Separator 10
 
2.5%
Dash Punctuation 4
 
1.0%
Uppercase Letter 3
 
0.8%
Close Punctuation 2
 
0.5%
Open Punctuation 2
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 39
15.4%
o 29
11.4%
c 20
 
7.9%
e 20
 
7.9%
m 19
 
7.5%
t 16
 
6.3%
r 16
 
6.3%
l 14
 
5.5%
a 14
 
5.5%
s 9
 
3.5%
Other values (14) 58
22.8%
Other Letter
ValueCountFrequency (%)
8
14.0%
8
14.0%
6
 
10.5%
6
 
10.5%
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
Other values (14) 15
26.3%
Other Punctuation
ValueCountFrequency (%)
. 34
70.8%
/ 5
 
10.4%
, 5
 
10.4%
: 4
 
8.3%
Decimal Number
ValueCountFrequency (%)
4 7
50.0%
1 5
35.7%
0 2
 
14.3%
Uppercase Letter
ValueCountFrequency (%)
G 2
66.7%
A 1
33.3%
Space Separator
ValueCountFrequency (%)
10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 257
65.2%
Common 80
 
20.3%
Hangul 57
 
14.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 39
15.2%
o 29
11.3%
c 20
 
7.8%
e 20
 
7.8%
m 19
 
7.4%
t 16
 
6.2%
r 16
 
6.2%
l 14
 
5.4%
a 14
 
5.4%
s 9
 
3.5%
Other values (16) 61
23.7%
Hangul
ValueCountFrequency (%)
8
14.0%
8
14.0%
6
 
10.5%
6
 
10.5%
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
Other values (14) 15
26.3%
Common
ValueCountFrequency (%)
. 34
42.5%
10
 
12.5%
4 7
 
8.8%
/ 5
 
6.2%
1 5
 
6.2%
, 5
 
6.2%
: 4
 
5.0%
- 4
 
5.0%
) 2
 
2.5%
( 2
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 337
85.5%
Hangul 57
 
14.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
w 39
 
11.6%
. 34
 
10.1%
o 29
 
8.6%
c 20
 
5.9%
e 20
 
5.9%
m 19
 
5.6%
t 16
 
4.7%
r 16
 
4.7%
l 14
 
4.2%
a 14
 
4.2%
Other values (27) 116
34.4%
Hangul
ValueCountFrequency (%)
8
14.0%
8
14.0%
6
 
10.5%
6
 
10.5%
3
 
5.3%
3
 
5.3%
2
 
3.5%
2
 
3.5%
2
 
3.5%
2
 
3.5%
Other values (14) 15
26.3%

Sample

번호개방서비스명개방서비스아이디개방자치단체코드관리번호인허가일자인허가취소일자영업상태구분코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자소재지전화소재지면적소재지우편번호소재지전체주소도로명전체주소도로명우편번호사업장명최종수정시점데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)위생업태명남성종사자수여성종사자수영업장주변구분명등급구분명급수시설구분명총직원수본사직원수공장사무직직원수공장판매직직원수공장생산직직원수건물소유구분명보증액월세액다중이용업소여부시설총규모전통업소지정번호전통업소주된음식홈페이지
10111012건강기능식품일반판매업07_22_03_P34100003410000-134-2022-0004520220616<NA>1영업/정상1영업<NA><NA><NA><NA><NA><NA>700191대구광역시 중구 종로*가 ****-****대구광역시 중구 종로 **-*, *층 ****호 (종로*가)41934자유팜20220616172042I2022-06-18 00:22:31.0<NA>343614.806882264466.012002전자상거래(통신판매업)00<NA><NA><NA>00000<NA>00N0.0<NA><NA><NA>
1181711818건강기능식품일반판매업07_22_03_P34700003470000-134-2020-0018520201208<NA>3폐업2폐업20211224<NA><NA><NA><NA><NA>704828대구광역시 달서구 월성동 **-* 월성청구코아대구광역시 달서구 학산로 **, 월성청구코아 *층 ***호 (월성동)42732허브&해독나라20211224131430U2021-12-26 02:40:00.0<NA>338293.86158259678.714164전자상거래(통신판매업)00<NA><NA><NA>00000<NA>00N0.0<NA><NA><NA>
59235924건강기능식품일반판매업07_22_03_P34400003440000-134-2016-0003320161117<NA>3폐업2폐업20171206<NA><NA><NA><NA>.00705836대구광역시 남구 이천동 ***-**번지대구광역시 남구 희망로*길 **, ***동 ***호 (이천동, 대성유니드아파트)42437시너지코리아20171206142123I2018-08-31 23:59:59.0<NA>344625.531604262123.754405방문판매<NA><NA><NA><NA><NA><NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
1539115392건강기능식품일반판매업07_22_03_P34800003480000-134-2015-0001020150520<NA>3폐업2폐업20160310<NA><NA><NA>053 584 570967.39711815대구광역시 달성군 다사읍 죽곡리 ***-*번지대구광역시 달성군 다사읍 죽곡*길 *-*, *층42918동인비 다사지사20150520165102I2018-08-31 23:59:59.0<NA>332227.145212262912.161541방문판매<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
40324033건강기능식품일반판매업07_22_03_P34300003430000-134-2014-0001120140304<NA>3폐업2폐업20171204<NA><NA><NA><NA><NA>703845대구광역시 서구 평리동 ****-**번지대구광역시 서구 통학로**길 **-** (평리동)41821콩지몰20171204170750I2018-08-31 23:59:59.0<NA>341347.124148264216.774896전자상거래(통신판매업)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
1132511326건강기능식품일반판매업07_22_03_P34600003460000-134-2020-0010020200910<NA>1영업/정상1영업<NA><NA><NA><NA><NA><NA>706850대구광역시 수성구 황금동 *** 캐슬골드파크*단지 **층 ****동 ****호대구광역시 수성구 청수로 ***, ****동 **층 ****호 (황금동, 캐슬골드파크*단지)42178에스엠상사20200910111039I2020-09-12 00:23:12.0<NA>347385.97527261131.856154전자상거래(통신판매업)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
1318313184건강기능식품일반판매업07_22_03_P34700003470000-134-2010-0022320101029<NA>3폐업2폐업20130802<NA><NA><NA>053 621 6219.00704971대구광역시 달서구 본리동 ***-*번지 본리*차롯데캐슬 ***동 ****호대구광역시 달서구 대명천로 ***, ***동 **층 ****호 (본리동, 본리*차롯데캐슬)42682날씬콩20120504093927I2018-08-31 23:59:59.0<NA>339601.473313260947.99864통신판매<NA><NA><NA><NA><NA><NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
1225812259건강기능식품일반판매업07_22_03_P34700003470000-134-2009-0021720091026<NA>3폐업2폐업20091103<NA><NA><NA><NA>1.50704945대구광역시 달서구 장기동 ***-**번지 (지상*층)<NA><NA>신선바이오테크20091026105342I2018-08-31 23:59:59.0<NA>338338.83776261314.063755영업장판매<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
11511152건강기능식품일반판매업07_22_03_P34100003410000-134-2018-0000420180116<NA>1영업/정상1영업<NA><NA><NA><NA><NA>5.00700320대구광역시 중구 대신동 ****-****번지 대신지하상가 나열**호대구광역시 중구 국채보상로 지하 ***, 대신지하상가 나열**호 (대신동)41925경남하이패션20180213173412I2018-08-31 23:59:59.0<NA>342797.374532264457.420146영업장판매<NA><NA><NA><NA><NA><NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
62886289건강기능식품일반판매업07_22_03_P34500003450000-134-2018-0006020180816<NA>3폐업2폐업20191105<NA><NA><NA><NA>56.00702310대구광역시 북구 사수동 ***번지대구광역시 북구 한강로 **, *층 (사수동)41598한라통신20191105144402U2019-11-07 02:40:00.0<NA>336725.369634267373.691132영업장판매<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>자가<NA><NA>N0.0<NA><NA><NA>
번호개방서비스명개방서비스아이디개방자치단체코드관리번호인허가일자인허가취소일자영업상태구분코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자소재지전화소재지면적소재지우편번호소재지전체주소도로명전체주소도로명우편번호사업장명최종수정시점데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)위생업태명남성종사자수여성종사자수영업장주변구분명등급구분명급수시설구분명총직원수본사직원수공장사무직직원수공장판매직직원수공장생산직직원수건물소유구분명보증액월세액다중이용업소여부시설총규모전통업소지정번호전통업소주된음식홈페이지
22442245건강기능식품일반판매업07_22_03_P34200003420000-134-2011-0011220110714<NA>3폐업2폐업20131008<NA><NA><NA>053 745 62661.00701826대구광역시 동구 신천동 ***-*번지대구광역시 동구 송라로**길 ** (신천동)41257우리들체어대구수성대리점20111005153435I2018-08-31 23:59:59.0<NA>346456.854569264757.627085영업장판매<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
1586715868건강기능식품일반판매업07_22_03_P34800003480000-134-2004-0002820040618<NA>3폐업2폐업20111228<NA><NA><NA>000006119794<NA>711873대구광역시 달성군 현풍면 중리 ***번지 학산APT ***동 ***호<NA><NA>한국암웨이양귀옥20040930000000I2018-08-31 23:59:59.0<NA>330855.388357244659.008667다단계판매<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
1022910230건강기능식품일반판매업07_22_03_P34600003460000-134-2016-0000120160104<NA>3폐업2폐업20171130<NA><NA><NA>053 943 115510.00706805대구광역시 수성구 만촌동 ***-**번지대구광역시 수성구 무열로 ***, *층 (만촌동)42047투투20171130154446I2018-08-31 23:59:59.0<NA>348797.166064264617.334452전자상거래(통신판매업)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
52015202건강기능식품일반판매업07_22_03_P34400003440000-134-2007-0001020070406<NA>3폐업2폐업20070905<NA><NA><NA><NA>15.00705830대구광역시 남구 봉덕동 ***-**번지<NA><NA>경희통상20070406000000I2018-08-31 23:59:59.0<NA>343967.871954261752.0432영업장판매<NA><NA><NA><NA>상수도전용<NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
44094410건강기능식품일반판매업07_22_03_P34300003430000-134-2020-0000320200122<NA>3폐업2폐업20210426<NA><NA><NA><NA>.50703812대구광역시 서구 비산동 ***-*대구광역시 서구 국채보상로 ***, *층 (비산동)41799온누리헬스케어20210426152948U2021-04-28 02:40:00.0<NA>341852.829879264733.15103영업장판매<NA><NA><NA><NA><NA><NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
36933694건강기능식품일반판매업07_22_03_P34300003430000-134-2013-0004820130906<NA>3폐업2폐업20180119<NA><NA><NA><NA>29.40703806대구광역시 서구 내당동 ***-**번지대구광역시 서구 달구벌대로***길 ** (내당동)41870다이어트클럽 허브와콩순이20180119115716I2018-08-31 23:59:59.0<NA>341802.883255263712.288803영업장판매<NA><NA><NA><NA>상수도전용<NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
1220812209건강기능식품일반판매업07_22_03_P34700003470000-134-2009-0017120090825<NA>3폐업2폐업20190123<NA><NA><NA><NA><NA>704923대구광역시 달서구 용산동 ***-**번지 용산롯데캐슬그랜드 ***동 ***호대구광역시 달서구 장산남로 **, ***동 ***호 (용산동,용산롯데캐슬그랜드)42637슬림퀸20190123181632U2019-01-25 02:40:00.0<NA>337841.507465262262.872036통신판매<NA><NA><NA><NA><NA><NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
36013602건강기능식품일반판매업07_22_03_P34200003420000-134-2020-0002320200309<NA>1영업/정상1영업<NA><NA><NA><NA><NA><NA>701866대구광역시 동구 입석동 ***-**번지대구광역시 동구 입석로*길 *, *층 (입석동)41141자이시티20200309181708I2020-03-11 00:23:22.0<NA>348494.742946267144.252729방문판매<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>
1491214913건강기능식품일반판매업07_22_03_P34700003470000-134-2010-0012620100609<NA>1영업/정상1영업<NA><NA><NA><NA>053 655 00781.00704826대구광역시 달서구 송현동 ****-**번지 (지상*층)대구광역시 달서구 월배로**길 ** (송현동,(지상*층))42743녹십초20100616103852I2018-08-31 23:59:59.0<NA>339597.853657259693.442372영업장판매<NA><NA><NA><NA><NA><NA>0000<NA><NA><NA>N0.0<NA><NA><NA>
110111건강기능식품일반판매업07_22_03_P34100003410000-134-2011-0005920110720<NA>3폐업2폐업20120831<NA><NA><NA>053 4258400<NA>700412대구광역시 중구 삼덕동*가 ****-****번지 지상*층대구광역시 중구 달구벌대로***길 *-* (삼덕동*가)41948뚱녀날다20120203100409I2018-08-31 23:59:59.0<NA>344873.42969263753.680341전자상거래(통신판매업)<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N0.0<NA><NA><NA>