Overview

Dataset statistics

Number of variables27
Number of observations4327
Missing cells37022
Missing cells (%)31.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory955.1 KiB
Average record size in memory226.0 B

Variable types

Categorical6
Numeric5
Text7
DateTime6
Unsupported3

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),지정일자,민원종류명
Author동대문구
URLhttps://data.seoul.go.kr/dataList/OA-19907/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
상세영업상태명 is highly imbalanced (53.9%)Imbalance
인허가취소일자 has 3892 (89.9%) missing valuesMissing
폐업일자 has 1307 (30.2%) missing valuesMissing
휴업시작일자 has 4233 (97.8%) missing valuesMissing
휴업종료일자 has 4236 (97.9%) missing valuesMissing
재개업일자 has 4327 (100.0%) missing valuesMissing
전화번호 has 1506 (34.8%) missing valuesMissing
소재지면적 has 4327 (100.0%) missing valuesMissing
소재지우편번호 has 3171 (73.3%) missing valuesMissing
도로명주소 has 504 (11.6%) missing valuesMissing
도로명우편번호 has 2892 (66.8%) missing valuesMissing
업태구분명 has 4327 (100.0%) missing valuesMissing
좌표정보(X) has 294 (6.8%) missing valuesMissing
좌표정보(Y) has 294 (6.8%) missing valuesMissing
지정일자 has 1712 (39.6%) missing valuesMissing
관리번호 has unique valuesUnique
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
상세영업상태코드 has 809 (18.7%) zerosZeros

Reproduction

Analysis started2024-05-11 06:38:29.731236
Analysis finished2024-05-11 06:38:32.191054
Duration2.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
3050000
4327 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3050000
2nd row3050000
3rd row3050000
4th row3050000
5th row3050000

Common Values

ValueCountFrequency (%)
3050000 4327
100.0%

Length

2024-05-11T06:38:32.308619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T06:38:32.506973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3050000 4327
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct4327
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0063286 × 1018
Minimum1.968305 × 1018
Maximum2.024305 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.2 KiB
2024-05-11T06:38:32.986318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.968305 × 1018
5-th percentile1.993305 × 1018
Q12.000305 × 1018
median2.005305 × 1018
Q32.013305 × 1018
95-th percentile2.020305 × 1018
Maximum2.024305 × 1018
Range5.6000013 × 1016
Interquartile range (IQR)1.3000006 × 1016

Descriptive statistics

Standard deviation8.655819 × 1015
Coefficient of variation (CV)0.0043142579
Kurtosis0.14271683
Mean2.0063286 × 1018
Median Absolute Deviation (MAD)6.0000018 × 1015
Skewness-0.2303175
Sum-7.0326788 × 1018
Variance7.4923203 × 1031
MonotonicityStrictly increasing
2024-05-11T06:38:33.472090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1968305008205605005 1
 
< 0.1%
2010305010005600101 1
 
< 0.1%
2010305010005600087 1
 
< 0.1%
2010305010005600088 1
 
< 0.1%
2010305010005600089 1
 
< 0.1%
2010305010005600090 1
 
< 0.1%
2010305010005600091 1
 
< 0.1%
2010305010005600092 1
 
< 0.1%
2010305010005600093 1
 
< 0.1%
2010305010005600094 1
 
< 0.1%
Other values (4317) 4317
99.8%
ValueCountFrequency (%)
1968305008205605005 1
< 0.1%
1972305008205613006 1
< 0.1%
1972305008205625001 1
< 0.1%
1973305008205613045 1
< 0.1%
1973305008205624025 1
< 0.1%
1974305008205623010 1
< 0.1%
1976305008205624027 1
< 0.1%
1976305008205625031 1
< 0.1%
1978305008205602031 1
< 0.1%
1978305008205625012 1
< 0.1%
ValueCountFrequency (%)
2024305021005600023 1
< 0.1%
2024305021005600022 1
< 0.1%
2024305021005600021 1
< 0.1%
2024305021005600020 1
< 0.1%
2024305021005600019 1
< 0.1%
2024305021005600018 1
< 0.1%
2024305021005600017 1
< 0.1%
2024305021005600016 1
< 0.1%
2024305021005600015 1
< 0.1%
2024305021005600014 1
< 0.1%
Distinct2678
Distinct (%)61.9%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
2024-05-11T06:38:34.109922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length8.1215623
Min length8

Characters and Unicode

Total characters35142
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1773 ?
Unique (%)41.0%

Sample

1st row19680630
2nd row19720630
3rd row19720630
4th row19730518
5th row19730630
ValueCountFrequency (%)
19990101 74
 
1.7%
19981210 22
 
0.5%
19981219 21
 
0.5%
19981221 19
 
0.4%
19790630 18
 
0.4%
19981217 17
 
0.4%
19981029 16
 
0.4%
19981102 15
 
0.3%
19981212 13
 
0.3%
19981209 12
 
0.3%
Other values (2668) 4100
94.8%
2024-05-11T06:38:35.194674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10482
29.8%
1 6902
19.6%
2 6505
18.5%
9 3297
 
9.4%
8 1587
 
4.5%
3 1471
 
4.2%
4 1160
 
3.3%
7 1125
 
3.2%
5 1039
 
3.0%
6 1032
 
2.9%
Other values (2) 542
 
1.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 34600
98.5%
Dash Punctuation 526
 
1.5%
Space Separator 16
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10482
30.3%
1 6902
19.9%
2 6505
18.8%
9 3297
 
9.5%
8 1587
 
4.6%
3 1471
 
4.3%
4 1160
 
3.4%
7 1125
 
3.3%
5 1039
 
3.0%
6 1032
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 526
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 35142
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10482
29.8%
1 6902
19.6%
2 6505
18.5%
9 3297
 
9.4%
8 1587
 
4.5%
3 1471
 
4.2%
4 1160
 
3.3%
7 1125
 
3.2%
5 1039
 
3.0%
6 1032
 
2.9%
Other values (2) 542
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 35142
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10482
29.8%
1 6902
19.6%
2 6505
18.5%
9 3297
 
9.4%
8 1587
 
4.5%
3 1471
 
4.2%
4 1160
 
3.3%
7 1125
 
3.2%
5 1039
 
3.0%
6 1032
 
2.9%
Other values (2) 542
 
1.5%

인허가취소일자
Date

MISSING 

Distinct141
Distinct (%)32.4%
Missing3892
Missing (%)89.9%
Memory size33.9 KiB
Minimum2001-04-27 00:00:00
Maximum2024-03-06 00:00:00
2024-05-11T06:38:35.610327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T06:38:36.003771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
3
3020 
1
809 
4
495 
2
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row4
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 3020
69.8%
1 809
 
18.7%
4 495
 
11.4%
2 3
 
0.1%

Length

2024-05-11T06:38:36.252574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T06:38:36.575329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 3020
69.8%
1 809
 
18.7%
4 495
 
11.4%
2 3
 
0.1%

영업상태명
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
폐업
3020 
영업/정상
809 
취소/말소/만료/정지/중지
495 
휴업
 
3

Length

Max length14
Median length2
Mean length3.9336723
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업
2nd row취소/말소/만료/정지/중지
3rd row폐업
4th row폐업
5th row폐업

Common Values

ValueCountFrequency (%)
폐업 3020
69.8%
영업/정상 809
 
18.7%
취소/말소/만료/정지/중지 495
 
11.4%
휴업 3
 
0.1%

Length

2024-05-11T06:38:36.970129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T06:38:37.234252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 3020
69.8%
영업/정상 809
 
18.7%
취소/말소/만료/정지/중지 495
 
11.4%
휴업 3
 
0.1%

상세영업상태코드
Real number (ℝ)

ZEROS 

Distinct7
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.8950774
Minimum0
Maximum6
Zeros809
Zeros (%)18.7%
Negative0
Negative (%)0.0%
Memory size38.2 KiB
2024-05-11T06:38:37.456305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median2
Q32
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.2140689
Coefficient of variation (CV)0.64064343
Kurtosis1.4815879
Mean1.8950774
Median Absolute Deviation (MAD)0
Skewness0.61930931
Sum8200
Variance1.4739633
MonotonicityNot monotonic
2024-05-11T06:38:37.697577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
2 3020
69.8%
0 809
 
18.7%
5 331
 
7.6%
3 158
 
3.7%
4 4
 
0.1%
1 3
 
0.1%
6 2
 
< 0.1%
ValueCountFrequency (%)
0 809
 
18.7%
1 3
 
0.1%
2 3020
69.8%
3 158
 
3.7%
4 4
 
0.1%
5 331
 
7.6%
6 2
 
< 0.1%
ValueCountFrequency (%)
6 2
 
< 0.1%
5 331
 
7.6%
4 4
 
0.1%
3 158
 
3.7%
2 3020
69.8%
1 3
 
0.1%
0 809
 
18.7%

상세영업상태명
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
폐업처리
3020 
정상영업
809 
지정취소
331 
직권취소
 
158
임시소매기간만료
 
4
Other values (2)
 
5

Length

Max length8
Median length4
Mean length4.0036977
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row폐업처리
2nd row직권취소
3rd row폐업처리
4th row폐업처리
5th row폐업처리

Common Values

ValueCountFrequency (%)
폐업처리 3020
69.8%
정상영업 809
 
18.7%
지정취소 331
 
7.6%
직권취소 158
 
3.7%
임시소매기간만료 4
 
0.1%
휴업처리 3
 
0.1%
영업정지 2
 
< 0.1%

Length

2024-05-11T06:38:38.063321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T06:38:38.446981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업처리 3020
69.8%
정상영업 809
 
18.7%
지정취소 331
 
7.6%
직권취소 158
 
3.7%
임시소매기간만료 4
 
0.1%
휴업처리 3
 
0.1%
영업정지 2
 
< 0.1%

폐업일자
Date

MISSING 

Distinct2195
Distinct (%)72.7%
Missing1307
Missing (%)30.2%
Memory size33.9 KiB
Minimum2000-01-16 00:00:00
Maximum2024-05-08 00:00:00
2024-05-11T06:38:38.838075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T06:38:39.235816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업시작일자
Date

MISSING 

Distinct87
Distinct (%)92.6%
Missing4233
Missing (%)97.8%
Memory size33.9 KiB
Minimum2001-02-09 00:00:00
Maximum2023-07-31 00:00:00
2024-05-11T06:38:39.703562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T06:38:40.076763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

휴업종료일자
Date

MISSING 

Distinct83
Distinct (%)91.2%
Missing4236
Missing (%)97.9%
Memory size33.9 KiB
Minimum2001-07-31 00:00:00
Maximum2027-12-31 00:00:00
2024-05-11T06:38:40.486983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T06:38:40.920950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4327
Missing (%)100.0%
Memory size38.2 KiB

전화번호
Text

MISSING 

Distinct2627
Distinct (%)93.1%
Missing1506
Missing (%)34.8%
Memory size33.9 KiB
2024-05-11T06:38:41.380481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length10.592343
Min length1

Characters and Unicode

Total characters29881
Distinct characters21
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2491 ?
Unique (%)88.3%

Sample

1st row000209244233
2nd row000222446132
3rd row000209674781
4th row000222134553
5th row000209674115
ValueCountFrequency (%)
02 350
 
11.0%
9669856 25
 
0.8%
1577-0711 10
 
0.3%
02-1577-0711 7
 
0.2%
000000000000 6
 
0.2%
9610234 6
 
0.2%
6
 
0.2%
9643470 5
 
0.2%
9590965 4
 
0.1%
9628052 4
 
0.1%
Other values (2611) 2771
86.8%
2024-05-11T06:38:42.287799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 6706
22.4%
2 6511
21.8%
9 2649
 
8.9%
4 2441
 
8.2%
6 2347
 
7.9%
1 1800
 
6.0%
5 1683
 
5.6%
3 1648
 
5.5%
7 1495
 
5.0%
8 1296
 
4.3%
Other values (11) 1305
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 28576
95.6%
Dash Punctuation 861
 
2.9%
Space Separator 378
 
1.3%
Close Punctuation 49
 
0.2%
Other Letter 12
 
< 0.1%
Other Punctuation 3
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6706
23.5%
2 6511
22.8%
9 2649
 
9.3%
4 2441
 
8.5%
6 2347
 
8.2%
1 1800
 
6.3%
5 1683
 
5.9%
3 1648
 
5.8%
7 1495
 
5.2%
8 1296
 
4.5%
Other Letter
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 861
100.0%
Space Separator
ValueCountFrequency (%)
378
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 29869
> 99.9%
Hangul 12
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6706
22.5%
2 6511
21.8%
9 2649
 
8.9%
4 2441
 
8.2%
6 2347
 
7.9%
1 1800
 
6.0%
5 1683
 
5.6%
3 1648
 
5.5%
7 1495
 
5.0%
8 1296
 
4.3%
Other values (5) 1293
 
4.3%
Hangul
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29869
> 99.9%
Hangul 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6706
22.5%
2 6511
21.8%
9 2649
 
8.9%
4 2441
 
8.2%
6 2347
 
7.9%
1 1800
 
6.0%
5 1683
 
5.6%
3 1648
 
5.5%
7 1495
 
5.0%
8 1296
 
4.3%
Other values (5) 1293
 
4.3%
Hangul
ValueCountFrequency (%)
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4327
Missing (%)100.0%
Memory size38.2 KiB

소재지우편번호
Text

MISSING 

Distinct138
Distinct (%)11.9%
Missing3171
Missing (%)73.3%
Memory size33.9 KiB
2024-05-11T06:38:42.836818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.0423875
Min length6

Characters and Unicode

Total characters6985
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)3.6%

Sample

1st row130878
2nd row130810
3rd row130021
4th row130-800
5th row130774
ValueCountFrequency (%)
130101 109
 
9.4%
130021 85
 
7.4%
130070 79
 
6.8%
130060 71
 
6.1%
130031 68
 
5.9%
130010 52
 
4.5%
130081 49
 
4.2%
130091 43
 
3.7%
130110 40
 
3.5%
130050 27
 
2.3%
Other values (128) 533
46.1%
2024-05-11T06:38:43.792121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2307
33.0%
1 1896
27.1%
3 1332
19.1%
8 508
 
7.3%
2 227
 
3.2%
7 209
 
3.0%
6 165
 
2.4%
5 112
 
1.6%
9 90
 
1.3%
4 90
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6936
99.3%
Dash Punctuation 49
 
0.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2307
33.3%
1 1896
27.3%
3 1332
19.2%
8 508
 
7.3%
2 227
 
3.3%
7 209
 
3.0%
6 165
 
2.4%
5 112
 
1.6%
9 90
 
1.3%
4 90
 
1.3%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6985
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2307
33.0%
1 1896
27.1%
3 1332
19.1%
8 508
 
7.3%
2 227
 
3.2%
7 209
 
3.0%
6 165
 
2.4%
5 112
 
1.6%
9 90
 
1.3%
4 90
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6985
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2307
33.0%
1 1896
27.1%
3 1332
19.1%
8 508
 
7.3%
2 227
 
3.2%
7 209
 
3.0%
6 165
 
2.4%
5 112
 
1.6%
9 90
 
1.3%
4 90
 
1.3%
Distinct3657
Distinct (%)84.5%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
2024-05-11T06:38:44.678864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length47
Mean length26.173099
Min length1

Characters and Unicode

Total characters113251
Distinct characters326
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3180 ?
Unique (%)73.5%

Sample

1st row서울특별시 동대문구 제기동 137번지 140호
2nd row서울특별시 동대문구 답십리동 282번지 8호
3rd row서울특별시 동대문구 이문동 324번지 4호
4th row서울특별시 동대문구 답십리동 962호 (2F 219)
5th row서울특별시 동대문구 이문동 292번지 190호
ValueCountFrequency (%)
서울특별시 4326
18.8%
동대문구 4323
18.8%
장안동 784
 
3.4%
659
 
2.9%
답십리동 496
 
2.2%
전농동 477
 
2.1%
제기동 412
 
1.8%
용신동 365
 
1.6%
이문동 364
 
1.6%
청량리동 280
 
1.2%
Other values (2086) 10562
45.8%
2024-05-11T06:38:45.929597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23103
20.4%
8806
 
7.8%
4778
 
4.2%
1 4554
 
4.0%
4428
 
3.9%
4355
 
3.8%
4343
 
3.8%
4342
 
3.8%
4337
 
3.8%
4328
 
3.8%
Other values (316) 45877
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 68907
60.8%
Space Separator 23103
 
20.4%
Decimal Number 20724
 
18.3%
Dash Punctuation 259
 
0.2%
Uppercase Letter 91
 
0.1%
Other Punctuation 63
 
0.1%
Close Punctuation 45
 
< 0.1%
Open Punctuation 45
 
< 0.1%
Math Symbol 7
 
< 0.1%
Lowercase Letter 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8806
12.8%
4778
 
6.9%
4428
 
6.4%
4355
 
6.3%
4343
 
6.3%
4342
 
6.3%
4337
 
6.3%
4328
 
6.3%
4326
 
6.3%
4195
 
6.1%
Other values (274) 20669
30.0%
Uppercase Letter
ValueCountFrequency (%)
S 20
22.0%
K 19
20.9%
B 9
9.9%
F 7
 
7.7%
A 6
 
6.6%
D 5
 
5.5%
O 3
 
3.3%
T 3
 
3.3%
W 3
 
3.3%
E 3
 
3.3%
Other values (9) 13
14.3%
Decimal Number
ValueCountFrequency (%)
1 4554
22.0%
2 2810
13.6%
3 2445
11.8%
4 2022
9.8%
5 1765
 
8.5%
0 1610
 
7.8%
9 1495
 
7.2%
6 1428
 
6.9%
7 1329
 
6.4%
8 1266
 
6.1%
Other Punctuation
ValueCountFrequency (%)
, 45
71.4%
@ 12
 
19.0%
. 4
 
6.3%
/ 2
 
3.2%
Lowercase Letter
ValueCountFrequency (%)
e 3
42.9%
b 2
28.6%
s 1
 
14.3%
k 1
 
14.3%
Space Separator
ValueCountFrequency (%)
23103
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 259
100.0%
Close Punctuation
ValueCountFrequency (%)
) 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 45
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 68907
60.8%
Common 44246
39.1%
Latin 98
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8806
12.8%
4778
 
6.9%
4428
 
6.4%
4355
 
6.3%
4343
 
6.3%
4342
 
6.3%
4337
 
6.3%
4328
 
6.3%
4326
 
6.3%
4195
 
6.1%
Other values (274) 20669
30.0%
Latin
ValueCountFrequency (%)
S 20
20.4%
K 19
19.4%
B 9
9.2%
F 7
 
7.1%
A 6
 
6.1%
D 5
 
5.1%
O 3
 
3.1%
T 3
 
3.1%
e 3
 
3.1%
W 3
 
3.1%
Other values (13) 20
20.4%
Common
ValueCountFrequency (%)
23103
52.2%
1 4554
 
10.3%
2 2810
 
6.4%
3 2445
 
5.5%
4 2022
 
4.6%
5 1765
 
4.0%
0 1610
 
3.6%
9 1495
 
3.4%
6 1428
 
3.2%
7 1329
 
3.0%
Other values (9) 1685
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 68907
60.8%
ASCII 44344
39.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23103
52.1%
1 4554
 
10.3%
2 2810
 
6.3%
3 2445
 
5.5%
4 2022
 
4.6%
5 1765
 
4.0%
0 1610
 
3.6%
9 1495
 
3.4%
6 1428
 
3.2%
7 1329
 
3.0%
Other values (32) 1783
 
4.0%
Hangul
ValueCountFrequency (%)
8806
12.8%
4778
 
6.9%
4428
 
6.4%
4355
 
6.3%
4343
 
6.3%
4342
 
6.3%
4337
 
6.3%
4328
 
6.3%
4326
 
6.3%
4195
 
6.1%
Other values (274) 20669
30.0%

도로명주소
Text

MISSING 

Distinct2666
Distinct (%)69.7%
Missing504
Missing (%)11.6%
Memory size33.9 KiB
2024-05-11T06:38:46.564653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length59
Mean length28.678525
Min length22

Characters and Unicode

Total characters109638
Distinct characters334
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1991 ?
Unique (%)52.1%

Sample

1st row서울특별시 동대문구 제기로 7 (제기동)
2nd row서울특별시 동대문구 이문로 85 (이문동)
3rd row서울특별시 동대문구 이문로30길 9 (이문동)
4th row서울특별시 동대문구 망우로16길 11-15 (휘경동)
5th row서울특별시 동대문구 이문로42길 50-12 (이문동)
ValueCountFrequency (%)
서울특별시 3823
 
18.5%
동대문구 3818
 
18.4%
장안동 747
 
3.6%
1층 510
 
2.5%
답십리동 430
 
2.1%
전농동 415
 
2.0%
이문동 341
 
1.6%
용두동 340
 
1.6%
제기동 298
 
1.4%
휘경동 276
 
1.3%
Other values (1818) 9704
46.9%
2024-05-11T06:38:47.606406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16884
 
15.4%
7931
 
7.2%
4509
 
4.1%
4423
 
4.0%
1 4169
 
3.8%
4054
 
3.7%
3934
 
3.6%
3922
 
3.6%
) 3854
 
3.5%
( 3853
 
3.5%
Other values (324) 52105
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67550
61.6%
Space Separator 16884
 
15.4%
Decimal Number 15272
 
13.9%
Close Punctuation 3854
 
3.5%
Open Punctuation 3853
 
3.5%
Other Punctuation 1514
 
1.4%
Dash Punctuation 571
 
0.5%
Uppercase Letter 119
 
0.1%
Math Symbol 13
 
< 0.1%
Lowercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7931
 
11.7%
4509
 
6.7%
4423
 
6.5%
4054
 
6.0%
3934
 
5.8%
3922
 
5.8%
3829
 
5.7%
3827
 
5.7%
3825
 
5.7%
3823
 
5.7%
Other values (279) 23473
34.7%
Uppercase Letter
ValueCountFrequency (%)
S 19
16.0%
K 18
15.1%
B 13
10.9%
A 11
9.2%
M 6
 
5.0%
D 6
 
5.0%
E 5
 
4.2%
W 5
 
4.2%
T 5
 
4.2%
I 4
 
3.4%
Other values (12) 27
22.7%
Decimal Number
ValueCountFrequency (%)
1 4169
27.3%
2 2142
14.0%
3 1577
 
10.3%
4 1345
 
8.8%
0 1202
 
7.9%
5 1081
 
7.1%
6 1078
 
7.1%
7 1004
 
6.6%
8 907
 
5.9%
9 767
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 1493
98.6%
@ 11
 
0.7%
. 7
 
0.5%
/ 3
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
e 3
37.5%
s 2
25.0%
k 2
25.0%
b 1
 
12.5%
Space Separator
ValueCountFrequency (%)
16884
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3854
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3853
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 571
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67549
61.6%
Common 41961
38.3%
Latin 127
 
0.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7931
 
11.7%
4509
 
6.7%
4423
 
6.5%
4054
 
6.0%
3934
 
5.8%
3922
 
5.8%
3829
 
5.7%
3827
 
5.7%
3825
 
5.7%
3823
 
5.7%
Other values (278) 23472
34.7%
Latin
ValueCountFrequency (%)
S 19
15.0%
K 18
14.2%
B 13
 
10.2%
A 11
 
8.7%
M 6
 
4.7%
D 6
 
4.7%
E 5
 
3.9%
W 5
 
3.9%
T 5
 
3.9%
I 4
 
3.1%
Other values (16) 35
27.6%
Common
ValueCountFrequency (%)
16884
40.2%
1 4169
 
9.9%
) 3854
 
9.2%
( 3853
 
9.2%
2 2142
 
5.1%
3 1577
 
3.8%
, 1493
 
3.6%
4 1345
 
3.2%
0 1202
 
2.9%
5 1081
 
2.6%
Other values (9) 4361
 
10.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67549
61.6%
ASCII 42088
38.4%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16884
40.1%
1 4169
 
9.9%
) 3854
 
9.2%
( 3853
 
9.2%
2 2142
 
5.1%
3 1577
 
3.7%
, 1493
 
3.5%
4 1345
 
3.2%
0 1202
 
2.9%
5 1081
 
2.6%
Other values (35) 4488
 
10.7%
Hangul
ValueCountFrequency (%)
7931
 
11.7%
4509
 
6.7%
4423
 
6.5%
4054
 
6.0%
3934
 
5.8%
3922
 
5.8%
3829
 
5.7%
3827
 
5.7%
3825
 
5.7%
3823
 
5.7%
Other values (278) 23472
34.7%
CJK
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Text

MISSING 

Distinct303
Distinct (%)21.1%
Missing2892
Missing (%)66.8%
Memory size33.9 KiB
2024-05-11T06:38:48.321716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.4675958
Min length5

Characters and Unicode

Total characters7846
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)5.4%

Sample

1st row02451
2nd row130070
3rd row02469
4th row130867
5th row02460
ValueCountFrequency (%)
130101 63
 
4.4%
130070 52
 
3.6%
130021 50
 
3.5%
130031 45
 
3.1%
130060 41
 
2.9%
130081 37
 
2.6%
130010 30
 
2.1%
130091 24
 
1.7%
02586 20
 
1.4%
130110 18
 
1.3%
Other values (293) 1055
73.5%
2024-05-11T06:38:49.204254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2208
28.1%
1 1184
15.1%
2 1113
14.2%
3 913
11.6%
5 621
 
7.9%
4 466
 
5.9%
6 440
 
5.6%
8 429
 
5.5%
7 243
 
3.1%
9 196
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7813
99.6%
Dash Punctuation 33
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2208
28.3%
1 1184
15.2%
2 1113
14.2%
3 913
11.7%
5 621
 
7.9%
4 466
 
6.0%
6 440
 
5.6%
8 429
 
5.5%
7 243
 
3.1%
9 196
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7846
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2208
28.1%
1 1184
15.1%
2 1113
14.2%
3 913
11.6%
5 621
 
7.9%
4 466
 
5.9%
6 440
 
5.6%
8 429
 
5.5%
7 243
 
3.1%
9 196
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7846
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2208
28.1%
1 1184
15.1%
2 1113
14.2%
3 913
11.6%
5 621
 
7.9%
4 466
 
5.9%
6 440
 
5.6%
8 429
 
5.5%
7 243
 
3.1%
9 196
 
2.5%
Distinct3179
Distinct (%)73.5%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
2024-05-11T06:38:49.823670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length6.2357291
Min length1

Characters and Unicode

Total characters26982
Distinct characters653
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2635 ?
Unique (%)60.9%

Sample

1st row담배
2nd row박응관
3rd row영남상회
4th row송화매점
5th row새마을수퍼
ValueCountFrequency (%)
씨유 148
 
2.9%
gs25 85
 
1.6%
담배 60
 
1.2%
세븐일레븐 56
 
1.1%
주)코리아세븐 47
 
0.9%
식품 43
 
0.8%
지에스25 43
 
0.8%
이마트24 27
 
0.5%
현대슈퍼 22
 
0.4%
지에스(gs)25 18
 
0.3%
Other values (3207) 4611
89.4%
2024-05-11T06:38:50.820568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1020
 
3.8%
841
 
3.1%
716
 
2.7%
689
 
2.6%
678
 
2.5%
594
 
2.2%
509
 
1.9%
503
 
1.9%
2 437
 
1.6%
424
 
1.6%
Other values (643) 20571
76.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23615
87.5%
Decimal Number 962
 
3.6%
Space Separator 841
 
3.1%
Uppercase Letter 834
 
3.1%
Close Punctuation 292
 
1.1%
Open Punctuation 291
 
1.1%
Lowercase Letter 106
 
0.4%
Other Punctuation 33
 
0.1%
Dash Punctuation 7
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1020
 
4.3%
716
 
3.0%
689
 
2.9%
678
 
2.9%
594
 
2.5%
509
 
2.2%
503
 
2.1%
424
 
1.8%
398
 
1.7%
368
 
1.6%
Other values (575) 17716
75.0%
Uppercase Letter
ValueCountFrequency (%)
S 281
33.7%
G 269
32.3%
C 54
 
6.5%
U 45
 
5.4%
K 30
 
3.6%
L 28
 
3.4%
I 14
 
1.7%
J 13
 
1.6%
E 13
 
1.6%
A 13
 
1.6%
Other values (14) 74
 
8.9%
Lowercase Letter
ValueCountFrequency (%)
e 20
18.9%
a 11
10.4%
r 10
 
9.4%
o 8
 
7.5%
t 8
 
7.5%
m 8
 
7.5%
k 5
 
4.7%
n 4
 
3.8%
u 4
 
3.8%
l 3
 
2.8%
Other values (12) 25
23.6%
Decimal Number
ValueCountFrequency (%)
2 437
45.4%
5 355
36.9%
4 75
 
7.8%
1 38
 
4.0%
3 26
 
2.7%
0 14
 
1.5%
6 7
 
0.7%
7 6
 
0.6%
8 2
 
0.2%
9 2
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 17
51.5%
, 8
24.2%
& 3
 
9.1%
: 2
 
6.1%
? 1
 
3.0%
/ 1
 
3.0%
! 1
 
3.0%
Space Separator
ValueCountFrequency (%)
841
100.0%
Close Punctuation
ValueCountFrequency (%)
) 292
100.0%
Open Punctuation
ValueCountFrequency (%)
( 291
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23615
87.5%
Common 2426
 
9.0%
Latin 940
 
3.5%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1020
 
4.3%
716
 
3.0%
689
 
2.9%
678
 
2.9%
594
 
2.5%
509
 
2.2%
503
 
2.1%
424
 
1.8%
398
 
1.7%
368
 
1.6%
Other values (575) 17716
75.0%
Latin
ValueCountFrequency (%)
S 281
29.9%
G 269
28.6%
C 54
 
5.7%
U 45
 
4.8%
K 30
 
3.2%
L 28
 
3.0%
e 20
 
2.1%
I 14
 
1.5%
J 13
 
1.4%
E 13
 
1.4%
Other values (36) 173
18.4%
Common
ValueCountFrequency (%)
841
34.7%
2 437
18.0%
5 355
14.6%
) 292
 
12.0%
( 291
 
12.0%
4 75
 
3.1%
1 38
 
1.6%
3 26
 
1.1%
. 17
 
0.7%
0 14
 
0.6%
Other values (11) 40
 
1.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23614
87.5%
ASCII 3366
 
12.5%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1020
 
4.3%
716
 
3.0%
689
 
2.9%
678
 
2.9%
594
 
2.5%
509
 
2.2%
503
 
2.1%
424
 
1.8%
398
 
1.7%
368
 
1.6%
Other values (574) 17715
75.0%
ASCII
ValueCountFrequency (%)
841
25.0%
2 437
13.0%
5 355
10.5%
) 292
 
8.7%
( 291
 
8.6%
S 281
 
8.3%
G 269
 
8.0%
4 75
 
2.2%
C 54
 
1.6%
U 45
 
1.3%
Other values (57) 426
12.7%
None
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct3040
Distinct (%)70.3%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
Minimum2007-07-23 17:11:56
Maximum2024-05-08 17:09:21
2024-05-11T06:38:51.132301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T06:38:51.553121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
I
3480 
U
847 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowI
5th rowI

Common Values

ValueCountFrequency (%)
I 3480
80.4%
U 847
 
19.6%

Length

2024-05-11T06:38:51.949934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T06:38:52.261621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 3480
80.4%
u 847
 
19.6%
Distinct618
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
Minimum2018-08-31 23:59:59
Maximum2023-12-05 00:09:00
2024-05-11T06:38:52.617510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T06:38:53.006486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업태구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4327
Missing (%)100.0%
Memory size38.2 KiB

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct1952
Distinct (%)48.4%
Missing294
Missing (%)6.8%
Infinite0
Infinite (%)0.0%
Mean204699.91
Minimum201991.59
Maximum207343.18
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.2 KiB
2024-05-11T06:38:53.508439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201991.59
5-th percentile202444.18
Q1203757.01
median204956.79
Q3205726.12
95-th percentile206318.5
Maximum207343.18
Range5351.5918
Interquartile range (IQR)1969.1148

Descriptive statistics

Standard deviation1200.1426
Coefficient of variation (CV)0.005862937
Kurtosis-0.79301565
Mean204699.91
Median Absolute Deviation (MAD)890.41254
Skewness-0.48402556
Sum8.2555473 × 108
Variance1440342.4
MonotonicityNot monotonic
2024-05-11T06:38:53.974123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
206590.046202018 16
 
0.4%
205271.704936121 15
 
0.3%
204469.209049094 13
 
0.3%
205168.920264507 12
 
0.3%
206190.228695097 11
 
0.3%
205996.909518819 10
 
0.2%
202024.479054844 10
 
0.2%
205018.506913318 10
 
0.2%
205810.922953473 10
 
0.2%
203743.598607972 10
 
0.2%
Other values (1942) 3916
90.5%
(Missing) 294
 
6.8%
ValueCountFrequency (%)
201991.588408047 1
 
< 0.1%
202013.401502902 3
 
0.1%
202024.479054844 10
0.2%
202026.286928788 2
 
< 0.1%
202032.935534754 2
 
< 0.1%
202033.014104401 1
 
< 0.1%
202034.733564967 4
 
0.1%
202036.453390905 2
 
< 0.1%
202042.817335648 1
 
< 0.1%
202042.827600608 3
 
0.1%
ValueCountFrequency (%)
207343.180165801 1
 
< 0.1%
206630.901669352 3
 
0.1%
206618.082278385 1
 
< 0.1%
206592.423303981 5
 
0.1%
206590.046202018 16
0.4%
206586.572428103 2
 
< 0.1%
206562.649224959 2
 
< 0.1%
206560.736934242 2
 
< 0.1%
206543.156690365 3
 
0.1%
206538.84863498 4
 
0.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct1951
Distinct (%)48.4%
Missing294
Missing (%)6.8%
Infinite0
Infinite (%)0.0%
Mean453049.96
Minimum444744.67
Maximum462100.91
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.2 KiB
2024-05-11T06:38:54.399030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum444744.67
5-th percentile451388.67
Q1452224.96
median452880.11
Q3453825.19
95-th percentile455256.32
Maximum462100.91
Range17356.244
Interquartile range (IQR)1600.2259

Descriptive statistics

Standard deviation1149.964
Coefficient of variation (CV)0.002538272
Kurtosis1.0168281
Mean453049.96
Median Absolute Deviation (MAD)774.14987
Skewness0.46736477
Sum1.8271505 × 109
Variance1322417.3
MonotonicityNot monotonic
2024-05-11T06:38:54.911244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
452410.619464697 16
 
0.4%
452706.897879436 15
 
0.3%
454934.047367752 13
 
0.3%
454277.787393015 12
 
0.3%
453109.711081815 11
 
0.3%
453361.806084615 10
 
0.2%
452448.721381334 10
 
0.2%
451331.654449855 10
 
0.2%
451847.699904699 10
 
0.2%
452993.535471829 10
 
0.2%
Other values (1941) 3916
90.5%
(Missing) 294
 
6.8%
ValueCountFrequency (%)
444744.670820701 1
 
< 0.1%
450990.782258934 1
 
< 0.1%
451011.194201757 3
0.1%
451025.902075739 2
< 0.1%
451031.569748553 2
< 0.1%
451038.454241239 1
 
< 0.1%
451045.898346812 1
 
< 0.1%
451049.960154453 1
 
< 0.1%
451068.470216265 1
 
< 0.1%
451070.693941969 1
 
< 0.1%
ValueCountFrequency (%)
462100.914909035 1
 
< 0.1%
455917.552544887 7
0.2%
455899.982370316 6
0.1%
455876.231869346 3
0.1%
455813.713540339 3
0.1%
455803.028832488 1
 
< 0.1%
455790.631069022 4
0.1%
455774.815358764 1
 
< 0.1%
455760.496065909 2
 
< 0.1%
455753.744478701 1
 
< 0.1%

지정일자
Real number (ℝ)

MISSING 

Distinct1838
Distinct (%)70.3%
Missing1712
Missing (%)39.6%
Infinite0
Infinite (%)0.0%
Mean20080860
Minimum19720630
Maximum20220321
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size38.2 KiB
2024-05-11T06:38:55.360554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19720630
5-th percentile19940498
Q120040110
median20090702
Q320140821
95-th percentile20190616
Maximum20220321
Range499691
Interquartile range (IQR)100711

Descriptive statistics

Standard deviation79311.855
Coefficient of variation (CV)0.0039496243
Kurtosis1.0568619
Mean20080860
Median Absolute Deviation (MAD)50308
Skewness-0.87328377
Sum5.251145 × 1010
Variance6.2903704 × 109
MonotonicityNot monotonic
2024-05-11T06:38:55.832496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19990101 19
 
0.4%
19790630 11
 
0.3%
19981221 8
 
0.2%
20050128 8
 
0.2%
19981102 7
 
0.2%
19981030 7
 
0.2%
19981110 7
 
0.2%
19981219 7
 
0.2%
19981212 6
 
0.1%
20121018 6
 
0.1%
Other values (1828) 2529
58.4%
(Missing) 1712
39.6%
ValueCountFrequency (%)
19720630 2
 
< 0.1%
19740701 1
 
< 0.1%
19781222 1
 
< 0.1%
19790630 11
0.3%
19801024 1
 
< 0.1%
19810621 1
 
< 0.1%
19820514 1
 
< 0.1%
19821102 1
 
< 0.1%
19821116 1
 
< 0.1%
19821223 1
 
< 0.1%
ValueCountFrequency (%)
20220321 1
< 0.1%
20220225 2
< 0.1%
20220113 1
< 0.1%
20211222 1
< 0.1%
20211207 2
< 0.1%
20211202 1
< 0.1%
20211126 1
< 0.1%
20211112 1
< 0.1%
20211111 1
< 0.1%
20211109 1
< 0.1%

민원종류명
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size33.9 KiB
<NA>
1712 
2009년11월법개정전자료
1324 
제7조의3제2항에따른경우
1225 
제7조의3제3항에따른경우
 
66

Length

Max length14
Median length13
Mean length9.745089
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row2009년11월법개정전자료
3rd row2009년11월법개정전자료
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1712
39.6%
2009년11월법개정전자료 1324
30.6%
제7조의3제2항에따른경우 1225
28.3%
제7조의3제3항에따른경우 66
 
1.5%

Length

2024-05-11T06:38:56.263961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T06:38:56.632475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1712
39.6%
2009년11월법개정전자료 1324
30.6%
제7조의3제2항에따른경우 1225
28.3%
제7조의3제3항에따른경우 66
 
1.5%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)지정일자민원종류명
03050000196830500820560500519680630<NA>3폐업2폐업처리20010820<NA><NA><NA>000209244233<NA><NA>서울특별시 동대문구 제기동 137번지 140호서울특별시 동대문구 제기로 7 (제기동)<NA>담배2008-02-21 00:00:00I2018-08-31 23:59:59.0<NA>202908.89497453813.199718<NA><NA>
13050000197230500820561300619720630200405274취소/말소/만료/정지/중지3직권취소<NA><NA><NA><NA>000222446132<NA><NA>서울특별시 동대문구 답십리동 282번지 8호<NA><NA>박응관2018-06-05 15:30:22I2018-08-31 23:59:59.0<NA><NA><NA>197206302009년11월법개정전자료
23050000197230500820562500119720630<NA>3폐업2폐업처리20160907<NA><NA><NA>000209674781<NA><NA>서울특별시 동대문구 이문동 324번지 4호서울특별시 동대문구 이문로 85 (이문동)02451영남상회2016-09-07 09:36:06I2018-08-31 23:59:59.0<NA>205127.89623454716.5395197206302009년11월법개정전자료
33050000197330500820561304519730518<NA>3폐업2폐업처리20060119<NA><NA><NA>000222134553<NA><NA>서울특별시 동대문구 답십리동 962호 (2F 219)<NA><NA>송화매점2008-02-21 00:00:00I2018-08-31 23:59:59.0<NA>205055.087973451485.94501<NA><NA>
43050000197330500820562402519730630<NA>3폐업2폐업처리20070117<NA><NA><NA>000209674115<NA><NA>서울특별시 동대문구 이문동 292번지 190호서울특별시 동대문구 이문로30길 9 (이문동)<NA>새마을수퍼2008-02-21 00:00:00I2018-08-31 23:59:59.0<NA>205341.113858454960.132765<NA><NA>
53050000197430500820562301019740701<NA>3폐업2폐업처리20120105<NA><NA><NA>000222444495<NA>130878서울특별시 동대문구 휘경2동 276번지 3호서울특별시 동대문구 망우로16길 11-15 (휘경동)<NA>이천마트2012-01-05 09:57:07I2018-08-31 23:59:59.0<NA>205440.411321454137.552904197407012009년11월법개정전자료
63050000197630500820562402719761012<NA>3폐업2폐업처리20031015<NA><NA><NA>000209672735<NA><NA>서울특별시 동대문구 이문동 163번지 44호서울특별시 동대문구 이문로42길 50-12 (이문동)<NA>쌀상회2008-02-21 00:00:00I2018-08-31 23:59:59.0<NA>205662.831799455333.468628<NA><NA>
73050000197630500820562503119760529<NA>3폐업2폐업처리20030821<NA><NA><NA>000209631616<NA><NA>서울특별시 동대문구 이문동 319번지 107호서울특별시 동대문구 이문로 64-4 (이문동)<NA>경기상회2008-02-21 00:00:00I2018-08-31 23:59:59.0<NA>205058.155851454487.05365<NA><NA>
83050000197830500820560203119780401<NA>3폐업2폐업처리20020412<NA><NA><NA>000209668731<NA><NA>서울특별시 동대문구 용신동 39번지 396호서울특별시 동대문구 천호대로45가길 36 (용두동)<NA>담배가게2008-02-21 00:00:00I2018-08-31 23:59:59.0<NA>203642.237051452647.83484<NA><NA>
93050000197830500820562501219781222<NA>3폐업2폐업처리20070817<NA><NA><NA>000209616669<NA><NA>서울특별시 동대문구 이문동 262번지 25 호서울특별시 동대문구 이문로33길 18 (이문동)<NA>잡화2007-08-17 10:14:29I2018-08-31 23:59:59.0<NA>205293.706474455225.133301197812222009년11월법개정전자료
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)지정일자민원종류명
4317305000020243050210056000142024-03-08<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 장안동 93-43서울특별시 동대문구 사가정로25길 34, 1층 101호 (장안동)02515지에스(GS)25 장안주공2024-03-08 16:53:01I2023-12-02 23:00:00.0<NA>206190.228695453109.711082<NA><NA>
4318305000020243050210056000152024-03-13<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 휘경동 286-334서울특별시 동대문구 망우로16길 26, 1층 (휘경동)02496씨유 휘경점2024-03-13 14:23:55I2023-12-02 23:05:00.0<NA>205326.014468454077.422005<NA><NA>
4319305000020243050210056000162024-03-21<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 이문동 264-445서울특별시 동대문구 천장산로 47, 1층 나12,13호 (이문동)02448씨유(CU) 이문삼성2024-03-22 08:59:45I2023-12-02 22:04:00.0<NA>204991.594355455332.294542<NA><NA>
4320305000020243050210056000172024-03-28<NA>1영업/정상0정상영업<NA><NA><NA><NA>02-6456-7111<NA><NA>서울특별시 동대문구 제기동 1140-5 불로장생타워서울특별시 동대문구 왕산로 117, 불로장생타워상가 1층 (제기동)02569유한책임회사 골드마트쇼핑2024-03-28 17:45:51I2023-12-02 21:00:00.0<NA>203155.504922452914.607543<NA><NA>
4321305000020243050210056000182024-04-05<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 용두동 39-419서울특별시 동대문구 천호대로31길 50 (용두동)02562지에스(GS)25 동대문용두2024-04-05 13:26:25I2023-12-04 00:07:00.0<NA>203589.797341452591.266044<NA><NA>
4322305000020243050210056000192024-04-09<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 장안동 424-7서울특별시 동대문구 장한로5길 16, 1층 (장안동)02629씨유(CU) 장안지우점2024-04-09 13:16:36I2023-12-03 23:01:00.0<NA>205722.737527451359.751072<NA><NA>
4323305000020243050210056000202024-04-12<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 신설동 117-12 신설동 지웰홈스서울특별시 동대문구 난계로 242, 104,105호 (신설동, 신설동 지웰홈스)02586지에스(GS)25 동대문지웰2024-04-12 10:18:38I2023-12-03 23:04:00.0<NA>202036.453391452307.687696<NA><NA>
4324305000020243050210056000212024-04-24<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 제기동 148-41서울특별시 동대문구 약령시로 7 (제기동)02477씨유 안암로타리점2024-04-24 13:48:50I2023-12-03 22:07:00.0<NA>202633.567387453392.902949<NA><NA>
4325305000020243050210056000222024-04-26<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 청량리동 224-2서울특별시 동대문구 약령시로21길 5, 1층 (청량리동)02484씨유 청량리경찰서점2024-04-26 15:48:17I2023-12-03 22:08:00.0<NA>203900.727368453480.356786<NA><NA>
4326305000020243050210056000232024-04-26<NA>1영업/정상0정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 동대문구 답십리동 765 대원빌딩서울특별시 동대문구 한천로 107-3, 대원빌딩 1층 (답십리동)02610씨유 답십리대림점2024-04-26 15:51:15I2023-12-03 22:08:00.0<NA>205661.548239452069.966458<NA><NA>