Overview

Dataset statistics

Number of variables27
Number of observations61
Missing cells521
Missing cells (%)31.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.7 KiB
Average record size in memory230.2 B

Variable types

Categorical9
Numeric4
DateTime6
Unsupported4
Text4

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),자본금,거래처
Author용산구
URLhttps://data.seoul.go.kr/dataList/OA-19054/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
휴업시작일자 has constant value ""Constant
휴업종료일자 has constant value ""Constant
재개업일자 has constant value ""Constant
도로명우편번호 is highly imbalanced (79.3%)Imbalance
인허가취소일자 has 61 (100.0%) missing valuesMissing
폐업일자 has 45 (73.8%) missing valuesMissing
휴업시작일자 has 60 (98.4%) missing valuesMissing
휴업종료일자 has 60 (98.4%) missing valuesMissing
재개업일자 has 60 (98.4%) missing valuesMissing
전화번호 has 2 (3.3%) missing valuesMissing
소재지면적 has 37 (60.7%) missing valuesMissing
소재지우편번호 has 61 (100.0%) missing valuesMissing
도로명주소 has 5 (8.2%) missing valuesMissing
좌표정보(X) has 4 (6.6%) missing valuesMissing
좌표정보(Y) has 4 (6.6%) missing valuesMissing
자본금 has 61 (100.0%) missing valuesMissing
거래처 has 61 (100.0%) missing valuesMissing
관리번호 has unique valuesUnique
최종수정일자 has unique valuesUnique
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지우편번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
자본금 is an unsupported type, check if it needs cleaning or further analysisUnsupported
거래처 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 02:02:33.619884
Analysis finished2024-05-11 02:02:34.621805
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size620.0 B
3020000
61 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3020000
2nd row3020000
3rd row3020000
4th row3020000
5th row3020000

Common Values

ValueCountFrequency (%)
3020000 61
100.0%

Length

2024-05-11T02:02:35.037698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:35.741337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3020000 61
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9942364 × 1018
Minimum1.976302 × 1018
Maximum2.018302 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size681.0 B
2024-05-11T02:02:36.223404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.976302 × 1018
5-th percentile1.976302 × 1018
Q11.986302 × 1018
median1.997302 × 1018
Q32.002302 × 1018
95-th percentile2.005302 × 1018
Maximum2.018302 × 1018
Range4.2000008 × 1016
Interquartile range (IQR)1.6 × 1016

Descriptive statistics

Standard deviation1.0560728 × 1016
Coefficient of variation (CV)0.0052956247
Kurtosis-0.85507825
Mean1.9942364 × 1018
Median Absolute Deviation (MAD)6 × 1015
Skewness-0.44085475
Sum-7.4787861 × 1018
Variance1.1152897 × 1032
MonotonicityStrictly increasing
2024-05-11T02:02:37.021850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1976302007401500004 1
 
1.6%
2002302007401200003 1
 
1.6%
2001302007401200011 1
 
1.6%
2001302007401200013 1
 
1.6%
2001302007401200017 1
 
1.6%
2001302007401200018 1
 
1.6%
2001302007401200021 1
 
1.6%
2001302007401200022 1
 
1.6%
2001302007401200023 1
 
1.6%
2001302007401500003 1
 
1.6%
Other values (51) 51
83.6%
ValueCountFrequency (%)
1976302007401500004 1
1.6%
1976302007401500005 1
1.6%
1976302007401500006 1
1.6%
1976302007401500007 1
1.6%
1976302007401500010 1
1.6%
1976302007401500011 1
1.6%
1979302007401200001 1
1.6%
1979302007401200002 1
1.6%
1979302007401200003 1
1.6%
1979302007401200004 1
1.6%
ValueCountFrequency (%)
2018302015601200001 1
1.6%
2007302009501500002 1
1.6%
2007302009501500001 1
1.6%
2005302007401509188 1
1.6%
2004302007401500001 1
1.6%
2003302007401500018 1
1.6%
2003302007401500017 1
1.6%
2003302007401500015 1
1.6%
2003302007401500007 1
1.6%
2003302007401500002 1
1.6%
Distinct41
Distinct (%)67.2%
Missing0
Missing (%)0.0%
Memory size620.0 B
Minimum1976-04-30 00:00:00
Maximum2018-11-22 00:00:00
2024-05-11T02:02:37.424375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:02:37.885103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=41)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing61
Missing (%)100.0%
Memory size681.0 B
Distinct3
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size620.0 B
3
40 
1
20 
4
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row3
2nd row1
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
3 40
65.6%
1 20
32.8%
4 1
 
1.6%

Length

2024-05-11T02:02:38.327090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:38.680687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 40
65.6%
1 20
32.8%
4 1
 
1.6%

영업상태명
Categorical

Distinct3
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size620.0 B
폐업
40 
영업/정상
20 
취소/말소/만료/정지/중지
 
1

Length

Max length14
Median length2
Mean length3.1803279
Min length2

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row폐업
2nd row영업/정상
3rd row폐업
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
폐업 40
65.6%
영업/정상 20
32.8%
취소/말소/만료/정지/중지 1
 
1.6%

Length

2024-05-11T02:02:39.041351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:39.351952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐업 40
65.6%
영업/정상 20
32.8%
취소/말소/만료/정지/중지 1
 
1.6%
Distinct4
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size620.0 B
3
40 
1
19 
6
 
1
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)3.3%

Sample

1st row3
2nd row1
3rd row3
4th row1
5th row1

Common Values

ValueCountFrequency (%)
3 40
65.6%
1 19
31.1%
6 1
 
1.6%
2 1
 
1.6%

Length

2024-05-11T02:02:39.676699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:40.132014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 40
65.6%
1 19
31.1%
6 1
 
1.6%
2 1
 
1.6%
Distinct4
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Memory size620.0 B
폐지
40 
신규등록
19 
휴지사업재개
 
1
등록취소
 
1

Length

Max length6
Median length2
Mean length2.7213115
Min length2

Unique

Unique2 ?
Unique (%)3.3%

Sample

1st row폐지
2nd row신규등록
3rd row폐지
4th row신규등록
5th row신규등록

Common Values

ValueCountFrequency (%)
폐지 40
65.6%
신규등록 19
31.1%
휴지사업재개 1
 
1.6%
등록취소 1
 
1.6%

Length

2024-05-11T02:02:40.631100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:41.169439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
폐지 40
65.6%
신규등록 19
31.1%
휴지사업재개 1
 
1.6%
등록취소 1
 
1.6%

폐업일자
Date

MISSING 

Distinct16
Distinct (%)100.0%
Missing45
Missing (%)73.8%
Memory size620.0 B
Minimum2007-06-11 00:00:00
Maximum2024-05-01 00:00:00
2024-05-11T02:02:41.598718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:02:41.984204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)

휴업시작일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing60
Missing (%)98.4%
Memory size620.0 B
Minimum2012-05-21 00:00:00
Maximum2012-05-21 00:00:00
2024-05-11T02:02:42.260966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:02:42.649653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

휴업종료일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing60
Missing (%)98.4%
Memory size620.0 B
Minimum2012-08-10 00:00:00
Maximum2012-08-10 00:00:00
2024-05-11T02:02:43.159660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:02:43.649083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

재개업일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing60
Missing (%)98.4%
Memory size620.0 B
Minimum2024-04-23 00:00:00
Maximum2024-04-23 00:00:00
2024-05-11T02:02:43.916296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:02:44.236786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

전화번호
Text

MISSING 

Distinct53
Distinct (%)89.8%
Missing2
Missing (%)3.3%
Memory size620.0 B
2024-05-11T02:02:44.768010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length10.322034
Min length7

Characters and Unicode

Total characters609
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)79.7%

Sample

1st row02 7135346
2nd row02 7192202
3rd row02 7950204
4th row02 7901020
5th row02 7958484
ValueCountFrequency (%)
02 42
32.6%
793 5
 
3.9%
02797 3
 
2.3%
02712 3
 
2.3%
7950757 2
 
1.6%
0072 2
 
1.6%
716 2
 
1.6%
712 2
 
1.6%
2900 2
 
1.6%
02713 2
 
1.6%
Other values (57) 64
49.6%
2024-05-11T02:02:45.789884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 101
16.6%
2 90
14.8%
7 90
14.8%
79
13.0%
1 48
7.9%
5 45
7.4%
9 42
6.9%
3 38
 
6.2%
4 34
 
5.6%
8 22
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 530
87.0%
Space Separator 79
 
13.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 101
19.1%
2 90
17.0%
7 90
17.0%
1 48
9.1%
5 45
8.5%
9 42
7.9%
3 38
 
7.2%
4 34
 
6.4%
8 22
 
4.2%
6 20
 
3.8%
Space Separator
ValueCountFrequency (%)
79
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 609
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 101
16.6%
2 90
14.8%
7 90
14.8%
79
13.0%
1 48
7.9%
5 45
7.4%
9 42
6.9%
3 38
 
6.2%
4 34
 
5.6%
8 22
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 609
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 101
16.6%
2 90
14.8%
7 90
14.8%
79
13.0%
1 48
7.9%
5 45
7.4%
9 42
6.9%
3 38
 
6.2%
4 34
 
5.6%
8 22
 
3.6%

소재지면적
Real number (ℝ)

MISSING 

Distinct23
Distinct (%)95.8%
Missing37
Missing (%)60.7%
Infinite0
Infinite (%)0.0%
Mean824.47417
Minimum30
Maximum1910
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size681.0 B
2024-05-11T02:02:46.247791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile188.07
Q1574.95
median749.19
Q31019.25
95-th percentile1773.15
Maximum1910
Range1880
Interquartile range (IQR)444.3

Descriptive statistics

Standard deviation462.3877
Coefficient of variation (CV)0.5608274
Kurtosis0.69343198
Mean824.47417
Median Absolute Deviation (MAD)259.5
Skewness0.77239022
Sum19787.38
Variance213802.39
MonotonicityNot monotonic
2024-05-11T02:02:46.661160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)
ValueCountFrequency (%)
595.0 2
 
3.3%
30.0 1
 
1.6%
749.4 1
 
1.6%
610.0 1
 
1.6%
1001.0 1
 
1.6%
482.0 1
 
1.6%
1020.0 1
 
1.6%
157.2 1
 
1.6%
807.9 1
 
1.6%
669.2 1
 
1.6%
Other values (13) 13
 
21.3%
(Missing) 37
60.7%
ValueCountFrequency (%)
30.0 1
1.6%
157.2 1
1.6%
363.0 1
1.6%
386.0 1
1.6%
482.0 1
1.6%
514.8 1
1.6%
595.0 2
3.3%
610.0 1
1.6%
664.0 1
1.6%
669.2 1
1.6%
ValueCountFrequency (%)
1910.0 1
1.6%
1821.0 1
1.6%
1502.0 1
1.6%
1263.9 1
1.6%
1086.0 1
1.6%
1020.0 1
1.6%
1019.0 1
1.6%
1001.0 1
1.6%
910.0 1
1.6%
882.0 1
1.6%

소재지우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing61
Missing (%)100.0%
Memory size681.0 B
Distinct58
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size620.0 B
2024-05-11T02:02:47.235394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length37
Mean length22.245902
Min length17

Characters and Unicode

Total characters1357
Distinct characters75
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)90.2%

Sample

1st row서울특별시 용산구 원효로1가 40-13
2nd row서울특별시 용산구 원효로2가 70-2
3rd row서울특별시 용산구 한강로3가 65-155
4th row서울특별시 용산구 이촌동 302-79
5th row서울특별시 용산구 한남동 707-14
ValueCountFrequency (%)
서울특별시 60
23.3%
용산구 59
22.9%
한남동 7
 
2.7%
한강로3가 6
 
2.3%
이촌동 5
 
1.9%
보광동 5
 
1.9%
원효로3가 4
 
1.6%
이태원동 4
 
1.6%
용산동2가 3
 
1.2%
후암동 3
 
1.2%
Other values (87) 102
39.5%
2024-05-11T02:02:48.348144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
240
17.7%
1 78
 
5.7%
64
 
4.7%
64
 
4.7%
63
 
4.6%
61
 
4.5%
60
 
4.4%
60
 
4.4%
60
 
4.4%
60
 
4.4%
Other values (65) 547
40.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 752
55.4%
Decimal Number 310
22.8%
Space Separator 240
 
17.7%
Dash Punctuation 53
 
3.9%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
8.5%
64
 
8.5%
63
 
8.4%
61
 
8.1%
60
 
8.0%
60
 
8.0%
60
 
8.0%
60
 
8.0%
45
 
6.0%
26
 
3.5%
Other values (51) 189
25.1%
Decimal Number
ValueCountFrequency (%)
1 78
25.2%
2 42
13.5%
3 41
13.2%
4 29
 
9.4%
0 28
 
9.0%
7 24
 
7.7%
6 21
 
6.8%
5 19
 
6.1%
9 17
 
5.5%
8 11
 
3.5%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
s 1
50.0%
Space Separator
ValueCountFrequency (%)
240
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 752
55.4%
Common 603
44.4%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
8.5%
64
 
8.5%
63
 
8.4%
61
 
8.1%
60
 
8.0%
60
 
8.0%
60
 
8.0%
60
 
8.0%
45
 
6.0%
26
 
3.5%
Other values (51) 189
25.1%
Common
ValueCountFrequency (%)
240
39.8%
1 78
 
12.9%
- 53
 
8.8%
2 42
 
7.0%
3 41
 
6.8%
4 29
 
4.8%
0 28
 
4.6%
7 24
 
4.0%
6 21
 
3.5%
5 19
 
3.2%
Other values (2) 28
 
4.6%
Latin
ValueCountFrequency (%)
k 1
50.0%
s 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 752
55.4%
ASCII 605
44.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
240
39.7%
1 78
 
12.9%
- 53
 
8.8%
2 42
 
6.9%
3 41
 
6.8%
4 29
 
4.8%
0 28
 
4.6%
7 24
 
4.0%
6 21
 
3.5%
5 19
 
3.1%
Other values (4) 30
 
5.0%
Hangul
ValueCountFrequency (%)
64
 
8.5%
64
 
8.5%
63
 
8.4%
61
 
8.1%
60
 
8.0%
60
 
8.0%
60
 
8.0%
60
 
8.0%
45
 
6.0%
26
 
3.5%
Other values (51) 189
25.1%

도로명주소
Text

MISSING 

Distinct52
Distinct (%)92.9%
Missing5
Missing (%)8.2%
Memory size620.0 B
2024-05-11T02:02:48.954602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length32
Mean length26.410714
Min length21

Characters and Unicode

Total characters1479
Distinct characters91
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)85.7%

Sample

1st row서울특별시 용산구 원효로 178 (원효로2가)
2nd row서울특별시 용산구 한강대로 48 (한강로3가)
3rd row서울특별시 용산구 이촌로 166 (이촌동)
4th row서울특별시 용산구 한남대로 82 (한남동)
5th row서울특별시 용산구 녹사평대로11길 24 (서빙고동)
ValueCountFrequency (%)
서울특별시 55
 
19.1%
용산구 54
 
18.8%
한남동 7
 
2.4%
원효로 6
 
2.1%
보광동 5
 
1.7%
5 5
 
1.7%
한강대로 4
 
1.4%
원효로3가 4
 
1.4%
이촌동 4
 
1.4%
한남대로 4
 
1.4%
Other values (105) 140
48.6%
2024-05-11T02:02:49.937682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
245
 
16.6%
72
 
4.9%
60
 
4.1%
58
 
3.9%
58
 
3.9%
) 57
 
3.9%
( 57
 
3.9%
56
 
3.8%
55
 
3.7%
55
 
3.7%
Other values (81) 706
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 898
60.7%
Space Separator 245
 
16.6%
Decimal Number 212
 
14.3%
Close Punctuation 57
 
3.9%
Open Punctuation 57
 
3.9%
Other Punctuation 6
 
0.4%
Dash Punctuation 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
8.0%
60
 
6.7%
58
 
6.5%
58
 
6.5%
56
 
6.2%
55
 
6.1%
55
 
6.1%
55
 
6.1%
55
 
6.1%
43
 
4.8%
Other values (66) 331
36.9%
Decimal Number
ValueCountFrequency (%)
1 50
23.6%
2 30
14.2%
3 29
13.7%
4 20
 
9.4%
0 18
 
8.5%
8 18
 
8.5%
7 16
 
7.5%
6 15
 
7.1%
5 11
 
5.2%
9 5
 
2.4%
Space Separator
ValueCountFrequency (%)
245
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 898
60.7%
Common 581
39.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
8.0%
60
 
6.7%
58
 
6.5%
58
 
6.5%
56
 
6.2%
55
 
6.1%
55
 
6.1%
55
 
6.1%
55
 
6.1%
43
 
4.8%
Other values (66) 331
36.9%
Common
ValueCountFrequency (%)
245
42.2%
) 57
 
9.8%
( 57
 
9.8%
1 50
 
8.6%
2 30
 
5.2%
3 29
 
5.0%
4 20
 
3.4%
0 18
 
3.1%
8 18
 
3.1%
7 16
 
2.8%
Other values (5) 41
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 898
60.7%
ASCII 581
39.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
245
42.2%
) 57
 
9.8%
( 57
 
9.8%
1 50
 
8.6%
2 30
 
5.2%
3 29
 
5.0%
4 20
 
3.4%
0 18
 
3.1%
8 18
 
3.1%
7 16
 
2.8%
Other values (5) 41
 
7.1%
Hangul
ValueCountFrequency (%)
72
 
8.0%
60
 
6.7%
58
 
6.5%
58
 
6.5%
56
 
6.2%
55
 
6.1%
55
 
6.1%
55
 
6.1%
55
 
6.1%
43
 
4.8%
Other values (66) 331
36.9%

도로명우편번호
Categorical

IMBALANCE 

Distinct5
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size620.0 B
<NA>
57 
4363
 
1
4334
 
1
4417
 
1
4382
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique4 ?
Unique (%)6.6%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 57
93.4%
4363 1
 
1.6%
4334 1
 
1.6%
4417 1
 
1.6%
4382 1
 
1.6%

Length

2024-05-11T02:02:50.425301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:50.927320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 57
93.4%
4363 1
 
1.6%
4334 1
 
1.6%
4417 1
 
1.6%
4382 1
 
1.6%
Distinct58
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size620.0 B
2024-05-11T02:02:51.681098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length8.0819672
Min length3

Characters and Unicode

Total characters493
Distinct characters96
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)90.2%

Sample

1st row원일주유소
2nd row(주)영원에너지 풍기주유소 (대표자:박종영)
3rd row용산뉴타운주유소
4th row한국석유공업(주) 한석주유소
5th row(주)중앙에너비스 에너비스
ValueCountFrequency (%)
에이치디현대오일뱅크(주)직영 4
 
5.2%
주식회사 3
 
3.9%
서울상사 2
 
2.6%
중앙석유 2
 
2.6%
주)중앙에너비스 2
 
2.6%
쌍용석유 2
 
2.6%
동성석유 2
 
2.6%
동양연료 1
 
1.3%
이태원석유 1
 
1.3%
세정석유(주 1
 
1.3%
Other values (57) 57
74.0%
2024-05-11T02:02:52.841495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
9.5%
46
 
9.3%
24
 
4.9%
23
 
4.7%
) 22
 
4.5%
( 22
 
4.5%
16
 
3.2%
15
 
3.0%
11
 
2.2%
10
 
2.0%
Other values (86) 257
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 427
86.6%
Close Punctuation 22
 
4.5%
Open Punctuation 22
 
4.5%
Space Separator 16
 
3.2%
Uppercase Letter 4
 
0.8%
Decimal Number 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
11.0%
46
 
10.8%
24
 
5.6%
23
 
5.4%
15
 
3.5%
11
 
2.6%
10
 
2.3%
10
 
2.3%
10
 
2.3%
10
 
2.3%
Other values (79) 221
51.8%
Uppercase Letter
ValueCountFrequency (%)
S 2
50.0%
K 2
50.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%
Other Punctuation
ValueCountFrequency (%)
: 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 427
86.6%
Common 62
 
12.6%
Latin 4
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
11.0%
46
 
10.8%
24
 
5.6%
23
 
5.4%
15
 
3.5%
11
 
2.6%
10
 
2.3%
10
 
2.3%
10
 
2.3%
10
 
2.3%
Other values (79) 221
51.8%
Common
ValueCountFrequency (%)
) 22
35.5%
( 22
35.5%
16
25.8%
3 1
 
1.6%
: 1
 
1.6%
Latin
ValueCountFrequency (%)
S 2
50.0%
K 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 427
86.6%
ASCII 66
 
13.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
47
 
11.0%
46
 
10.8%
24
 
5.6%
23
 
5.4%
15
 
3.5%
11
 
2.6%
10
 
2.3%
10
 
2.3%
10
 
2.3%
10
 
2.3%
Other values (79) 221
51.8%
ASCII
ValueCountFrequency (%)
) 22
33.3%
( 22
33.3%
16
24.2%
S 2
 
3.0%
K 2
 
3.0%
3 1
 
1.5%
: 1
 
1.5%

최종수정일자
Date

UNIQUE 

Distinct61
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size620.0 B
Minimum2001-03-24 00:00:00
Maximum2024-04-29 17:43:52
2024-05-11T02:02:53.355359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-11T02:02:54.172319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size620.0 B
I
40 
U
21 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowU
3rd rowU
4th rowU
5th rowI

Common Values

ValueCountFrequency (%)
I 40
65.6%
U 21
34.4%

Length

2024-05-11T02:02:54.688780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:55.084689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 40
65.6%
u 21
34.4%
Distinct17
Distinct (%)27.9%
Missing0
Missing (%)0.0%
Memory size620.0 B
2018-08-31 23:59:59.0
40 
2022-10-30 22:07:00.0
2022-12-04 00:08:00.0
 
2
2019-10-09 02:40:00.0
 
1
2021-04-10 02:40:00.0
 
1
Other values (12)
12 

Length

Max length21
Median length21
Mean length21
Min length21

Unique

Unique14 ?
Unique (%)23.0%

Sample

1st row2018-08-31 23:59:59.0
2nd row2022-10-30 22:07:00.0
3rd row2021-04-10 02:40:00.0
4th row2021-12-07 22:07:00.0
5th row2018-08-31 23:59:59.0

Common Values

ValueCountFrequency (%)
2018-08-31 23:59:59.0 40
65.6%
2022-10-30 22:07:00.0 5
 
8.2%
2022-12-04 00:08:00.0 2
 
3.3%
2019-10-09 02:40:00.0 1
 
1.6%
2021-04-10 02:40:00.0 1
 
1.6%
2021-12-07 22:07:00.0 1
 
1.6%
2020-04-01 02:40:00.0 1
 
1.6%
2018-11-16 02:37:39.0 1
 
1.6%
2023-12-03 22:04:00.0 1
 
1.6%
2020-01-15 02:40:00.0 1
 
1.6%
Other values (7) 7
 
11.5%

Length

2024-05-11T02:02:55.511225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2018-08-31 40
32.8%
23:59:59.0 40
32.8%
22:07:00.0 6
 
4.9%
2022-10-30 5
 
4.1%
02:40:00.0 5
 
4.1%
2022-12-04 3
 
2.5%
00:08:00.0 2
 
1.6%
2021-10-30 1
 
0.8%
23:03:00.0 1
 
0.8%
2023-12-05 1
 
0.8%
Other values (18) 18
14.8%

업태구분명
Categorical

Distinct3
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size620.0 B
일반판매소
34 
주유소
24 
용제판매소
 
3

Length

Max length5
Median length5
Mean length4.2131148
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주유소
2nd row주유소
3rd row주유소
4th row주유소
5th row주유소

Common Values

ValueCountFrequency (%)
일반판매소 34
55.7%
주유소 24
39.3%
용제판매소 3
 
4.9%

Length

2024-05-11T02:02:55.995374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T02:02:56.351624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반판매소 34
55.7%
주유소 24
39.3%
용제판매소 3
 
4.9%

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct50
Distinct (%)87.7%
Missing4
Missing (%)6.6%
Infinite0
Infinite (%)0.0%
Mean197927.54
Minimum192559.99
Maximum206901.07
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size681.0 B
2024-05-11T02:02:56.711897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum192559.99
5-th percentile195862.96
Q1196610.66
median197480.05
Q3199354.47
95-th percentile200496.99
Maximum206901.07
Range14341.073
Interquartile range (IQR)2743.8052

Descriptive statistics

Standard deviation2090.0409
Coefficient of variation (CV)0.010559627
Kurtosis5.062721
Mean197927.54
Median Absolute Deviation (MAD)1279.9455
Skewness1.2436432
Sum11281870
Variance4368271.1
MonotonicityNot monotonic
2024-05-11T02:02:57.210424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
199943.016460781 2
 
3.3%
195956.83241222 2
 
3.3%
200471.903312912 2
 
3.3%
197570.006193384 2
 
3.3%
199354.469018798 2
 
3.3%
196200.099737911 2
 
3.3%
195996.987059253 2
 
3.3%
199902.25812814 1
 
1.6%
198844.701739064 1
 
1.6%
198511.640616644 1
 
1.6%
Other values (40) 40
65.6%
(Missing) 4
 
6.6%
ValueCountFrequency (%)
192559.994725361 1
1.6%
195122.135399337 1
1.6%
195770.709246897 1
1.6%
195886.024469289 1
1.6%
195956.83241222 2
3.3%
195996.987059253 2
3.3%
196045.41620518 1
1.6%
196200.099737911 2
3.3%
196326.996964933 1
1.6%
196440.382042886 1
1.6%
ValueCountFrequency (%)
206901.067778 1
1.6%
200646.877707817 1
1.6%
200597.33115081 1
1.6%
200471.903312912 2
3.3%
200469.015383985 1
1.6%
200184.397422031 1
1.6%
199993.699495652 1
1.6%
199943.016460781 2
3.3%
199902.25812814 1
1.6%
199717.48491779 1
1.6%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct50
Distinct (%)87.7%
Missing4
Missing (%)6.6%
Infinite0
Infinite (%)0.0%
Mean446866.81
Minimum367351.46
Maximum452852.02
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size681.0 B
2024-05-11T02:02:57.745917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum367351.46
5-th percentile446896.98
Q1447651.53
median447987.02
Q3448947.07
95-th percentile450014.97
Maximum452852.02
Range85500.556
Interquartile range (IQR)1295.5359

Descriptive statistics

Standard deviation10775.61
Coefficient of variation (CV)0.024113694
Kurtosis55.773422
Mean446866.81
Median Absolute Deviation (MAD)496.41544
Skewness-7.4280746
Sum25471408
Variance1.1611376 × 108
MonotonicityNot monotonic
2024-05-11T02:02:58.288996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
447386.725571553 2
 
3.3%
447651.530984522 2
 
3.3%
448199.065849486 2
 
3.3%
447953.578982142 2
 
3.3%
447943.138508679 2
 
3.3%
447987.019176963 2
 
3.3%
447116.532297206 2
 
3.3%
447753.685996193 1
 
1.6%
448046.692564344 1
 
1.6%
449438.219426092 1
 
1.6%
Other values (40) 40
65.6%
(Missing) 4
 
6.6%
ValueCountFrequency (%)
367351.46398 1
1.6%
446573.556869158 1
1.6%
446670.60098533 1
1.6%
446953.577094148 1
1.6%
446986.163348884 1
1.6%
447034.002502953 1
1.6%
447095.537114027 1
1.6%
447116.532297206 2
3.3%
447386.725571553 2
3.3%
447573.941467913 1
1.6%
ValueCountFrequency (%)
452852.020155302 1
1.6%
450238.200889239 1
1.6%
450045.800889568 1
1.6%
450007.264946026 1
1.6%
449769.124271782 1
1.6%
449682.624022977 1
1.6%
449606.70883809 1
1.6%
449468.730248023 1
1.6%
449438.219426092 1
1.6%
449419.918621066 1
1.6%

자본금
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing61
Missing (%)100.0%
Memory size681.0 B

거래처
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing61
Missing (%)100.0%
Memory size681.0 B

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자본금거래처
03020000197630200740150000419760528<NA>3폐업3폐지20070611<NA><NA><NA>02 7135346514.8<NA>서울특별시 용산구 원효로1가 40-13<NA><NA>원일주유소2009-11-18 14:11:01I2018-08-31 23:59:59.0주유소<NA><NA><NA><NA>
1302000019763020074015000051976-05-26<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 7192202595.0<NA>서울특별시 용산구 원효로2가 70-2서울특별시 용산구 원효로 178 (원효로2가)<NA>(주)영원에너지 풍기주유소 (대표자:박종영)2023-10-25 16:13:55U2022-10-30 22:07:00.0주유소196633.605455448152.685899<NA><NA>
23020000197630200740150000619760517<NA>3폐업3폐지<NA><NA><NA><NA>02 7950204664.0<NA>서울특별시 용산구 한강로3가 65-155서울특별시 용산구 한강대로 48 (한강로3가)<NA>용산뉴타운주유소2021-04-08 10:05:58U2021-04-10 02:40:00.0주유소196779.979322446986.163349<NA><NA>
33020000197630200740150000719760517<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 79010201019.0<NA>서울특별시 용산구 이촌동 302-79서울특별시 용산구 이촌로 166 (이촌동)<NA>한국석유공업(주) 한석주유소2022-08-24 14:00:54U2021-12-07 22:07:00.0주유소196885.500803446573.556869<NA><NA>
43020000197630200740150001019760526<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 7958484882.0<NA>서울특별시 용산구 한남동 707-14서울특별시 용산구 한남대로 82 (한남동)<NA>(주)중앙에너비스 에너비스2017-11-23 11:24:58I2018-08-31 23:59:59.0주유소200471.903313448199.065849<NA><NA>
5302000019763020074015000111976-05-24<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 79329541821.0<NA>서울특별시 용산구 서빙고동 37서울특별시 용산구 녹사평대로11길 24 (서빙고동)<NA>(주)남경주유소2023-10-25 16:13:11U2022-10-30 22:07:00.0주유소199330.560094446670.600985<NA><NA>
63020000197930200740120000119790525<NA>3폐업3폐지<NA><NA><NA><NA>02797 1243<NA><NA>서울특별시 용산구 남영동 80-1서울특별시 용산구 한강대로80길 11-3 (남영동)<NA>남영사2006-06-12 00:00:00I2018-08-31 23:59:59.0일반판매소197584.699532448947.066903<NA><NA>
73020000197930200740120000219790525<NA>3폐업3폐지20080826<NA><NA><NA>02713 6611<NA><NA>서울특별시 용산구 원효로1가 115-3<NA><NA>원일사2008-08-26 15:09:55I2018-08-31 23:59:59.0일반판매소<NA><NA><NA><NA>
83020000197930200740120000319790525<NA>3폐업3폐지<NA><NA><NA><NA>02713 3369<NA><NA>서울특별시 용산구 용문동 5-52서울특별시 용산구 백범로 308 (용문동)<NA>용남석유2016-01-11 14:08:50I2018-08-31 23:59:59.0일반판매소196668.467439448483.434617<NA><NA>
93020000197930200740120000419790525<NA>3폐업3폐지20200330<NA><NA><NA>02 719 6300<NA><NA>서울특별시 용산구 원효로3가 227-23서울특별시 용산구 원효로 113 (원효로3가)<NA>우진에너지2020-03-30 15:08:21U2020-04-01 02:40:00.0일반판매소196045.416205447853.764604<NA><NA>
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)자본금거래처
51302000020033020074015000021976-04-30<NA>1영업/정상1신규등록<NA><NA><NA><NA>0275452171020.0<NA>서울특별시 용산구 갈월동 11-34 sk갈월동주유소서울특별시 용산구 한강대로 322, 갈월동주유소 (갈월동)4334에이치디현대오일뱅크(주)직영 갈월동주유소2023-04-05 09:52:00U2022-12-04 00:07:00.0주유소197476.794419449419.918621<NA><NA>
52302000020033020074015000071976-05-24<NA>1영업/정상1신규등록<NA>2012-05-212012-08-10<NA>02 7955688482.0<NA>서울특별시 용산구 한남동 704-14서울특별시 용산구 한남대로21길 4 (한남동)<NA>(주)중앙에너비스 한남지점2023-10-25 16:17:18U2022-10-30 22:07:00.0주유소200469.015384448053.840358<NA><NA>
533020000200330200740150001520030925<NA>3폐업3폐지<NA><NA><NA><NA>027947665595.0<NA>서울특별시 용산구 한남동 739-11서울특별시 용산구 이태원로 255 (한남동)<NA>SK네트웍스(주)남산주유소2015-12-22 17:43:31I2018-08-31 23:59:59.0주유소199993.699496448367.664079<NA><NA>
54302000020033020074015000172003-09-25<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 75452171001.0<NA>서울특별시 용산구 한남동 726-370서울특별시 용산구 한남대로 204 (한남동)4417에이치디현대오일뱅크(주)직영 한남동주유소2023-04-12 09:04:59U2022-12-03 23:04:00.0주유소200184.397422449300.985016<NA><NA>
553020000200330200740150001819760630<NA>3폐업3폐지<NA><NA><NA><NA>02 7019130610.0<NA>서울특별시 용산구 청파동1가 180서울특별시 용산구 청파로 311 (청파동1가)<NA>SK네트웍스(주) 청파주유소2018-03-26 17:14:06I2018-08-31 23:59:59.0주유소197271.499019449468.730248<NA><NA>
563020000200430200740150000120040324<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 7905736<NA><NA>서울특별시 용산구 한강로2가 244-1서울특별시 용산구 한강대로 127-2 (한강로2가)<NA>(주)하나유화2004-03-24 00:00:00I2018-08-31 23:59:59.0용제판매소197231.546341447661.066844<NA><NA>
57302000020053020074015091881976-05-04<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 7066485749.4<NA>서울특별시 용산구 서계동 47-15서울특별시 용산구 청파로 367 (서계동)<NA>서계주유소2023-10-25 16:16:09U2022-10-30 22:07:00.0주유소197186.302872450007.264946<NA><NA>
583020000200730200950150000120070502<NA>3폐업3폐지<NA><NA><NA><NA>02 7950757<NA><NA>서울특별시 용산구 한강로3가 16-91 한강그랜드 오피스텔711호서울특별시 용산구 이촌로 5 (한강로3가,한강그랜드 오피스텔711호)<NA>(주)에치아이씨2013-04-10 14:17:19I2018-08-31 23:59:59.0용제판매소195956.832412447651.530985<NA><NA>
593020000200730200950150000220070502<NA>1영업/정상1신규등록<NA><NA><NA><NA>02 7950757<NA><NA>서울특별시 용산구 한강로3가 16-91 한강그랜드오피스텔 711호서울특별시 용산구 이촌로 5 (한강로3가)<NA>(주)에이치아이씨2011-10-30 13:39:15I2018-08-31 23:59:59.0용제판매소195956.832412447651.530985<NA><NA>
603020000201830201560120000120181122<NA>1영업/정상1신규등록<NA><NA><NA><NA><NA>30.0<NA>서울특별시 용산구 한강로1가 231-23서울특별시 용산구 한강대로62길 18 (한강로1가)4382용산에너지2020-09-24 18:12:04U2020-09-26 02:40:00.0일반판매소197570.006193447953.578982<NA><NA>