Overview

Dataset statistics

Number of variables27
Number of observations92
Missing cells694
Missing cells (%)27.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.8 KiB
Average record size in memory231.4 B

Variable types

Categorical8
Numeric5
DateTime3
Unsupported5
Text6

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),취급제품명,담배공급업체명
Author강남구
URLhttps://data.seoul.go.kr/dataList/OA-19899/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
인허가취소일자 is highly imbalanced (83.8%)Imbalance
데이터갱신일자 is highly imbalanced (57.5%)Imbalance
폐업일자 has 80 (87.0%) missing valuesMissing
휴업시작일자 has 92 (100.0%) missing valuesMissing
휴업종료일자 has 92 (100.0%) missing valuesMissing
재개업일자 has 92 (100.0%) missing valuesMissing
전화번호 has 32 (34.8%) missing valuesMissing
소재지면적 has 92 (100.0%) missing valuesMissing
소재지우편번호 has 35 (38.0%) missing valuesMissing
지번주소 has 1 (1.1%) missing valuesMissing
도로명주소 has 31 (33.7%) missing valuesMissing
도로명우편번호 has 31 (33.7%) missing valuesMissing
업태구분명 has 92 (100.0%) missing valuesMissing
좌표정보(X) has 1 (1.1%) missing valuesMissing
좌표정보(Y) has 1 (1.1%) missing valuesMissing
취급제품명 has 11 (12.0%) missing valuesMissing
담배공급업체명 has 11 (12.0%) missing valuesMissing
관리번호 has unique valuesUnique
최종수정일자 has unique valuesUnique
휴업시작일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
휴업종료일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-06 12:20:25.543031
Analysis finished2024-04-06 12:20:26.698468
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size868.0 B
3220000
92 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3220000
2nd row3220000
3rd row3220000
4th row3220000
5th row3220000

Common Values

ValueCountFrequency (%)
3220000 92
100.0%

Length

2024-04-06T21:20:26.820018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:27.072253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3220000 92
100.0%

관리번호
Real number (ℝ)

UNIQUE 

Distinct92
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0133872 × 1018
Minimum1.994322 × 1018
Maximum2.023322 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2024-04-06T21:20:27.383288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.994322 × 1018
5-th percentile2.004322 × 1018
Q12.009322 × 1018
median2.014322 × 1018
Q32.015322 × 1018
95-th percentile2.022322 × 1018
Maximum2.023322 × 1018
Range2.9000012 × 1016
Interquartile range (IQR)6.0000035 × 1015

Descriptive statistics

Standard deviation5.7067657 × 1015
Coefficient of variation (CV)0.0028344104
Kurtosis0.27871822
Mean2.0133872 × 1018
Median Absolute Deviation (MAD)4 × 1015
Skewness-0.37122185
Sum7.6418477 × 1017
Variance3.2567175 × 1031
MonotonicityStrictly increasing
2024-04-06T21:20:27.723665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1994322012715500001 1
 
1.1%
2015322016215500010 1
 
1.1%
2015322016215500019 1
 
1.1%
2015322016215500018 1
 
1.1%
2015322016215500017 1
 
1.1%
2015322016215500016 1
 
1.1%
2015322016215500015 1
 
1.1%
2015322016215500014 1
 
1.1%
2015322016215500013 1
 
1.1%
2015322016215500012 1
 
1.1%
Other values (82) 82
89.1%
ValueCountFrequency (%)
1994322012715500001 1
1.1%
2002322008312500001 1
1.1%
2002322012715500015 1
1.1%
2004322008312500001 1
1.1%
2004322008312500002 1
1.1%
2004322008312500003 1
1.1%
2004322012715600008 1
1.1%
2005322008312500001 1
1.1%
2005322008312500002 1
1.1%
2005322016215500001 1
1.1%
ValueCountFrequency (%)
2023322024915500004 1
1.1%
2023322024915500003 1
1.1%
2023322024915500002 1
1.1%
2023322024915500001 1
1.1%
2022322024915500003 1
1.1%
2022322024915500002 1
1.1%
2022322024915500001 1
1.1%
2021322024915500004 1
1.1%
2021322024915500003 1
1.1%
2021322024915500002 1
1.1%
Distinct84
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
Minimum1994-12-10 00:00:00
Maximum2023-09-26 00:00:00
2024-04-06T21:20:28.020509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T21:20:28.311098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Categorical

IMBALANCE 

Distinct4
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
<NA>
88 
20160212
 
2
20160422
 
1
20160615
 
1

Length

Max length8
Median length4
Mean length4.173913
Min length4

Unique

Unique2 ?
Unique (%)2.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 88
95.7%
20160212 2
 
2.2%
20160422 1
 
1.1%
20160615 1
 
1.1%

Length

2024-04-06T21:20:28.597306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:28.832243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 88
95.7%
20160212 2
 
2.2%
20160422 1
 
1.1%
20160615 1
 
1.1%
Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
1
76 
3
12 
4
 
4

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 76
82.6%
3 12
 
13.0%
4 4
 
4.3%

Length

2024-04-06T21:20:29.060089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:29.243781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 76
82.6%
3 12
 
13.0%
4 4
 
4.3%

영업상태명
Categorical

Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
영업/정상
76 
폐업
12 
취소/말소/만료/정지/중지
 
4

Length

Max length14
Median length5
Mean length5
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row폐업
3rd row영업/정상
4th row영업/정상
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 76
82.6%
폐업 12
 
13.0%
취소/말소/만료/정지/중지 4
 
4.3%

Length

2024-04-06T21:20:29.462963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:29.647637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 76
82.6%
폐업 12
 
13.0%
취소/말소/만료/정지/중지 4
 
4.3%
Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
1
76 
3
12 
4
 
4

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row3
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 76
82.6%
3 12
 
13.0%
4 4
 
4.3%

Length

2024-04-06T21:20:29.839426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:30.402571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 76
82.6%
3 12
 
13.0%
4 4
 
4.3%
Distinct3
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size868.0 B
정상영업
76 
폐업처리
12 
직권취소
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상영업
2nd row폐업처리
3rd row정상영업
4th row정상영업
5th row정상영업

Common Values

ValueCountFrequency (%)
정상영업 76
82.6%
폐업처리 12
 
13.0%
직권취소 4
 
4.3%

Length

2024-04-06T21:20:30.597792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:30.899756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상영업 76
82.6%
폐업처리 12
 
13.0%
직권취소 4
 
4.3%

폐업일자
Date

MISSING 

Distinct11
Distinct (%)91.7%
Missing80
Missing (%)87.0%
Memory size868.0 B
Minimum2006-03-28 00:00:00
Maximum2023-05-09 00:00:00
2024-04-06T21:20:31.088821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T21:20:31.332573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)

휴업시작일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing92
Missing (%)100.0%
Memory size960.0 B

휴업종료일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing92
Missing (%)100.0%
Memory size960.0 B

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing92
Missing (%)100.0%
Memory size960.0 B

전화번호
Text

MISSING 

Distinct53
Distinct (%)88.3%
Missing32
Missing (%)34.8%
Memory size868.0 B
2024-04-06T21:20:31.799600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length10.383333
Min length8

Characters and Unicode

Total characters623
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)78.3%

Sample

1st row02 5571916
2nd row02-528-7033
3rd row0220510252
4th row0221127325
5th row0221127325
ValueCountFrequency (%)
02 4
 
6.2%
0221127325 3
 
4.6%
02-6959-9075 2
 
3.1%
02-3446-5546 2
 
3.1%
02-546-0879 2
 
3.1%
414-0852 2
 
3.1%
02-546-3757 2
 
3.1%
02-544-5999 1
 
1.5%
023069080 1
 
1.5%
070-7555-7008 1
 
1.5%
Other values (45) 45
69.2%
2024-04-06T21:20:32.635732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 97
15.6%
2 91
14.6%
5 87
14.0%
- 72
11.6%
1 43
6.9%
9 42
6.7%
7 41
6.6%
4 40
6.4%
6 40
6.4%
8 33
 
5.3%
Other values (2) 37
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 545
87.5%
Dash Punctuation 72
 
11.6%
Space Separator 6
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 97
17.8%
2 91
16.7%
5 87
16.0%
1 43
7.9%
9 42
7.7%
7 41
7.5%
4 40
7.3%
6 40
7.3%
8 33
 
6.1%
3 31
 
5.7%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 623
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 97
15.6%
2 91
14.6%
5 87
14.0%
- 72
11.6%
1 43
6.9%
9 42
6.7%
7 41
6.6%
4 40
6.4%
6 40
6.4%
8 33
 
5.3%
Other values (2) 37
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 623
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 97
15.6%
2 91
14.6%
5 87
14.0%
- 72
11.6%
1 43
6.9%
9 42
6.7%
7 41
6.6%
4 40
6.4%
6 40
6.4%
8 33
 
5.3%
Other values (2) 37
 
5.9%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing92
Missing (%)100.0%
Memory size960.0 B

소재지우편번호
Real number (ℝ)

MISSING 

Distinct45
Distinct (%)78.9%
Missing35
Missing (%)38.0%
Infinite0
Infinite (%)0.0%
Mean135735.91
Minimum135010
Maximum135998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2024-04-06T21:20:32.934503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum135010
5-th percentile135080
Q1135747
median135839
Q3135910
95-th percentile135962
Maximum135998
Range988
Interquartile range (IQR)163

Descriptive statistics

Standard deviation275.56898
Coefficient of variation (CV)0.0020301847
Kurtosis1.2926308
Mean135735.91
Median Absolute Deviation (MAD)80
Skewness-1.6060029
Sum7736947
Variance75938.26
MonotonicityNot monotonic
2024-04-06T21:20:33.238094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
135270 3
 
3.3%
135839 3
 
3.3%
135910 2
 
2.2%
135962 2
 
2.2%
135747 2
 
2.2%
135816 2
 
2.2%
135820 2
 
2.2%
135811 2
 
2.2%
135080 2
 
2.2%
135010 2
 
2.2%
Other values (35) 35
38.0%
(Missing) 35
38.0%
ValueCountFrequency (%)
135010 2
2.2%
135080 2
2.2%
135120 1
 
1.1%
135270 3
3.3%
135280 1
 
1.1%
135509 1
 
1.1%
135517 1
 
1.1%
135545 1
 
1.1%
135726 1
 
1.1%
135744 1
 
1.1%
ValueCountFrequency (%)
135998 1
1.1%
135965 1
1.1%
135962 2
2.2%
135957 1
1.1%
135936 1
1.1%
135935 1
1.1%
135934 1
1.1%
135930 1
1.1%
135925 1
1.1%
135921 1
1.1%

지번주소
Text

MISSING 

Distinct87
Distinct (%)95.6%
Missing1
Missing (%)1.1%
Memory size868.0 B
2024-04-06T21:20:33.770169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length33
Mean length29
Min length17

Characters and Unicode

Total characters2639
Distinct characters128
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)91.2%

Sample

1st row서울특별시 강남구 청담동 97번지 1호 엠빌딩 3층
2nd row서울특별시 강남구 역삼동 828번지 10호
3rd row서울특별시 강남구 삼성동 141번지 32호 BGF사옥
4th row서울특별시 강남구 대치동 981번지 2층
5th row서울특별시 강남구 역삼동 737번지 강남파이낸스센터 42층
ValueCountFrequency (%)
서울특별시 91
 
16.3%
강남구 91
 
16.3%
역삼동 18
 
3.2%
논현동 16
 
2.9%
신사동 10
 
1.8%
대치동 10
 
1.8%
2층 9
 
1.6%
1층 7
 
1.3%
청담동 6
 
1.1%
2호 6
 
1.1%
Other values (203) 294
52.7%
2024-04-06T21:20:34.581889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
474
 
18.0%
1 121
 
4.6%
105
 
4.0%
95
 
3.6%
95
 
3.6%
94
 
3.6%
93
 
3.5%
93
 
3.5%
2 91
 
3.4%
91
 
3.4%
Other values (118) 1287
48.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1578
59.8%
Decimal Number 560
 
21.2%
Space Separator 474
 
18.0%
Dash Punctuation 15
 
0.6%
Uppercase Letter 8
 
0.3%
Other Punctuation 3
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
105
 
6.7%
95
 
6.0%
95
 
6.0%
94
 
6.0%
93
 
5.9%
93
 
5.9%
91
 
5.8%
91
 
5.8%
91
 
5.8%
91
 
5.8%
Other values (99) 639
40.5%
Decimal Number
ValueCountFrequency (%)
1 121
21.6%
2 91
16.2%
0 54
9.6%
3 51
9.1%
4 50
8.9%
9 45
 
8.0%
7 43
 
7.7%
8 40
 
7.1%
6 35
 
6.2%
5 30
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
B 3
37.5%
S 2
25.0%
G 1
 
12.5%
F 1
 
12.5%
K 1
 
12.5%
Space Separator
ValueCountFrequency (%)
474
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1578
59.8%
Common 1052
39.9%
Latin 9
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
105
 
6.7%
95
 
6.0%
95
 
6.0%
94
 
6.0%
93
 
5.9%
93
 
5.9%
91
 
5.8%
91
 
5.8%
91
 
5.8%
91
 
5.8%
Other values (99) 639
40.5%
Common
ValueCountFrequency (%)
474
45.1%
1 121
 
11.5%
2 91
 
8.7%
0 54
 
5.1%
3 51
 
4.8%
4 50
 
4.8%
9 45
 
4.3%
7 43
 
4.1%
8 40
 
3.8%
6 35
 
3.3%
Other values (3) 48
 
4.6%
Latin
ValueCountFrequency (%)
B 3
33.3%
S 2
22.2%
1
 
11.1%
G 1
 
11.1%
F 1
 
11.1%
K 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1578
59.8%
ASCII 1060
40.2%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
474
44.7%
1 121
 
11.4%
2 91
 
8.6%
0 54
 
5.1%
3 51
 
4.8%
4 50
 
4.7%
9 45
 
4.2%
7 43
 
4.1%
8 40
 
3.8%
6 35
 
3.3%
Other values (8) 56
 
5.3%
Hangul
ValueCountFrequency (%)
105
 
6.7%
95
 
6.0%
95
 
6.0%
94
 
6.0%
93
 
5.9%
93
 
5.9%
91
 
5.8%
91
 
5.8%
91
 
5.8%
91
 
5.8%
Other values (99) 639
40.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

MISSING 

Distinct57
Distinct (%)93.4%
Missing31
Missing (%)33.7%
Memory size868.0 B
2024-04-06T21:20:35.109036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length39
Mean length34.409836
Min length23

Characters and Unicode

Total characters2099
Distinct characters126
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)86.9%

Sample

1st row서울특별시 강남구 도산대로 445, 3층 (청담동, 엠빌딩)
2nd row서울특별시 강남구 테헤란로 405 (삼성동)
3rd row서울특별시 강남구 테헤란로 152, 42층 (역삼동, 강남파이낸스센터)
4th row서울특별시 강남구 테헤란로 152, 42층 (역삼동, 강남파이낸스센터)
5th row서울특별시 강남구 테헤란로 301, 19층 (역삼동, 삼정개발빌딩)
ValueCountFrequency (%)
서울특별시 61
 
15.0%
강남구 61
 
15.0%
논현동 15
 
3.7%
역삼동 14
 
3.4%
신사동 9
 
2.2%
대치동 7
 
1.7%
4층 6
 
1.5%
테헤란로 6
 
1.5%
6층 6
 
1.5%
2층 6
 
1.5%
Other values (153) 217
53.2%
2024-04-06T21:20:35.809239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
347
 
16.5%
, 82
 
3.9%
1 77
 
3.7%
72
 
3.4%
71
 
3.4%
66
 
3.1%
66
 
3.1%
2 64
 
3.0%
63
 
3.0%
62
 
3.0%
Other values (116) 1129
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1187
56.6%
Decimal Number 350
 
16.7%
Space Separator 347
 
16.5%
Other Punctuation 82
 
3.9%
Open Punctuation 61
 
2.9%
Close Punctuation 61
 
2.9%
Dash Punctuation 5
 
0.2%
Uppercase Letter 5
 
0.2%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
6.1%
71
 
6.0%
66
 
5.6%
66
 
5.6%
63
 
5.3%
62
 
5.2%
61
 
5.1%
61
 
5.1%
61
 
5.1%
61
 
5.1%
Other values (97) 543
45.7%
Decimal Number
ValueCountFrequency (%)
1 77
22.0%
2 64
18.3%
3 40
11.4%
4 37
10.6%
0 36
10.3%
5 28
 
8.0%
7 25
 
7.1%
6 17
 
4.9%
8 16
 
4.6%
9 10
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
S 3
60.0%
G 1
 
20.0%
K 1
 
20.0%
Space Separator
ValueCountFrequency (%)
347
100.0%
Other Punctuation
ValueCountFrequency (%)
, 82
100.0%
Open Punctuation
ValueCountFrequency (%)
( 61
100.0%
Close Punctuation
ValueCountFrequency (%)
) 61
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1187
56.6%
Common 906
43.2%
Latin 6
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
6.1%
71
 
6.0%
66
 
5.6%
66
 
5.6%
63
 
5.3%
62
 
5.2%
61
 
5.1%
61
 
5.1%
61
 
5.1%
61
 
5.1%
Other values (97) 543
45.7%
Common
ValueCountFrequency (%)
347
38.3%
, 82
 
9.1%
1 77
 
8.5%
2 64
 
7.1%
( 61
 
6.7%
) 61
 
6.7%
3 40
 
4.4%
4 37
 
4.1%
0 36
 
4.0%
5 28
 
3.1%
Other values (5) 73
 
8.1%
Latin
ValueCountFrequency (%)
S 3
50.0%
G 1
 
16.7%
1
 
16.7%
K 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1187
56.6%
ASCII 911
43.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
347
38.1%
, 82
 
9.0%
1 77
 
8.5%
2 64
 
7.0%
( 61
 
6.7%
) 61
 
6.7%
3 40
 
4.4%
4 37
 
4.1%
0 36
 
4.0%
5 28
 
3.1%
Other values (8) 78
 
8.6%
Hangul
ValueCountFrequency (%)
72
 
6.1%
71
 
6.0%
66
 
5.6%
66
 
5.6%
63
 
5.3%
62
 
5.2%
61
 
5.1%
61
 
5.1%
61
 
5.1%
61
 
5.1%
Other values (97) 543
45.7%
Number Forms
ValueCountFrequency (%)
1
100.0%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct52
Distinct (%)85.2%
Missing31
Missing (%)33.7%
Infinite0
Infinite (%)0.0%
Mean74183.033
Minimum6014
Maximum135998
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2024-04-06T21:20:36.126873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6014
5-th percentile6025
Q16153
median135726
Q3135840
95-th percentile135935
Maximum135998
Range129984
Interquartile range (IQR)129687

Descriptive statistics

Standard deviation65309.586
Coefficient of variation (CV)0.88038441
Kurtosis-2.058414
Mean74183.033
Median Absolute Deviation (MAD)239
Skewness-0.10097985
Sum4525165
Variance4.265342 × 109
MonotonicityNot monotonic
2024-04-06T21:20:36.450799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
135820 2
 
2.2%
6236 2
 
2.2%
6014 2
 
2.2%
135892 2
 
2.2%
135747 2
 
2.2%
135816 2
 
2.2%
6349 2
 
2.2%
6179 2
 
2.2%
135811 2
 
2.2%
6026 1
 
1.1%
Other values (42) 42
45.7%
(Missing) 31
33.7%
ValueCountFrequency (%)
6014 2
2.2%
6019 1
1.1%
6025 1
1.1%
6026 1
1.1%
6043 1
1.1%
6045 1
1.1%
6072 1
1.1%
6075 1
1.1%
6098 1
1.1%
6099 1
1.1%
ValueCountFrequency (%)
135998 1
1.1%
135965 1
1.1%
135962 1
1.1%
135935 1
1.1%
135930 1
1.1%
135925 1
1.1%
135921 1
1.1%
135919 1
1.1%
135910 1
1.1%
135892 2
2.2%
Distinct91
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size868.0 B
2024-04-06T21:20:36.883936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length9.1521739
Min length2

Characters and Unicode

Total characters842
Distinct characters186
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)97.8%

Sample

1st row(주)영유통
2nd rowMLK
3rd row주식회사 비지에프리테일
4th row(주)다민엘앤티
5th row브리티쉬아메리칸토바코코리아(주)
ValueCountFrequency (%)
주식회사 26
 
20.0%
로스만스파이스트비브이 2
 
1.5%
korea 2
 
1.5%
이씬코리아 2
 
1.5%
강남지점 2
 
1.5%
스누스코리아 2
 
1.5%
엘에스에스코리아 1
 
0.8%
주)토로코리아 1
 
0.8%
전자담배 1
 
0.8%
로즈데일 1
 
0.8%
Other values (90) 90
69.2%
2024-04-06T21:20:37.548456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
7.4%
38
 
4.5%
( 38
 
4.5%
) 38
 
4.5%
34
 
4.0%
31
 
3.7%
31
 
3.7%
29
 
3.4%
28
 
3.3%
28
 
3.3%
Other values (176) 485
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 684
81.2%
Space Separator 38
 
4.5%
Open Punctuation 38
 
4.5%
Close Punctuation 38
 
4.5%
Uppercase Letter 26
 
3.1%
Lowercase Letter 15
 
1.8%
Other Punctuation 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
9.1%
34
 
5.0%
31
 
4.5%
31
 
4.5%
29
 
4.2%
28
 
4.1%
28
 
4.1%
27
 
3.9%
23
 
3.4%
15
 
2.2%
Other values (150) 376
55.0%
Uppercase Letter
ValueCountFrequency (%)
A 3
11.5%
K 3
11.5%
O 3
11.5%
S 3
11.5%
C 2
7.7%
M 2
7.7%
P 2
7.7%
R 2
7.7%
E 2
7.7%
I 1
 
3.8%
Other values (3) 3
11.5%
Lowercase Letter
ValueCountFrequency (%)
e 3
20.0%
r 3
20.0%
u 2
13.3%
o 2
13.3%
l 2
13.3%
d 1
 
6.7%
s 1
 
6.7%
v 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
: 1
33.3%
Space Separator
ValueCountFrequency (%)
38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 684
81.2%
Common 117
 
13.9%
Latin 41
 
4.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
9.1%
34
 
5.0%
31
 
4.5%
31
 
4.5%
29
 
4.2%
28
 
4.1%
28
 
4.1%
27
 
3.9%
23
 
3.4%
15
 
2.2%
Other values (150) 376
55.0%
Latin
ValueCountFrequency (%)
A 3
 
7.3%
K 3
 
7.3%
O 3
 
7.3%
S 3
 
7.3%
e 3
 
7.3%
r 3
 
7.3%
u 2
 
4.9%
C 2
 
4.9%
M 2
 
4.9%
P 2
 
4.9%
Other values (11) 15
36.6%
Common
ValueCountFrequency (%)
38
32.5%
( 38
32.5%
) 38
32.5%
. 2
 
1.7%
: 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 684
81.2%
ASCII 158
 
18.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
9.1%
34
 
5.0%
31
 
4.5%
31
 
4.5%
29
 
4.2%
28
 
4.1%
28
 
4.1%
27
 
3.9%
23
 
3.4%
15
 
2.2%
Other values (150) 376
55.0%
ASCII
ValueCountFrequency (%)
38
24.1%
( 38
24.1%
) 38
24.1%
A 3
 
1.9%
K 3
 
1.9%
O 3
 
1.9%
S 3
 
1.9%
e 3
 
1.9%
r 3
 
1.9%
u 2
 
1.3%
Other values (16) 24
15.2%

최종수정일자
Date

UNIQUE 

Distinct92
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size868.0 B
Minimum2005-10-13 09:42:35
Maximum2024-01-22 10:35:36
2024-04-06T21:20:37.795559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T21:20:38.040239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size868.0 B
I
80 
U
12 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowU
4th rowI
5th rowU

Common Values

ValueCountFrequency (%)
I 80
87.0%
U 12
 
13.0%

Length

2024-04-06T21:20:38.266487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T21:20:38.456744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 80
87.0%
u 12
 
13.0%

데이터갱신일자
Categorical

IMBALANCE 

Distinct23
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size868.0 B
2018-08-31 23:59:59.0
69 
2021-01-16 02:40:00.0
 
2
2021-12-08 02:40:00.0
 
1
2020-09-23 02:40:00.0
 
1
2022-11-30 23:05:00.0
 
1
Other values (18)
18 

Length

Max length21
Median length21
Mean length21
Min length21

Unique

Unique21 ?
Unique (%)22.8%

Sample

1st row2018-08-31 23:59:59.0
2nd row2018-08-31 23:59:59.0
3rd row2022-10-31 23:06:00.0
4th row2018-08-31 23:59:59.0
5th row2021-01-16 02:40:00.0

Common Values

ValueCountFrequency (%)
2018-08-31 23:59:59.0 69
75.0%
2021-01-16 02:40:00.0 2
 
2.2%
2021-12-08 02:40:00.0 1
 
1.1%
2020-09-23 02:40:00.0 1
 
1.1%
2022-11-30 23:05:00.0 1
 
1.1%
2019-04-13 02:40:00.0 1
 
1.1%
2021-10-31 23:08:00.0 1
 
1.1%
2020-02-26 00:23:23.0 1
 
1.1%
2020-02-29 00:23:35.0 1
 
1.1%
2020-04-12 00:23:22.0 1
 
1.1%
Other values (13) 13
 
14.1%

Length

2024-04-06T21:20:38.616729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2018-08-31 69
37.5%
23:59:59.0 69
37.5%
02:40:00.0 5
 
2.7%
2021-01-16 2
 
1.1%
22:08:00.0 2
 
1.1%
2021-11-01 1
 
0.5%
21:01:00.0 1
 
0.5%
2021-12-14 1
 
0.5%
00:22:42.0 1
 
0.5%
2022-12-04 1
 
0.5%
Other values (32) 32
17.4%

업태구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing92
Missing (%)100.0%
Memory size960.0 B

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct78
Distinct (%)85.7%
Missing1
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean203674.38
Minimum201723.29
Maximum209052.07
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2024-04-06T21:20:38.816105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum201723.29
5-th percentile202059.69
Q1202747.17
median203345.69
Q3204462.57
95-th percentile205606.33
Maximum209052.07
Range7328.7825
Interquartile range (IQR)1715.3969

Descriptive statistics

Standard deviation1404.189
Coefficient of variation (CV)0.0068942838
Kurtosis4.7565984
Mean203674.38
Median Absolute Deviation (MAD)843.58177
Skewness1.763021
Sum18534369
Variance1971746.7
MonotonicityNot monotonic
2024-04-06T21:20:39.063693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
202502.111899175 4
 
4.3%
203169.223281406 3
 
3.3%
208937.760652081 2
 
2.2%
203444.683404827 2
 
2.2%
203744.372341385 2
 
2.2%
201723.29 2
 
2.2%
205166.782425999 2
 
2.2%
204540.923467836 2
 
2.2%
202425.73 2
 
2.2%
203553.450445332 2
 
2.2%
Other values (68) 68
73.9%
ValueCountFrequency (%)
201723.29 2
2.2%
201746.882357963 1
1.1%
201757.105695785 1
1.1%
201894.68499217 1
1.1%
202224.687390502 1
1.1%
202233.328128189 1
1.1%
202271.34425369 1
1.1%
202334.848611446 1
1.1%
202389.712632803 1
1.1%
202395.193957744 1
1.1%
ValueCountFrequency (%)
209052.072465426 1
1.1%
208937.760652081 2
2.2%
205808.345 1
1.1%
205650.740012856 1
1.1%
205561.927843074 1
1.1%
205221.120000001 1
1.1%
205195.491562325 1
1.1%
205166.782425999 2
2.2%
205026.12335825 1
1.1%
204970.473165373 1
1.1%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct78
Distinct (%)85.7%
Missing1
Missing (%)1.1%
Infinite0
Infinite (%)0.0%
Mean444981.93
Minimum441843.31
Maximum447173.64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size960.0 B
2024-04-06T21:20:39.309806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum441843.31
5-th percentile442757.57
Q1444247.39
median444852.88
Q3446085.36
95-th percentile446855.51
Maximum447173.64
Range5330.3235
Interquartile range (IQR)1837.962

Descriptive statistics

Standard deviation1269.6432
Coefficient of variation (CV)0.0028532467
Kurtosis-0.46752082
Mean444981.93
Median Absolute Deviation (MAD)921.24793
Skewness-0.31062684
Sum40493356
Variance1611994
MonotonicityNot monotonic
2024-04-06T21:20:39.561163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
446464.940373234 4
 
4.3%
444178.357261909 3
 
3.3%
442873.588039887 2
 
2.2%
446491.411615032 2
 
2.2%
444548.690712365 2
 
2.2%
445846.695 2
 
2.2%
444887.051830764 2
 
2.2%
444718.882588083 2
 
2.2%
444852.88 2
 
2.2%
446939.804264849 2
 
2.2%
Other values (68) 68
73.9%
ValueCountFrequency (%)
441843.311883943 1
1.1%
441900.400155464 1
1.1%
442595.241952968 1
1.1%
442709.118807822 1
1.1%
442724.388182642 1
1.1%
442790.75055832 1
1.1%
442797.138150563 1
1.1%
442873.588039887 2
2.2%
443149.100356632 1
1.1%
443336.025 1
1.1%
ValueCountFrequency (%)
447173.635377761 1
1.1%
446939.804264849 2
2.2%
446894.657211083 1
1.1%
446875.023189828 1
1.1%
446835.998695178 1
1.1%
446798.492471286 1
1.1%
446683.947963065 1
1.1%
446662.792093523 1
1.1%
446625.873769687 1
1.1%
446491.411615032 2
2.2%

취급제품명
Text

MISSING 

Distinct61
Distinct (%)75.3%
Missing11
Missing (%)12.0%
Memory size868.0 B
2024-04-06T21:20:39.986361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length46
Mean length14.580247
Min length1

Characters and Unicode

Total characters1181
Distinct characters167
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)67.9%

Sample

1st row말보로외 10종
2nd row브랜드오리지널, 브랜드맨솔
3rd rowFIGO
4th row던힐라이트 외 10종
5th row던힐라이트, 던힐라이트멘솔
ValueCountFrequency (%)
전자담배 28
 
13.1%
액상 6
 
2.8%
시가 5
 
2.3%
5
 
2.3%
5
 
2.3%
joomong 5
 
2.3%
need 5
 
2.3%
니코틴 4
 
1.9%
4
 
1.9%
던힐라이트 3
 
1.4%
Other values (122) 144
67.3%
2024-04-06T21:20:40.733788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
133
 
11.3%
, 65
 
5.5%
44
 
3.7%
44
 
3.7%
40
 
3.4%
38
 
3.2%
i 27
 
2.3%
e 27
 
2.3%
E 25
 
2.1%
O 25
 
2.1%
Other values (157) 713
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 474
40.1%
Lowercase Letter 239
20.2%
Uppercase Letter 211
17.9%
Space Separator 133
 
11.3%
Other Punctuation 71
 
6.0%
Decimal Number 26
 
2.2%
Open Punctuation 12
 
1.0%
Close Punctuation 12
 
1.0%
Dash Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
9.3%
44
 
9.3%
40
 
8.4%
38
 
8.0%
16
 
3.4%
16
 
3.4%
11
 
2.3%
11
 
2.3%
11
 
2.3%
8
 
1.7%
Other values (94) 235
49.6%
Uppercase Letter
ValueCountFrequency (%)
E 25
 
11.8%
O 25
 
11.8%
N 17
 
8.1%
M 12
 
5.7%
G 12
 
5.7%
I 10
 
4.7%
S 10
 
4.7%
P 9
 
4.3%
T 9
 
4.3%
D 8
 
3.8%
Other values (15) 74
35.1%
Lowercase Letter
ValueCountFrequency (%)
i 27
11.3%
e 27
11.3%
l 18
 
7.5%
o 18
 
7.5%
t 17
 
7.1%
a 16
 
6.7%
s 15
 
6.3%
n 15
 
6.3%
k 11
 
4.6%
h 11
 
4.6%
Other values (12) 64
26.8%
Decimal Number
ValueCountFrequency (%)
0 7
26.9%
5 5
19.2%
2 5
19.2%
1 3
11.5%
9 2
 
7.7%
7 2
 
7.7%
8 1
 
3.8%
4 1
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 65
91.5%
/ 3
 
4.2%
' 2
 
2.8%
. 1
 
1.4%
Space Separator
ValueCountFrequency (%)
133
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 474
40.1%
Latin 450
38.1%
Common 257
21.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
9.3%
44
 
9.3%
40
 
8.4%
38
 
8.0%
16
 
3.4%
16
 
3.4%
11
 
2.3%
11
 
2.3%
11
 
2.3%
8
 
1.7%
Other values (94) 235
49.6%
Latin
ValueCountFrequency (%)
i 27
 
6.0%
e 27
 
6.0%
E 25
 
5.6%
O 25
 
5.6%
l 18
 
4.0%
o 18
 
4.0%
t 17
 
3.8%
N 17
 
3.8%
a 16
 
3.6%
s 15
 
3.3%
Other values (37) 245
54.4%
Common
ValueCountFrequency (%)
133
51.8%
, 65
25.3%
( 12
 
4.7%
) 12
 
4.7%
0 7
 
2.7%
5 5
 
1.9%
2 5
 
1.9%
/ 3
 
1.2%
1 3
 
1.2%
- 3
 
1.2%
Other values (6) 9
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 707
59.9%
Hangul 474
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
133
 
18.8%
, 65
 
9.2%
i 27
 
3.8%
e 27
 
3.8%
E 25
 
3.5%
O 25
 
3.5%
l 18
 
2.5%
o 18
 
2.5%
t 17
 
2.4%
N 17
 
2.4%
Other values (53) 335
47.4%
Hangul
ValueCountFrequency (%)
44
 
9.3%
44
 
9.3%
40
 
8.4%
38
 
8.0%
16
 
3.4%
16
 
3.4%
11
 
2.3%
11
 
2.3%
11
 
2.3%
8
 
1.7%
Other values (94) 235
49.6%

담배공급업체명
Text

MISSING 

Distinct73
Distinct (%)90.1%
Missing11
Missing (%)12.0%
Memory size868.0 B
2024-04-06T21:20:41.075100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length78
Median length37
Mean length15.135802
Min length1

Characters and Unicode

Total characters1226
Distinct characters198
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)80.2%

Sample

1st row필립모리스코리아(주)
2nd row(주)어드베코리아
3rd row(주)다민바이오텍
4th row로스만스파이스트비브이
5th row브리티쉬아메리칸토바코코리아(주)
ValueCountFrequency (%)
shenzhen 9
 
5.7%
technology 8
 
5.1%
ltd 5
 
3.2%
co 5
 
3.2%
electronic 3
 
1.9%
주)코리아토바코컴퍼니 2
 
1.3%
코리아 2
 
1.3%
주)액상코리아 2
 
1.3%
co.ltd 2
 
1.3%
bilsen 2
 
1.3%
Other values (107) 117
74.5%
2024-04-06T21:20:41.747723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
78
 
6.4%
e 47
 
3.8%
( 44
 
3.6%
) 44
 
3.6%
43
 
3.5%
42
 
3.4%
n 40
 
3.3%
o 39
 
3.2%
36
 
2.9%
h 30
 
2.4%
Other values (188) 783
63.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 548
44.7%
Lowercase Letter 316
25.8%
Uppercase Letter 159
 
13.0%
Space Separator 78
 
6.4%
Open Punctuation 44
 
3.6%
Close Punctuation 44
 
3.6%
Other Punctuation 33
 
2.7%
Dash Punctuation 3
 
0.2%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
7.8%
42
 
7.7%
36
 
6.6%
28
 
5.1%
18
 
3.3%
16
 
2.9%
14
 
2.6%
13
 
2.4%
12
 
2.2%
9
 
1.6%
Other values (137) 317
57.8%
Uppercase Letter
ValueCountFrequency (%)
S 18
 
11.3%
L 15
 
9.4%
E 13
 
8.2%
T 12
 
7.5%
C 11
 
6.9%
O 9
 
5.7%
I 8
 
5.0%
A 7
 
4.4%
H 6
 
3.8%
N 6
 
3.8%
Other values (14) 54
34.0%
Lowercase Letter
ValueCountFrequency (%)
e 47
14.9%
n 40
12.7%
o 39
12.3%
h 30
9.5%
c 23
7.3%
i 23
7.3%
t 19
 
6.0%
d 17
 
5.4%
l 16
 
5.1%
g 11
 
3.5%
Other values (10) 51
16.1%
Other Punctuation
ValueCountFrequency (%)
, 22
66.7%
. 11
33.3%
Space Separator
ValueCountFrequency (%)
78
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Decimal Number
ValueCountFrequency (%)
8 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 548
44.7%
Latin 475
38.7%
Common 203
 
16.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
7.8%
42
 
7.7%
36
 
6.6%
28
 
5.1%
18
 
3.3%
16
 
2.9%
14
 
2.6%
13
 
2.4%
12
 
2.2%
9
 
1.6%
Other values (137) 317
57.8%
Latin
ValueCountFrequency (%)
e 47
 
9.9%
n 40
 
8.4%
o 39
 
8.2%
h 30
 
6.3%
c 23
 
4.8%
i 23
 
4.8%
t 19
 
4.0%
S 18
 
3.8%
d 17
 
3.6%
l 16
 
3.4%
Other values (34) 203
42.7%
Common
ValueCountFrequency (%)
78
38.4%
( 44
21.7%
) 44
21.7%
, 22
 
10.8%
. 11
 
5.4%
- 3
 
1.5%
8 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 678
55.3%
Hangul 548
44.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
78
 
11.5%
e 47
 
6.9%
( 44
 
6.5%
) 44
 
6.5%
n 40
 
5.9%
o 39
 
5.8%
h 30
 
4.4%
c 23
 
3.4%
i 23
 
3.4%
, 22
 
3.2%
Other values (41) 288
42.5%
Hangul
ValueCountFrequency (%)
43
 
7.8%
42
 
7.7%
36
 
6.6%
28
 
5.1%
18
 
3.3%
16
 
2.9%
14
 
2.6%
13
 
2.4%
12
 
2.2%
9
 
1.6%
Other values (137) 317
57.8%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)취급제품명담배공급업체명
03220000199432201271550000119941210<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA>135517서울특별시 강남구 청담동 97번지 1호 엠빌딩 3층서울특별시 강남구 도산대로 445, 3층 (청담동, 엠빌딩)135517(주)영유통2014-03-24 10:27:03I2018-08-31 23:59:59.0<NA>203886.536517446894.657211말보로외 10종필립모리스코리아(주)
13220000200232200831250000120020803<NA>3폐업3폐업처리20060328<NA><NA><NA>02 5571916<NA><NA>서울특별시 강남구 역삼동 828번지 10호<NA><NA>MLK2006-03-28 11:02:54I2018-08-31 23:59:59.0<NA>202849.232289443764.881624브랜드오리지널, 브랜드맨솔(주)어드베코리아
2322000020023220127155000152002-09-24<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-528-7033<NA><NA>서울특별시 강남구 삼성동 141번지 32호 BGF사옥서울특별시 강남구 테헤란로 405 (삼성동)6162주식회사 비지에프리테일2023-11-14 18:00:23U2022-10-31 23:06:00.0<NA>204333.137878444788.050526<NA><NA>
33220000200432200831250000120040527<NA>1영업/정상1정상영업<NA><NA><NA><NA>0220510252<NA><NA>서울특별시 강남구 대치동 981번지 2층<NA><NA>(주)다민엘앤티2006-08-25 10:49:52I2018-08-31 23:59:59.0<NA>205561.927843444515.573781FIGO(주)다민바이오텍
43220000200432200831250000220040806<NA>1영업/정상1정상영업<NA><NA><NA><NA>0221127325<NA><NA>서울특별시 강남구 역삼동 737번지 강남파이낸스센터 42층서울특별시 강남구 테헤란로 152, 42층 (역삼동, 강남파이낸스센터)6236브리티쉬아메리칸토바코코리아(주)2021-01-14 15:47:52U2021-01-16 02:40:00.0<NA>203169.223281444178.357262던힐라이트 외 10종로스만스파이스트비브이
53220000200432200831250000320040806<NA>1영업/정상1정상영업<NA><NA><NA><NA>0221127325<NA><NA>서울특별시 강남구 역삼동 737번지 강남파이낸스센터 42층서울특별시 강남구 테헤란로 152, 42층 (역삼동, 강남파이낸스센터)6236로스만스파이스트비브이2021-01-14 15:46:58U2021-01-16 02:40:00.0<NA>203169.223281444178.357262던힐라이트, 던힐라이트멘솔브리티쉬아메리칸토바코코리아(주)
63220000200432201271560000820041213<NA>3폐업3폐업처리20070823<NA><NA><NA><NA><NA>135962서울특별시 강남구 포이동 198번지 4호<NA><NA>시가웰2007-08-24 11:43:39I2018-08-31 23:59:59.0<NA><NA><NA>던힐 라이트외 수종BAT(주)
73220000200532200831250000120051013<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 5981391<NA><NA>서울특별시 강남구 역삼동 688번지 2호 성우빌딩 2층 201호<NA><NA>(주)이프인터내셔널2005-10-13 09:42:35I2018-08-31 23:59:59.0<NA>203477.11478445044.126118이프(주)구강물산
83220000200532200831250000220051129<NA>1영업/정상1정상영업<NA><NA><NA><NA>02 5537030<NA><NA>서울특별시 강남구 대치동 944번지 11호 삼흥빌딩 10층<NA><NA>이프라이프2006-03-09 18:03:56I2018-08-31 23:59:59.0<NA>205221.12445007.05이프(주)구강물산
93220000200532201621550000120050118<NA>3폐업3폐업처리20130909<NA><NA><NA><NA><NA>135957서울특별시 강남구 청담동 130번지 9호 현대리버스빌 805호<NA><NA>대진2014-10-21 09:18:28I2018-08-31 23:59:59.0<NA>204829.390808446875.02319--
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)취급제품명담배공급업체명
823220000202132202491550000220211117<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-6959-9075<NA><NA>서울특별시 강남구 논현동 39-18 성도빌딩서울특별시 강남구 논현로131길 30, 성도빌딩 4층 (논현동)6045주식회사 디파이브컴퍼니2021-11-17 16:46:31I2021-11-19 00:22:44.0<NA>202395.193958445774.127925넥스트 칵스, 칵스 일회용 전자담배동관헤이마전자과기유한공사, 심천시웨이메이전자과기유한공사
83322000020213220249155000032021-11-17<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-2038-6776<NA><NA>서울특별시 강남구 대치동 943-17 구일빌딩서울특별시 강남구 삼성로86길 23, 구일빌딩 6층 (대치동)6179주식회사 디베이프2023-03-29 09:06:12U2022-12-02 21:01:00.0<NA>205166.782426444887.051831<NA><NA>
843220000202132202491550000420211210<NA>1영업/정상1정상영업<NA><NA><NA><NA>02-3446-5546<NA><NA>서울특별시 강남구 청담동 88-29서울특별시 강남구 도산대로57길 13-4, 2층 (청담동)6014주식회사 시가스토리2021-12-10 13:28:30I2021-12-14 00:22:42.0<NA>203553.450445446939.804265시가 및 시가관련제품자사수입, 부루벨코리아(다비도프)
85322000020223220249155000012022-02-25<NA>3폐업3폐업처리2023-05-09<NA><NA><NA>02-6959-9075<NA><NA>서울특별시 강남구 대치동 943-17 구일빌딩서울특별시 강남구 삼성로86길 23, 구일빌딩 6층 (대치동)6179주식회사 디파이브글로벌2023-05-09 10:11:42U2022-12-04 23:01:00.0<NA>205166.782426444887.051831<NA><NA>
86322000020223220249155000022022-06-24<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 도곡동 951서울특별시 강남구 남부순환로351길 30, 4층 (도곡동)6266주식회사 글로벌에스와이2024-01-22 10:35:36U2023-11-30 22:04:00.0<NA>202941.397019442724.388183<NA><NA>
873220000202232202491550000320221104<NA>1영업/정상1정상영업<NA><NA><NA><NA>025650255<NA><NA>서울특별시 강남구 삼성동 140-12서울특별시 강남구 선릉로100길 26, 4층 (삼성동)6161주식회사 지오프랜드2022-11-07 13:03:05I2021-11-01 00:09:00.0<NA>204289.676668444952.973781<NA><NA>
88322000020233220249155000012023-01-04<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 삼성동 142-34서울특별시 강남구 선릉로92길 28, 4층 441호 (삼성동)6160주식회사 알카2023-02-10 16:22:56U2022-12-01 23:02:00.0<NA>204432.46444839.755<NA><NA>
89322000020233220249155000022023-07-25<NA>1영업/정상1정상영업<NA><NA><NA><NA><NA><NA><NA>서울특별시 강남구 신사동 666-20서울특별시 강남구 도산대로53길 10, 지하1층 (신사동)6019주식회사 와이와이디에스코리아2023-07-26 10:54:16I2022-12-06 22:08:00.0<NA>203331.111307446835.998695<NA><NA>
90322000020233220249155000032023-08-24<NA>1영업/정상1정상영업<NA><NA><NA><NA>0218998976<NA><NA>서울특별시 강남구 삼성동 114-1 몽베르빌딩서울특별시 강남구 봉은사로 438, 몽베르빌딩 8층 (삼성동)6153이주스어워드코리아 유한회사2023-08-24 17:49:25I2022-12-07 22:06:00.0<NA>204155.766813445442.476716<NA><NA>
91322000020233220249155000042023-09-26<NA>1영업/정상1정상영업<NA><NA><NA><NA>0234978900<NA><NA>서울특별시 강남구 대치동 907-8 세토피아 빌딩서울특별시 강남구 역삼로 413, 세토피아 빌딩 3,5층 (대치동)6198주식회사 세토피아2023-09-26 16:22:55I2022-12-08 22:08:00.0<NA>204542.807065444339.179112<NA><NA>