Overview

Dataset statistics

Number of variables27
Number of observations295
Missing cells2864
Missing cells (%)36.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory66.1 KiB
Average record size in memory229.4 B

Variable types

Categorical6
Numeric6
DateTime4
Unsupported6
Text5

Dataset

Description개방자치단체코드,관리번호,인허가일자,인허가취소일자,영업상태코드,영업상태명,상세영업상태코드,상세영업상태명,폐업일자,휴업시작일자,휴업종료일자,재개업일자,전화번호,소재지면적,소재지우편번호,지번주소,도로명주소,도로명우편번호,사업장명,최종수정일자,데이터갱신구분,데이터갱신일자,업태구분명,좌표정보(X),좌표정보(Y),취급제품명,담배공급업체명
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-16145/S/1/datasetView.do

Alerts

개방자치단체코드 has constant value ""Constant
영업상태코드 is highly imbalanced (65.7%)Imbalance
영업상태명 is highly imbalanced (65.7%)Imbalance
상세영업상태명 is highly imbalanced (65.7%)Imbalance
인허가취소일자 has 295 (100.0%) missing valuesMissing
폐업일자 has 295 (100.0%) missing valuesMissing
휴업시작일자 has 268 (90.8%) missing valuesMissing
휴업종료일자 has 286 (96.9%) missing valuesMissing
재개업일자 has 295 (100.0%) missing valuesMissing
전화번호 has 65 (22.0%) missing valuesMissing
소재지면적 has 295 (100.0%) missing valuesMissing
소재지우편번호 has 199 (67.5%) missing valuesMissing
지번주소 has 10 (3.4%) missing valuesMissing
도로명주소 has 43 (14.6%) missing valuesMissing
도로명우편번호 has 295 (100.0%) missing valuesMissing
업태구분명 has 295 (100.0%) missing valuesMissing
좌표정보(X) has 46 (15.6%) missing valuesMissing
좌표정보(Y) has 46 (15.6%) missing valuesMissing
취급제품명 has 47 (15.9%) missing valuesMissing
담배공급업체명 has 84 (28.5%) missing valuesMissing
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
폐업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
재개업일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
소재지면적 is an unsupported type, check if it needs cleaning or further analysisUnsupported
도로명우편번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
업태구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 18:54:00.964426
Analysis finished2024-04-29 18:54:01.903841
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

개방자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
6110000
295 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row6110000
2nd row6110000
3rd row6110000
4th row6110000
5th row6110000

Common Values

ValueCountFrequency (%)
6110000 295
100.0%

Length

2024-04-30T03:54:01.966339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T03:54:02.034495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
6110000 295
100.0%

관리번호
Real number (ℝ)

Distinct30
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0124246 × 1019
Minimum1.989611 × 1019
Maximum2.022611 × 1019
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2024-04-30T03:54:02.104524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.989611 × 1019
5-th percentile1.998611 × 1019
Q12.010611 × 1019
median2.013611 × 1019
Q32.016611 × 1019
95-th percentile2.020611 × 1019
Maximum2.022611 × 1019
Range3.3 × 1017
Interquartile range (IQR)6 × 1016

Descriptive statistics

Standard deviation6.7306654 × 1016
Coefficient of variation (CV)0.0033445554
Kurtosis1.3250515
Mean2.0124246 × 1019
Median Absolute Deviation (MAD)3 × 1016
Skewness-1.1770891
Sum5.9366525 × 1021
Variance4.5301856 × 1033
MonotonicityNot monotonic
2024-04-30T03:54:02.201285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
2.0116110000011e+19 37
12.5%
2.0146110000011e+19 35
11.9%
2.0196110000011e+19 25
 
8.5%
2.0156110000011e+19 22
 
7.5%
2.0126110000011e+19 19
 
6.4%
2.0166110000011e+19 17
 
5.8%
2.0186110000011e+19 15
 
5.1%
2.0176110000011e+19 14
 
4.7%
2.0136110000011e+19 13
 
4.4%
2.0106110000011e+19 12
 
4.1%
Other values (20) 86
29.2%
ValueCountFrequency (%)
1.9896110000011e+19 3
1.0%
1.9906110000011e+19 3
1.0%
1.9956110000011e+19 1
 
0.3%
1.9966110000011e+19 3
1.0%
1.9976110000011e+19 1
 
0.3%
1.9986110000011e+19 5
1.7%
1.9996110000011e+19 3
1.0%
2.0006110000011e+19 2
 
0.7%
2.0016110000011e+19 6
2.0%
2.0026110000011e+19 7
2.4%
ValueCountFrequency (%)
2.0226110000011e+19 3
 
1.0%
2.0216110000011e+19 8
 
2.7%
2.0206110000011e+19 7
 
2.4%
2.0196110000011e+19 25
8.5%
2.0186110000011e+19 15
5.1%
2.0176110000011e+19 14
 
4.7%
2.0166110000011e+19 17
5.8%
2.0156110000011e+19 22
7.5%
2.0146110000011e+19 35
11.9%
2.0136110000011e+19 13
 
4.4%
Distinct257
Distinct (%)87.1%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum1989-04-29 00:00:00
Maximum2022-08-23 00:00:00
2024-04-30T03:54:02.310746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T03:54:02.422250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing295
Missing (%)100.0%
Memory size2.7 KiB

영업상태코드
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
1
264 
3
27 
2
 
4

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 264
89.5%
3 27
 
9.2%
2 4
 
1.4%

Length

2024-04-30T03:54:02.531688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T03:54:02.618367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 264
89.5%
3 27
 
9.2%
2 4
 
1.4%

영업상태명
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
영업/정상
264 
폐업
27 
휴업
 
4

Length

Max length5
Median length5
Mean length4.6847458
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업/정상
2nd row영업/정상
3rd row영업/정상
4th row휴업
5th row영업/정상

Common Values

ValueCountFrequency (%)
영업/정상 264
89.5%
폐업 27
 
9.2%
휴업 4
 
1.4%

Length

2024-04-30T03:54:02.709970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T03:54:02.790611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 264
89.5%
폐업 27
 
9.2%
휴업 4
 
1.4%
Distinct4
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
BBBB
218 
3
46 
2
27 
1
 
4

Length

Max length4
Median length4
Mean length3.2169492
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowBBBB
2nd row3
3rd row3
4th row1
5th row3

Common Values

ValueCountFrequency (%)
BBBB 218
73.9%
3 46
 
15.6%
2 27
 
9.2%
1 4
 
1.4%

Length

2024-04-30T03:54:02.879268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T03:54:02.975095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
bbbb 218
73.9%
3 46
 
15.6%
2 27
 
9.2%
1 4
 
1.4%

상세영업상태명
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
영업
264 
폐업
27 
휴업
 
4

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row휴업
5th row영업

Common Values

ValueCountFrequency (%)
영업 264
89.5%
폐업 27
 
9.2%
휴업 4
 
1.4%

Length

2024-04-30T03:54:03.078166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T03:54:03.166976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업 264
89.5%
폐업 27
 
9.2%
휴업 4
 
1.4%

폐업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing295
Missing (%)100.0%
Memory size2.7 KiB

휴업시작일자
Date

MISSING 

Distinct27
Distinct (%)100.0%
Missing268
Missing (%)90.8%
Memory size2.4 KiB
Minimum2011-11-24 00:00:00
Maximum2023-10-04 00:00:00
2024-04-30T03:54:03.254418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T03:54:03.355849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)

휴업종료일자
Real number (ℝ)

MISSING 

Distinct9
Distinct (%)100.0%
Missing286
Missing (%)96.9%
Infinite0
Infinite (%)0.0%
Mean20161862
Minimum20121113
Maximum20220527
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2024-04-30T03:54:03.452442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20121113
5-th percentile20121113
Q120121123
median20151221
Q320211231
95-th percentile20220361
Maximum20220527
Range99414
Interquartile range (IQR)90108

Descriptive statistics

Standard deviation43833.78
Coefficient of variation (CV)0.0021740939
Kurtosis-1.7719231
Mean20161862
Median Absolute Deviation (MAD)30107
Skewness0.54195631
Sum1.8145675 × 108
Variance1.9214002 × 109
MonotonicityNot monotonic
2024-04-30T03:54:03.548143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
20220527 1
 
0.3%
20121123 1
 
0.3%
20220112 1
 
0.3%
20160112 1
 
0.3%
20121113 1
 
0.3%
20121114 1
 
0.3%
20130201 1
 
0.3%
20151221 1
 
0.3%
20211231 1
 
0.3%
(Missing) 286
96.9%
ValueCountFrequency (%)
20121113 1
0.3%
20121114 1
0.3%
20121123 1
0.3%
20130201 1
0.3%
20151221 1
0.3%
20160112 1
0.3%
20211231 1
0.3%
20220112 1
0.3%
20220527 1
0.3%
ValueCountFrequency (%)
20220527 1
0.3%
20220112 1
0.3%
20211231 1
0.3%
20160112 1
0.3%
20151221 1
0.3%
20130201 1
0.3%
20121123 1
0.3%
20121114 1
0.3%
20121113 1
0.3%

재개업일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing295
Missing (%)100.0%
Memory size2.7 KiB

전화번호
Real number (ℝ)

MISSING 

Distinct205
Distinct (%)89.1%
Missing65
Missing (%)22.0%
Infinite0
Infinite (%)0.0%
Mean1.0201148 × 109
Minimum0
Maximum7.0886779 × 109
Zeros1
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2024-04-30T03:54:03.649724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile23586146
Q125638594
median29669444
Q32.344804 × 108
95-th percentile7.0752285 × 109
Maximum7.0886779 × 109
Range7.0886779 × 109
Interquartile range (IQR)2.0884181 × 108

Descriptive statistics

Standard deviation2.3475585 × 109
Coefficient of variation (CV)2.3012689
Kurtosis2.8840682
Mean1.0201148 × 109
Median Absolute Deviation (MAD)6486017.5
Skewness2.2004804
Sum2.3462641 × 1011
Variance5.5110311 × 1018
MonotonicityNot monotonic
2024-04-30T03:54:03.766044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
264479079 3
 
1.0%
27904522 3
 
1.0%
220684860 3
 
1.0%
23256886 3
 
1.0%
231471912 2
 
0.7%
264014110 2
 
0.7%
27550890 2
 
0.7%
27908014 2
 
0.7%
25473003 2
 
0.7%
27325711 2
 
0.7%
Other values (195) 206
69.8%
(Missing) 65
 
22.0%
ValueCountFrequency (%)
0 1
 
0.3%
20000000 1
 
0.3%
23109968 1
 
0.3%
23256886 3
1.0%
23339155 1
 
0.3%
23350100 2
0.7%
23544428 1
 
0.3%
23544488 1
 
0.3%
23578280 1
 
0.3%
23595761 1
 
0.3%
ValueCountFrequency (%)
7088677941 1
0.3%
7088465431 1
0.3%
7088202015 1
0.3%
7087695892 1
0.3%
7086394367 1
0.3%
7082835151 1
0.3%
7082250540 1
0.3%
7082103246 1
0.3%
7078588502 1
0.3%
7077677200 1
0.3%

소재지면적
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing295
Missing (%)100.0%
Memory size2.7 KiB

소재지우편번호
Real number (ℝ)

MISSING 

Distinct73
Distinct (%)76.0%
Missing199
Missing (%)67.5%
Infinite0
Infinite (%)0.0%
Mean112994.07
Minimum2134
Maximum158076
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2024-04-30T03:54:03.887497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2134
5-th percentile5841
Q1118251.25
median135605
Q3142907.5
95-th percentile157015
Maximum158076
Range155942
Interquartile range (IQR)24656.25

Descriptive statistics

Standard deviation52856.146
Coefficient of variation (CV)0.46777804
Kurtosis0.33802065
Mean112994.07
Median Absolute Deviation (MAD)14405
Skewness-1.4321197
Sum10847431
Variance2.7937721 × 109
MonotonicityNot monotonic
2024-04-30T03:54:04.005297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
152050 6
 
2.0%
135280 3
 
1.0%
135080 3
 
1.0%
134030 3
 
1.0%
137070 2
 
0.7%
153010 2
 
0.7%
157030 2
 
0.7%
121200 2
 
0.7%
137060 2
 
0.7%
6075 2
 
0.7%
Other values (63) 69
 
23.4%
(Missing) 199
67.5%
ValueCountFrequency (%)
2134 1
0.3%
3992 1
0.3%
4342 1
0.3%
5400 1
0.3%
5841 2
0.7%
6045 1
0.3%
6075 2
0.7%
6163 1
0.3%
6166 1
0.3%
6233 1
0.3%
ValueCountFrequency (%)
158076 1
 
0.3%
157930 1
 
0.3%
157220 1
 
0.3%
157030 2
 
0.7%
157010 1
 
0.3%
153030 1
 
0.3%
153023 2
 
0.7%
153010 2
 
0.7%
152748 1
 
0.3%
152050 6
2.0%

지번주소
Text

MISSING 

Distinct249
Distinct (%)87.4%
Missing10
Missing (%)3.4%
Memory size2.4 KiB
2024-04-30T03:54:04.259612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length44
Mean length29.631579
Min length1

Characters and Unicode

Total characters8445
Distinct characters280
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique236 ?
Unique (%)82.8%

Sample

1st row서울특별시 서초구 반포동 반포*동 **-* (고속터미널 *-***)
2nd row서울특별시 강남구 포이동 ***-* (광진빌딩 ***호)
3rd row전라북도 군산시 월명동 **번지 *호 **통 *반 ***동 ****호
4th row서울특별시 영등포구 여의도동 **-** 서린빌딩 ***호
5th row서울특별시 구로구 구로동 ****번지 *호 부마빌딩 ***호
ValueCountFrequency (%)
250
 
14.4%
서울특별시 216
 
12.5%
번지 144
 
8.3%
105
 
6.1%
97
 
5.6%
69
 
4.0%
52
 
3.0%
강남구 39
 
2.3%
33
 
1.9%
경기도 32
 
1.8%
Other values (369) 696
40.2%
2024-04-30T03:54:04.632400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1886
22.3%
* 1822
21.6%
355
 
4.2%
274
 
3.2%
267
 
3.2%
265
 
3.1%
264
 
3.1%
223
 
2.6%
218
 
2.6%
216
 
2.6%
Other values (270) 2655
31.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4645
55.0%
Space Separator 1886
22.3%
Other Punctuation 1823
 
21.6%
Dash Punctuation 45
 
0.5%
Uppercase Letter 27
 
0.3%
Open Punctuation 9
 
0.1%
Close Punctuation 9
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
355
 
7.6%
274
 
5.9%
267
 
5.7%
265
 
5.7%
264
 
5.7%
223
 
4.8%
218
 
4.7%
216
 
4.7%
151
 
3.3%
146
 
3.1%
Other values (252) 2266
48.8%
Uppercase Letter
ValueCountFrequency (%)
B 5
18.5%
S 5
18.5%
K 4
14.8%
E 2
 
7.4%
W 2
 
7.4%
I 2
 
7.4%
V 2
 
7.4%
G 2
 
7.4%
R 1
 
3.7%
F 1
 
3.7%
Other Punctuation
ValueCountFrequency (%)
* 1822
99.9%
& 1
 
0.1%
Space Separator
ValueCountFrequency (%)
1886
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4645
55.0%
Common 3773
44.7%
Latin 27
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
355
 
7.6%
274
 
5.9%
267
 
5.7%
265
 
5.7%
264
 
5.7%
223
 
4.8%
218
 
4.7%
216
 
4.7%
151
 
3.3%
146
 
3.1%
Other values (252) 2266
48.8%
Latin
ValueCountFrequency (%)
B 5
18.5%
S 5
18.5%
K 4
14.8%
E 2
 
7.4%
W 2
 
7.4%
I 2
 
7.4%
V 2
 
7.4%
G 2
 
7.4%
R 1
 
3.7%
F 1
 
3.7%
Common
ValueCountFrequency (%)
1886
50.0%
* 1822
48.3%
- 45
 
1.2%
( 9
 
0.2%
) 9
 
0.2%
& 1
 
< 0.1%
~ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4645
55.0%
ASCII 3800
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1886
49.6%
* 1822
47.9%
- 45
 
1.2%
( 9
 
0.2%
) 9
 
0.2%
B 5
 
0.1%
S 5
 
0.1%
K 4
 
0.1%
E 2
 
0.1%
W 2
 
0.1%
Other values (8) 11
 
0.3%
Hangul
ValueCountFrequency (%)
355
 
7.6%
274
 
5.9%
267
 
5.7%
265
 
5.7%
264
 
5.7%
223
 
4.8%
218
 
4.7%
216
 
4.7%
151
 
3.3%
146
 
3.1%
Other values (252) 2266
48.8%

도로명주소
Text

MISSING 

Distinct236
Distinct (%)93.7%
Missing43
Missing (%)14.6%
Memory size2.4 KiB
2024-04-30T03:54:04.854802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length43
Mean length35.238095
Min length21

Characters and Unicode

Total characters8880
Distinct characters338
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique221 ?
Unique (%)87.7%

Sample

1st row서울특별시 용산구 독서당로 **, B***호(한남동, 신성미소시티)
2nd row전라북도 군산시 월명*길 *, ***동 ****호 (월명동,다원클래시움아파트)
3rd row서울특별시 구로구 시흥대로 ***, ***호 (구로동,부마빌딩)
4th row서울특별시 종로구 북촌로*길 ** (안국동)
5th row서울특별시 송파구 오금로**길 **, ***동 ***호 (거여동,현대*차아파트)
ValueCountFrequency (%)
259
 
16.4%
서울특별시 208
 
13.1%
133
 
8.4%
62
 
3.9%
40
 
2.5%
강남구 34
 
2.1%
경기도 32
 
2.0%
서초구 20
 
1.3%
송파구 17
 
1.1%
마포구 17
 
1.1%
Other values (502) 762
48.1%
2024-04-30T03:54:05.224637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 1617
18.2%
1332
 
15.0%
355
 
4.0%
, 309
 
3.5%
289
 
3.3%
270
 
3.0%
267
 
3.0%
255
 
2.9%
( 247
 
2.8%
) 247
 
2.8%
Other values (328) 3692
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5042
56.8%
Other Punctuation 1927
 
21.7%
Space Separator 1332
 
15.0%
Open Punctuation 247
 
2.8%
Close Punctuation 247
 
2.8%
Uppercase Letter 38
 
0.4%
Dash Punctuation 37
 
0.4%
Lowercase Letter 8
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
355
 
7.0%
289
 
5.7%
270
 
5.4%
267
 
5.3%
255
 
5.1%
212
 
4.2%
212
 
4.2%
209
 
4.1%
160
 
3.2%
133
 
2.6%
Other values (299) 2680
53.2%
Uppercase Letter
ValueCountFrequency (%)
B 7
18.4%
S 5
13.2%
E 5
13.2%
K 4
10.5%
V 2
 
5.3%
I 2
 
5.3%
W 2
 
5.3%
C 2
 
5.3%
R 2
 
5.3%
G 2
 
5.3%
Other values (5) 5
13.2%
Lowercase Letter
ValueCountFrequency (%)
s 2
25.0%
e 2
25.0%
i 1
12.5%
r 1
12.5%
a 1
12.5%
t 1
12.5%
Other Punctuation
ValueCountFrequency (%)
* 1617
83.9%
, 309
 
16.0%
& 1
 
0.1%
Space Separator
ValueCountFrequency (%)
1332
100.0%
Open Punctuation
ValueCountFrequency (%)
( 247
100.0%
Close Punctuation
ValueCountFrequency (%)
) 247
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5042
56.8%
Common 3792
42.7%
Latin 46
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
355
 
7.0%
289
 
5.7%
270
 
5.4%
267
 
5.3%
255
 
5.1%
212
 
4.2%
212
 
4.2%
209
 
4.1%
160
 
3.2%
133
 
2.6%
Other values (299) 2680
53.2%
Latin
ValueCountFrequency (%)
B 7
15.2%
S 5
 
10.9%
E 5
 
10.9%
K 4
 
8.7%
V 2
 
4.3%
I 2
 
4.3%
W 2
 
4.3%
C 2
 
4.3%
s 2
 
4.3%
R 2
 
4.3%
Other values (11) 13
28.3%
Common
ValueCountFrequency (%)
* 1617
42.6%
1332
35.1%
, 309
 
8.1%
( 247
 
6.5%
) 247
 
6.5%
- 37
 
1.0%
~ 2
 
0.1%
& 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5042
56.8%
ASCII 3838
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 1617
42.1%
1332
34.7%
, 309
 
8.1%
( 247
 
6.4%
) 247
 
6.4%
- 37
 
1.0%
B 7
 
0.2%
S 5
 
0.1%
E 5
 
0.1%
K 4
 
0.1%
Other values (19) 28
 
0.7%
Hangul
ValueCountFrequency (%)
355
 
7.0%
289
 
5.7%
270
 
5.4%
267
 
5.3%
255
 
5.1%
212
 
4.2%
212
 
4.2%
209
 
4.1%
160
 
3.2%
133
 
2.6%
Other values (299) 2680
53.2%

도로명우편번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing295
Missing (%)100.0%
Memory size2.7 KiB
Distinct277
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2024-04-30T03:54:05.440649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length8.1762712
Min length2

Characters and Unicode

Total characters2412
Distinct characters327
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique260 ?
Unique (%)88.1%

Sample

1st row(주)힐링123
2nd row동북아국제교역(주)
3rd row맨하탄
4th row씨케이티티(주)
5th row(주)국초쑥나라
ValueCountFrequency (%)
주식회사 42
 
11.5%
주)토로코리아 3
 
0.8%
짚코리아 2
 
0.5%
코리아(유 2
 
0.5%
이씬코리아 2
 
0.5%
주)하이그레이트썬 2
 
0.5%
락인터내셔널코리아 2
 
0.5%
주)에스알티인터내셔날 2
 
0.5%
주)브리티씨파트너스 2
 
0.5%
엔조이코리아(주 2
 
0.5%
Other values (292) 303
83.2%
2024-04-30T03:54:05.772096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
165
 
6.8%
( 128
 
5.3%
) 128
 
5.3%
99
 
4.1%
90
 
3.7%
84
 
3.5%
76
 
3.2%
69
 
2.9%
68
 
2.8%
55
 
2.3%
Other values (317) 1450
60.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1951
80.9%
Open Punctuation 128
 
5.3%
Close Punctuation 128
 
5.3%
Space Separator 69
 
2.9%
Uppercase Letter 53
 
2.2%
Lowercase Letter 36
 
1.5%
Other Symbol 31
 
1.3%
Decimal Number 10
 
0.4%
Other Punctuation 6
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
165
 
8.5%
99
 
5.1%
90
 
4.6%
84
 
4.3%
76
 
3.9%
68
 
3.5%
55
 
2.8%
47
 
2.4%
45
 
2.3%
35
 
1.8%
Other values (269) 1187
60.8%
Uppercase Letter
ValueCountFrequency (%)
O 7
13.2%
S 7
13.2%
J 4
 
7.5%
E 4
 
7.5%
T 4
 
7.5%
R 4
 
7.5%
A 3
 
5.7%
L 3
 
5.7%
C 3
 
5.7%
N 3
 
5.7%
Other values (7) 11
20.8%
Lowercase Letter
ValueCountFrequency (%)
o 4
11.1%
i 4
11.1%
n 4
11.1%
l 3
 
8.3%
a 3
 
8.3%
r 2
 
5.6%
s 2
 
5.6%
e 2
 
5.6%
t 2
 
5.6%
g 2
 
5.6%
Other values (7) 8
22.2%
Decimal Number
ValueCountFrequency (%)
1 3
30.0%
2 2
20.0%
4 1
 
10.0%
5 1
 
10.0%
3 1
 
10.0%
9 1
 
10.0%
7 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
& 3
50.0%
; 2
33.3%
# 1
 
16.7%
Open Punctuation
ValueCountFrequency (%)
( 128
100.0%
Close Punctuation
ValueCountFrequency (%)
) 128
100.0%
Space Separator
ValueCountFrequency (%)
69
100.0%
Other Symbol
ValueCountFrequency (%)
31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1982
82.2%
Common 341
 
14.1%
Latin 89
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
165
 
8.3%
99
 
5.0%
90
 
4.5%
84
 
4.2%
76
 
3.8%
68
 
3.4%
55
 
2.8%
47
 
2.4%
45
 
2.3%
35
 
1.8%
Other values (270) 1218
61.5%
Latin
ValueCountFrequency (%)
O 7
 
7.9%
S 7
 
7.9%
J 4
 
4.5%
E 4
 
4.5%
o 4
 
4.5%
i 4
 
4.5%
T 4
 
4.5%
R 4
 
4.5%
n 4
 
4.5%
A 3
 
3.4%
Other values (24) 44
49.4%
Common
ValueCountFrequency (%)
( 128
37.5%
) 128
37.5%
69
20.2%
1 3
 
0.9%
& 3
 
0.9%
2 2
 
0.6%
; 2
 
0.6%
4 1
 
0.3%
5 1
 
0.3%
# 1
 
0.3%
Other values (3) 3
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1951
80.9%
ASCII 430
 
17.8%
None 31
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
165
 
8.5%
99
 
5.1%
90
 
4.6%
84
 
4.3%
76
 
3.9%
68
 
3.5%
55
 
2.8%
47
 
2.4%
45
 
2.3%
35
 
1.8%
Other values (269) 1187
60.8%
ASCII
ValueCountFrequency (%)
( 128
29.8%
) 128
29.8%
69
16.0%
O 7
 
1.6%
S 7
 
1.6%
J 4
 
0.9%
E 4
 
0.9%
o 4
 
0.9%
i 4
 
0.9%
T 4
 
0.9%
Other values (37) 71
16.5%
None
ValueCountFrequency (%)
31
100.0%
Distinct184
Distinct (%)62.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum2005-10-04 00:00:00
Maximum2023-10-10 00:00:00
2024-04-30T03:54:05.888473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T03:54:06.169276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
I
258 
U
37 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowU
5th rowI

Common Values

ValueCountFrequency (%)
I 258
87.5%
U 37
 
12.5%

Length

2024-04-30T03:54:06.275288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T03:54:06.353721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
i 258
87.5%
u 37
 
12.5%
Distinct57
Distinct (%)19.3%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum2018-08-31 23:59:59
Maximum2022-10-30 23:02:00
2024-04-30T03:54:06.446664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-30T03:54:06.572542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

업태구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing295
Missing (%)100.0%
Memory size2.7 KiB

좌표정보(X)
Real number (ℝ)

MISSING 

Distinct223
Distinct (%)89.6%
Missing46
Missing (%)15.6%
Infinite0
Infinite (%)0.0%
Mean203057.67
Minimum149382.07
Maximum412724.77
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2024-04-30T03:54:06.704907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum149382.07
5-th percentile185442.6
Q1193326.93
median201072.33
Q3205561.93
95-th percentile217859.54
Maximum412724.77
Range263342.7
Interquartile range (IQR)12234.997

Descriptive statistics

Standard deviation25868.955
Coefficient of variation (CV)0.12739708
Kurtosis38.581519
Mean203057.67
Median Absolute Deviation (MAD)7210.9899
Skewness5.6394555
Sum50561360
Variance6.6920283 × 108
MonotonicityNot monotonic
2024-04-30T03:54:06.825513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
193326.931018911 3
 
1.0%
185796.522248947 3
 
1.0%
208394.416382167 3
 
1.0%
221173.876445599 2
 
0.7%
202502.111899175 2
 
0.7%
183547.0 2
 
0.7%
190352.185129194 2
 
0.7%
185552.222707266 2
 
0.7%
199707.004408754 2
 
0.7%
195258.466477275 2
 
0.7%
Other values (213) 226
76.6%
(Missing) 46
 
15.6%
ValueCountFrequency (%)
149382.070416 1
0.3%
167513.188839653 1
0.3%
168114.960082 1
0.3%
173394.968972 1
0.3%
175250.934231391 1
0.3%
177258.0 1
0.3%
178573.664825 1
0.3%
181262.398558446 1
0.3%
183547.0 2
0.7%
183582.0 1
0.3%
ValueCountFrequency (%)
412724.766792563 1
0.3%
388569.448930725 1
0.3%
380929.831668814 1
0.3%
331258.826491 1
0.3%
303527.331877383 1
0.3%
244845.495951 1
0.3%
238773.923541 1
0.3%
222005.0 1
0.3%
221173.876445599 2
0.7%
219591.521773281 1
0.3%

좌표정보(Y)
Real number (ℝ)

MISSING 

Distinct223
Distinct (%)89.6%
Missing46
Missing (%)15.6%
Infinite0
Infinite (%)0.0%
Mean437965.26
Minimum-26469.212
Maximum468458.75
Zeros0
Zeros (%)0.0%
Negative1
Negative (%)0.3%
Memory size2.7 KiB
2024-04-30T03:54:06.950183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-26469.212
5-th percentile406233.11
Q1443613.18
median446973.91
Q3451056.2
95-th percentile460523.2
Maximum468458.75
Range494927.97
Interquartile range (IQR)7443.0198

Descriptive statistics

Standard deviation50331.392
Coefficient of variation (CV)0.11492097
Kurtosis38.855658
Mean437965.26
Median Absolute Deviation (MAD)3709.073
Skewness-5.7905367
Sum1.0905335 × 108
Variance2.533249 × 109
MonotonicityNot monotonic
2024-04-30T03:54:07.065922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
446973.914570276 3
 
1.0%
450735.884713387 3
 
1.0%
448165.279999905 3
 
1.0%
461803.519237615 2
 
0.7%
446464.940373234 2
 
0.7%
464606.0 2
 
0.7%
443799.918089908 2
 
0.7%
447182.914999762 2
 
0.7%
442078.56253585 2
 
0.7%
448540.434893446 2
 
0.7%
Other values (213) 226
76.6%
(Missing) 46
 
15.6%
ValueCountFrequency (%)
-26469.2119723 1
0.3%
185337.074496747 1
0.3%
185453.187416704 1
0.3%
186981.679213108 1
0.3%
229878.407405013 1
0.3%
230257.667895 1
0.3%
245554.210845 1
0.3%
275996.282906 1
0.3%
283860.418422 1
0.3%
348939.67556 1
0.3%
ValueCountFrequency (%)
468458.753156954 1
0.3%
467508.928681137 1
0.3%
464606.0 2
0.7%
464080.593904719 1
0.3%
463819.61772237 1
0.3%
463058.0 1
0.3%
461919.855299157 1
0.3%
461803.519237615 2
0.7%
461157.210858518 1
0.3%
460832.670736688 1
0.3%

취급제품명
Text

MISSING 

Distinct106
Distinct (%)42.7%
Missing47
Missing (%)15.9%
Memory size2.4 KiB
2024-04-30T03:54:07.318777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length38
Mean length6.5604839
Min length2

Characters and Unicode

Total characters1627
Distinct characters200
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)37.5%

Sample

1st row하나, 사슴등
2nd row시가
3rd row궐련형
4th row궐련
5th row전자담배
ValueCountFrequency (%)
전자담배 99
27.6%
궐련 42
 
11.7%
액상 15
 
4.2%
시가 10
 
2.8%
필터담배 7
 
1.9%
5
 
1.4%
권련 4
 
1.1%
니코틴 4
 
1.1%
파이프담배 3
 
0.8%
각련 3
 
0.8%
Other values (139) 167
46.5%
2024-04-30T03:54:07.755197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
147
 
9.0%
147
 
9.0%
116
 
7.1%
116
 
7.1%
111
 
6.8%
66
 
4.1%
55
 
3.4%
, 39
 
2.4%
E 25
 
1.5%
24
 
1.5%
Other values (190) 781
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1075
66.1%
Uppercase Letter 205
 
12.6%
Lowercase Letter 112
 
6.9%
Space Separator 111
 
6.8%
Other Punctuation 55
 
3.4%
Decimal Number 20
 
1.2%
Open Punctuation 19
 
1.2%
Close Punctuation 19
 
1.2%
Dash Punctuation 9
 
0.6%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
147
13.7%
147
13.7%
116
 
10.8%
116
 
10.8%
66
 
6.1%
55
 
5.1%
24
 
2.2%
20
 
1.9%
15
 
1.4%
15
 
1.4%
Other values (125) 354
32.9%
Uppercase Letter
ValueCountFrequency (%)
E 25
12.2%
I 19
 
9.3%
A 18
 
8.8%
T 17
 
8.3%
C 12
 
5.9%
L 11
 
5.4%
R 11
 
5.4%
O 10
 
4.9%
S 10
 
4.9%
P 10
 
4.9%
Other values (13) 62
30.2%
Lowercase Letter
ValueCountFrequency (%)
a 15
13.4%
e 14
12.5%
r 13
11.6%
i 10
 
8.9%
o 8
 
7.1%
t 7
 
6.2%
s 6
 
5.4%
u 5
 
4.5%
d 5
 
4.5%
g 4
 
3.6%
Other values (12) 25
22.3%
Decimal Number
ValueCountFrequency (%)
0 4
20.0%
2 3
15.0%
5 3
15.0%
1 2
10.0%
3 2
10.0%
7 2
10.0%
6 1
 
5.0%
4 1
 
5.0%
8 1
 
5.0%
9 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 39
70.9%
. 8
 
14.5%
/ 6
 
10.9%
: 2
 
3.6%
Space Separator
ValueCountFrequency (%)
111
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1075
66.1%
Latin 317
 
19.5%
Common 235
 
14.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
147
13.7%
147
13.7%
116
 
10.8%
116
 
10.8%
66
 
6.1%
55
 
5.1%
24
 
2.2%
20
 
1.9%
15
 
1.4%
15
 
1.4%
Other values (125) 354
32.9%
Latin
ValueCountFrequency (%)
E 25
 
7.9%
I 19
 
6.0%
A 18
 
5.7%
T 17
 
5.4%
a 15
 
4.7%
e 14
 
4.4%
r 13
 
4.1%
C 12
 
3.8%
L 11
 
3.5%
R 11
 
3.5%
Other values (35) 162
51.1%
Common
ValueCountFrequency (%)
111
47.2%
, 39
 
16.6%
( 19
 
8.1%
) 19
 
8.1%
- 9
 
3.8%
. 8
 
3.4%
/ 6
 
2.6%
0 4
 
1.7%
2 3
 
1.3%
5 3
 
1.3%
Other values (10) 14
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1075
66.1%
ASCII 552
33.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
147
13.7%
147
13.7%
116
 
10.8%
116
 
10.8%
66
 
6.1%
55
 
5.1%
24
 
2.2%
20
 
1.9%
15
 
1.4%
15
 
1.4%
Other values (125) 354
32.9%
ASCII
ValueCountFrequency (%)
111
20.1%
, 39
 
7.1%
E 25
 
4.5%
( 19
 
3.4%
I 19
 
3.4%
) 19
 
3.4%
A 18
 
3.3%
T 17
 
3.1%
a 15
 
2.7%
e 14
 
2.5%
Other values (55) 256
46.4%

담배공급업체명
Text

MISSING 

Distinct189
Distinct (%)89.6%
Missing84
Missing (%)28.5%
Memory size2.4 KiB
2024-04-30T03:54:08.005800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length240
Median length83
Mean length35.663507
Min length2

Characters and Unicode

Total characters7525
Distinct characters244
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique176 ?
Unique (%)83.4%

Sample

1st row산둥중연공업유한책임공사
2nd rowShenzhen E-Cikar Electronics & Technology Co., Ltd
3rd row성세중국연초국제그룹(마카오)유한공사, SHENGSHI ZHONGGUO TOBACCO INTERNATIONAL GROUP(MACAO) CO.,LTD.
4th rowALD GROUP LIMITED
5th row深&#22323;市德馨&#15103;吾生物科技有限公司
ValueCountFrequency (%)
ltd 75
 
7.2%
co 62
 
6.0%
shenzhen 59
 
5.7%
technology 53
 
5.1%
co.,ltd 35
 
3.4%
tobacco 32
 
3.1%
17
 
1.6%
limited 17
 
1.6%
international 15
 
1.4%
inc 13
 
1.3%
Other values (409) 661
63.6%
2024-04-30T03:54:08.370664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
836
 
11.1%
o 473
 
6.3%
n 465
 
6.2%
e 422
 
5.6%
a 276
 
3.7%
t 262
 
3.5%
h 249
 
3.3%
i 244
 
3.2%
c 236
 
3.1%
C 217
 
2.9%
Other values (234) 3845
51.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3831
50.9%
Uppercase Letter 1763
23.4%
Space Separator 836
 
11.1%
Other Letter 509
 
6.8%
Other Punctuation 391
 
5.2%
Decimal Number 84
 
1.1%
Close Punctuation 46
 
0.6%
Open Punctuation 40
 
0.5%
Dash Punctuation 24
 
0.3%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
 
5.5%
27
 
5.3%
25
 
4.9%
21
 
4.1%
18
 
3.5%
13
 
2.6%
13
 
2.6%
12
 
2.4%
9
 
1.8%
8
 
1.6%
Other values (159) 335
65.8%
Lowercase Letter
ValueCountFrequency (%)
o 473
12.3%
n 465
12.1%
e 422
11.0%
a 276
 
7.2%
t 262
 
6.8%
h 249
 
6.5%
i 244
 
6.4%
c 236
 
6.2%
d 182
 
4.8%
l 168
 
4.4%
Other values (16) 854
22.3%
Uppercase Letter
ValueCountFrequency (%)
C 217
12.3%
L 216
12.3%
T 171
 
9.7%
S 122
 
6.9%
I 113
 
6.4%
O 110
 
6.2%
E 85
 
4.8%
H 82
 
4.7%
N 76
 
4.3%
A 70
 
4.0%
Other values (16) 501
28.4%
Decimal Number
ValueCountFrequency (%)
2 26
31.0%
1 20
23.8%
3 15
17.9%
8 4
 
4.8%
6 4
 
4.8%
7 4
 
4.8%
4 4
 
4.8%
5 4
 
4.8%
0 2
 
2.4%
9 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 213
54.5%
, 138
35.3%
& 17
 
4.3%
/ 11
 
2.8%
; 4
 
1.0%
# 4
 
1.0%
: 3
 
0.8%
' 1
 
0.3%
Space Separator
ValueCountFrequency (%)
836
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%
Math Symbol
ValueCountFrequency (%)
> 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5594
74.3%
Common 1422
 
18.9%
Hangul 476
 
6.3%
Han 33
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
 
5.9%
27
 
5.7%
25
 
5.3%
21
 
4.4%
18
 
3.8%
13
 
2.7%
13
 
2.7%
12
 
2.5%
9
 
1.9%
8
 
1.7%
Other values (141) 302
63.4%
Latin
ValueCountFrequency (%)
o 473
 
8.5%
n 465
 
8.3%
e 422
 
7.5%
a 276
 
4.9%
t 262
 
4.7%
h 249
 
4.5%
i 244
 
4.4%
c 236
 
4.2%
C 217
 
3.9%
L 216
 
3.9%
Other values (42) 2534
45.3%
Common
ValueCountFrequency (%)
836
58.8%
. 213
 
15.0%
, 138
 
9.7%
) 46
 
3.2%
( 40
 
2.8%
2 26
 
1.8%
- 24
 
1.7%
1 20
 
1.4%
& 17
 
1.2%
3 15
 
1.1%
Other values (13) 47
 
3.3%
Han
ValueCountFrequency (%)
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
Other values (8) 8
24.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7016
93.2%
Hangul 475
 
6.3%
CJK 33
 
0.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
836
 
11.9%
o 473
 
6.7%
n 465
 
6.6%
e 422
 
6.0%
a 276
 
3.9%
t 262
 
3.7%
h 249
 
3.5%
i 244
 
3.5%
c 236
 
3.4%
C 217
 
3.1%
Other values (65) 3336
47.5%
Hangul
ValueCountFrequency (%)
28
 
5.9%
27
 
5.7%
25
 
5.3%
21
 
4.4%
18
 
3.8%
13
 
2.7%
13
 
2.7%
12
 
2.5%
9
 
1.9%
8
 
1.7%
Other values (140) 301
63.4%
CJK
ValueCountFrequency (%)
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
3
 
9.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
Other values (8) 8
24.2%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Sample

개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)취급제품명담배공급업체명
061100002022611000001100000120220421<NA>1영업/정상BBBB영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 용산구 독서당로 **, B***호(한남동, 신성미소시티)<NA>(주)힐링1232022-04-21 00:00:00I2021-12-03 22:03:00.0<NA>200559.366155447696.887166<NA><NA>
161100001999611000001100000119990424<NA>1영업/정상3영업<NA><NA><NA><NA>25995147<NA><NA>서울특별시 서초구 반포동 반포*동 **-* (고속터미널 *-***)<NA><NA>동북아국제교역(주)2005-10-04 00:00:00I2018-08-31 23:59:59.0<NA><NA><NA>하나, 사슴등<NA>
261100001999611000001100000219990427<NA>1영업/정상3영업<NA><NA><NA><NA>25232696<NA><NA>서울특별시 강남구 포이동 ***-* (광진빌딩 ***호)<NA><NA>맨하탄2005-10-04 00:00:00I2018-08-31 23:59:59.0<NA><NA><NA>시가<NA>
361100002019611000001100000120190108<NA>2휴업1휴업<NA>2021052820220527<NA>218332084<NA><NA>전라북도 군산시 월명동 **번지 *호 **통 *반 ***동 ****호전라북도 군산시 월명*길 *, ***동 ****호 (월명동,다원클래시움아파트)<NA>씨케이티티(주)2021-05-31 00:00:00U2021-06-02 02:40:00.0<NA>173394.968972275996.282906궐련형산둥중연공업유한책임공사
461100002004611000001100000220040219<NA>1영업/정상3영업<NA><NA><NA><NA>27845823<NA><NA>서울특별시 영등포구 여의도동 **-** 서린빌딩 ***호<NA><NA>(주)국초쑥나라2005-10-04 00:00:00I2018-08-31 23:59:59.0<NA><NA><NA>궐련<NA>
561100002011611000001100002220110428<NA>2휴업1휴업<NA>2011112420121123<NA>7071198881<NA>152050서울특별시 구로구 구로동 ****번지 *호 부마빌딩 ***호서울특별시 구로구 시흥대로 ***, ***호 (구로동,부마빌딩)<NA>(주)그린시가2011-12-02 00:00:00I2018-08-31 23:59:59.0<NA>191015.406472442069.461636전자담배Shenzhen E-Cikar Electronics & Technology Co., Ltd
661100001999611000001100000319990624<NA>1영업/정상3영업<NA><NA><NA><NA>27230020<NA><NA>서울특별시 종로구 경운동 **-* (서원빌딩 **층)<NA><NA>팀오주2005-10-04 00:00:00I2018-08-31 23:59:59.0<NA><NA><NA>궐련<NA>
761100001990611000001100000119900625<NA>1영업/정상3영업<NA><NA><NA><NA>27338667<NA><NA>서울특별시 종로구 안국동 **서울특별시 종로구 북촌로*길 ** (안국동)<NA>(주)연일물산2013-12-30 00:00:00I2018-08-31 23:59:59.0<NA>198534.656818452759.811742<NA><NA>
861100001995611000001100000119950913<NA>1영업/정상3영업<NA><NA><NA><NA>24099534<NA><NA>서울특별시 송파구 거여동 ** 현대*차아파트 ***동 ***호서울특별시 송파구 오금로**길 **, ***동 ***호 (거여동,현대*차아파트)<NA>수아물산(주)2013-12-30 00:00:00I2018-08-31 23:59:59.0<NA>212944.468208443613.178458궐련. 시가<NA>
961100002020611000001100000120200114<NA>2휴업1휴업<NA>2021031920220112<NA><NA><NA>6163서울특별시 강남구 삼성동 ***번지 *호 대한해운사옥서울특별시 강남구 삼성로 ***, 대한해운사옥 *층 (삼성동)<NA>엘이에이파트너스 유한회사2021-04-12 00:00:00U2021-04-14 02:40:00.0<NA>204720.139443445572.258701궐련성세중국연초국제그룹(마카오)유한공사, SHENGSHI ZHONGGUO TOBACCO INTERNATIONAL GROUP(MACAO) CO.,LTD.
개방자치단체코드관리번호인허가일자인허가취소일자영업상태코드영업상태명상세영업상태코드상세영업상태명폐업일자휴업시작일자휴업종료일자재개업일자전화번호소재지면적소재지우편번호지번주소도로명주소도로명우편번호사업장명최종수정일자데이터갱신구분데이터갱신일자업태구분명좌표정보(X)좌표정보(Y)취급제품명담배공급업체명
28561100002020611000001100000420200728<NA>1영업/정상BBBB영업<NA><NA><NA><NA><NA><NA><NA>경기도 김포시 풍무동 **통 *반 신안아파트 ***동 ****호경기도 김포시 풍무로 ***, ***동 ****호 (풍무동,신안아파트)<NA>(주)엔에스티와이2020-07-28 00:00:00I2021-12-08 22:08:00.0<NA>175250.934231455962.378063<NA><NA>
28661100002020611000001100000320200331<NA>1영업/정상BBBB영업<NA><NA><NA><NA><NA><NA><NA>경기도 파주시 목동동 *통 *반 ***호경기도 파주시 가재울로 ***-**, ***호 (목동동)<NA>(주)브이앤라이프2020-03-31 00:00:00I2021-12-08 22:08:00.0<NA><NA><NA><NA><NA>
28761100002019611000001100002020190711<NA>1영업/정상BBBB영업<NA><NA><NA><NA>24906045<NA><NA>서울특별시 용산구 한강로*가 **번지 ***호서울특별시 용산구 한강대로**길 **, 용산역 (한강로*가)<NA>에이치디씨신라면세점㈜2021-01-20 00:00:00U2021-12-08 22:08:00.0<NA>196762.077395447480.039577<NA><NA>
28861100002019611000001100001620190613<NA>1영업/정상BBBB영업<NA><NA><NA><NA>260485358<NA><NA>서울특별시 서초구 서초동 **통 *반 ***동 ****호서울특별시 중구 퇴계로 **, 신세계백화점 **,**층 (충무로*가)<NA>㈜신세계디에프2021-01-12 00:00:00U2021-12-08 22:08:00.0<NA>201508.155823444032.949142<NA><NA>
28961100002019611000001100000820190410<NA>3폐업2폐업<NA>20201221<NA><NA>222303238<NA><NA>서울특별시 강남구 대치동 **통 *반 코오롱 R&F G*호서울특별시 강남구 역삼로**길 **, G*호 (대치동,코오롱 R&F)<NA>(주)호텔신라2020-12-21 00:00:00U2021-12-08 22:08:00.0<NA>205424.117389444573.538901<NA><NA>
29061100002018611000001100000320180108<NA>2휴업1휴업<NA>2021010120211231<NA>221353040<NA><NA>충청북도 청주시 흥덕구 복대동 ****번지 **통 *반 금호어울림아파트 ***동 ****호충청북도 청주시 흥덕구 대신로**번길 **, ***동 ****호 (복대동,금호어울림아파트)<NA>(주)니코텍코리아2020-12-16 00:00:00U2021-12-08 22:08:00.0<NA>238773.923541348939.67556<NA><NA>
29161100002016611000001100001720160926<NA>1영업/정상BBBB영업<NA><NA><NA><NA><NA><NA><NA><NA>서울특별시 서초구 사평대로**길 * (서초동)<NA>(주)엠케이캠프2020-12-10 00:00:00I2021-12-08 22:08:00.0<NA><NA><NA><NA><NA>
29261100001998611000001100000219980730<NA>1영업/정상3영업<NA><NA><NA><NA>24431856<NA><NA>서울특별시 송파구 가락동서울특별시 송파구 송파대로 ***, 제일오피스텔 ***호 (가락동)<NA>세은통상(주)2021-01-28 00:00:00U2021-12-08 22:08:00.0<NA>210455.417422443434.038943<NA><NA>
29361100001989611000001100000319890429<NA>1영업/정상BBBB영업<NA><NA><NA><NA>237070700<NA><NA><NA>서울특별시 영등포구 국제금융로 **, 서울국제금융센터 **층 (여의도동)<NA>한국필립모리스 주식회사2020-11-04 00:00:00I2021-12-08 22:08:00.0<NA>193326.931019446973.91457<NA><NA>
2946110000201261100000110000082012-08-08<NA>3폐업2폐업<NA>2023-10-04<NA><NA>221127100<NA><NA><NA>서울특별시 강남구 테헤란로 ***, **층 (역삼동, 강남파이낸스센터)<NA>브리티쉬아메리칸토바코코리아2023-10-10 00:00:00U2022-10-30 23:02:00.0<NA>203169.223281444178.357262<NA><NA>