Overview

Dataset statistics

Number of variables: 4
Number of observations: 695
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 22.5 KiB
Average record size in memory: 33.2 B
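
The average record size follows from the two totals above: total memory divided by the number of observations. A minimal check, assuming the report's KiB means 1024 bytes and that the 22.5 KiB figure is itself rounded:

```python
# Figures copied from the dataset statistics above.
total_kib = 22.5        # total size in memory, as reported (already rounded)
n_observations = 695

# Assumption: 1 KiB = 1024 bytes.
total_bytes = total_kib * 1024
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

Under these assumptions the result rounds to the reported 33.2 B.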

Variable types

Numeric: 1
Text: 2
Categorical: 1

Dataset

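Description: Designation status of tobacco retailers within Geumcheon-gu, Seoul. Provides fields such as the business name (업소명), the business road-name address (업소도로명주소), and the data reference date (데이터기준일자).
Author: Geumcheon-gu, Seoul Metropolitan City (서울특별시 금천구)
URL: https://www.data.go.kr/data/3081092/fileData.do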

Alerts

데이터기준일자 has constant value "2023-09-15" (Constant)
번호 has unique values (Unique)
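
Both alerts can be read off the per-variable distinct counts reported further down. The sketch below applies a simple rule to those counts. The rule and the `classify` helper are illustrative assumptions, not ydata-profiling's actual implementation.

```python
def classify(distinct, n_rows):
    """Hypothetical alert rule: one distinct value is 'Constant', all distinct is 'Unique'."""
    if distinct == 1:
        return "Constant"
    if distinct == n_rows:
        return "Unique"
    return None

n_rows = 695
# Distinct counts as reported in the Variables section of this report.
distinct_counts = {
    "번호": 695,
    "업소명": 675,
    "업소도로명주소": 688,
    "데이터기준일자": 1,
}

alerts = {}
for name, distinct in distinct_counts.items():
    alert = classify(distinct, n_rows)
    if alert is not None:
        alerts[name] = alert

print(alerts)
```

Only 번호 and 데이터기준일자 are flagged. The two text columns have repeated values, so neither rule applies to them.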

Reproduction

Analysis started: 2023-12-12 07:33:04.576749
Analysis finished: 2023-12-12 07:33:05.250976
Duration: 0.67 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
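
The reported duration is the difference between the two timestamps. A small sketch that recomputes it, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction section.
started = datetime.strptime("2023-12-12 07:33:04.576749", FMT)
finished = datetime.strptime("2023-12-12 07:33:05.250976", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```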

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 695
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 348
Minimum: 1
Maximum: 695
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 35.7
Q1: 174.5
Median: 348
Q3: 521.5
95-th percentile: 660.3
Maximum: 695
Range: 694
Interquartile range (IQR): 347

Descriptive statistics

Standard deviation: 200.7735
Coefficient of variation (CV): 0.57693536
Kurtosis: -1.2
Mean: 348
Median Absolute Deviation (MAD): 174
Skewness: 0
Sum: 241860
Variance: 40310
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
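
The quantile and descriptive statistics above are consistent with 번호 being a plain row counter. The sketch below recomputes them under that assumption (values exactly 1, 2, ..., 695), using linear-interpolation quantiles and the sample variance (n - 1 denominator). The `quantile` helper is illustrative and is not taken from ydata-profiling.

```python
import math
import statistics

# Assumption: 번호 is exactly the sequence 1..695 (unique, strictly increasing).
values = list(range(1, 696))

def quantile(sorted_values, q):
    """Linear-interpolation quantile at position q * (n - 1) of the sorted data."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

q05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
q3 = quantile(values, 0.75)
q95 = quantile(values, 0.95)
iqr = q3 - q1

print(total, mean, variance, round(std, 4), round(cv, 8))
print(round(q05, 1), q1, median, q3, round(q95, 1), iqr, mad)
```

Under this assumption the recomputed figures agree with the report, which suggests the column carries no information beyond row order.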
Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
468 | 1 | 0.1%
460 | 1 | 0.1%
461 | 1 | 0.1%
462 | 1 | 0.1%
463 | 1 | 0.1%
464 | 1 | 0.1%
465 | 1 | 0.1%
466 | 1 | 0.1%
467 | 1 | 0.1%
Other values (685) | 685 | 98.6%
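
Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
695 | 1 | 0.1%
694 | 1 | 0.1%
693 | 1 | 0.1%
692 | 1 | 0.1%
691 | 1 | 0.1%
690 | 1 | 0.1%
689 | 1 | 0.1%
688 | 1 | 0.1%
687 | 1 | 0.1%
686 | 1 | 0.1%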

업소명
Text

Distinct: 675
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.6 KiB

Length

Max length: 24
Median length: 20
Mean length: 7.6877698
Min length: 1

Characters and Unicode

Total characters: 5343
Distinct characters: 417
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
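
The mean length reported for this variable is the total character count divided by the number of observations. A minimal check, assuming lengths are counted in Unicode code points (which is what Python's `len` does for `str`):

```python
# Figures copied from the Length and Characters sections for 업소명.
total_characters = 5343
n_observations = 695

mean_length = total_characters / n_observations
print(f"{mean_length:.7f}")

# Length of the first sample row: each Hangul syllable and the space count as one character.
first_row = "씨유 가산현대아울렛점"
first_row_length = len(first_row)
print(first_row_length)
```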

Unique

Unique: 657
Unique (%): 94.5%

Sample

1st row: 씨유 가산현대아울렛점
2nd row: 씨유 독산롯데캐슬점
3rd row: 미니스톱 가산에이스점
4th row: 지에스(GS)25 독산반수점
5th row: (주)코리아세븐 LDCC점

Words

Value | Count | Frequency (%)
씨유 | 74 | 7.5%
gs25 | 40 | 4.0%
세븐일레븐 | 32 | 3.2%
미니스톱 | 21 | 2.1%
이마트24 | 18 | 1.8%
주)코리아세븐 | 13 | 1.3%
지에스(gs)25 | 7 | 0.7%
지에스25 | 6 | 0.6%
코레일유통(주 | 6 | 0.6%
편의점 | 4 | 0.4%
Other values (719) | 768 | 77.7%
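
The word table above shows lowercased tokens with leading and trailing punctuation removed (for example `주)코리아세븐` and `코레일유통(주`). The sketch below reproduces that shape on the five sample rows. The tokenization rule (whitespace split, lowercase, strip ASCII punctuation at the edges) is inferred from the table and is an assumption, not ydata-profiling's documented behaviour.

```python
import string
from collections import Counter

def word_counts(texts):
    """Count whitespace-separated tokens, lowercased, with edge punctuation stripped."""
    counts = Counter()
    for text in texts:
        for token in text.lower().split():
            token = token.strip(string.punctuation)
            if token:
                counts[token] += 1
    return counts

# The five sample rows of 업소명 shown above.
sample = [
    "씨유 가산현대아울렛점",
    "씨유 독산롯데캐슬점",
    "미니스톱 가산에이스점",
    "지에스(GS)25 독산반수점",
    "(주)코리아세븐 LDCC점",
]

counts = word_counts(sample)
for word, n in counts.most_common(3):
    print(word, n)
```

Note that the same chain appears under several spellings (`gs25`, `지에스(gs)25`, `지에스25`), so brand-level counts would need normalization before aggregation.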

Most occurring characters

ValueCountFrequency (%)
303
 
5.7%
297
 
5.6%
186
 
3.5%
2 124
 
2.3%
118
 
2.2%
118
 
2.2%
115
 
2.2%
114
 
2.1%
104
 
1.9%
103
 
1.9%
Other values (407) 3761
70.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4379 | 82.0%
Space Separator | 297 | 5.6%
Decimal Number | 276 | 5.2%
Uppercase Letter | 225 | 4.2%
Open Punctuation | 69 | 1.3%
Close Punctuation | 69 | 1.3%
Lowercase Letter | 18 | 0.3%
Other Punctuation | 7 | 0.1%
Dash Punctuation | 3 | 0.1%
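
The category names in this table correspond to Unicode general categories (Hangul syllables are `Lo`, "Other Letter"). A minimal sketch of how such counts can be produced with Python's standard `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative, and only two sample rows are used here.

```python
import unicodedata
from collections import Counter

# Two-letter general-category codes mapped to the names used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

def category_counts(texts):
    """Count characters per Unicode general category across all texts."""
    counts = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

counts = category_counts(["(주)코리아세븐 LDCC점", "지에스(GS)25 독산반수점"])
for name, n in counts.most_common():
    print(name, n)
```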

Most frequent character per category

Other Letter
ValueCountFrequency (%)
303
 
6.9%
186
 
4.2%
118
 
2.7%
118
 
2.7%
115
 
2.6%
114
 
2.6%
104
 
2.4%
103
 
2.4%
97
 
2.2%
95
 
2.2%
Other values (358) 3026
69.1%
Uppercase Letter
ValueCountFrequency (%)
G 79
35.1%
S 77
34.2%
C 18
 
8.0%
U 10
 
4.4%
L 7
 
3.1%
K 5
 
2.2%
I 3
 
1.3%
W 3
 
1.3%
T 3
 
1.3%
N 3
 
1.3%
Other values (10) 17
 
7.6%
Lowercase Letter
ValueCountFrequency (%)
e 4
22.2%
s 2
11.1%
t 2
11.1%
o 1
 
5.6%
y 1
 
5.6%
u 1
 
5.6%
b 1
 
5.6%
r 1
 
5.6%
a 1
 
5.6%
g 1
 
5.6%
Other values (3) 3
16.7%
Decimal Number
ValueCountFrequency (%)
2 124
44.9%
5 92
33.3%
4 27
 
9.8%
1 12
 
4.3%
3 9
 
3.3%
8 5
 
1.8%
0 2
 
0.7%
6 2
 
0.7%
7 2
 
0.7%
9 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 6
85.7%
& 1
 
14.3%
Space Separator
ValueCountFrequency (%)
297
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4379 | 82.0%
Common | 721 | 13.5%
Latin | 243 | 4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
303
 
6.9%
186
 
4.2%
118
 
2.7%
118
 
2.7%
115
 
2.6%
114
 
2.6%
104
 
2.4%
103
 
2.4%
97
 
2.2%
95
 
2.2%
Other values (358) 3026
69.1%
Latin
ValueCountFrequency (%)
G 79
32.5%
S 77
31.7%
C 18
 
7.4%
U 10
 
4.1%
L 7
 
2.9%
K 5
 
2.1%
e 4
 
1.6%
I 3
 
1.2%
W 3
 
1.2%
T 3
 
1.2%
Other values (23) 34
14.0%
Common
ValueCountFrequency (%)
297
41.2%
2 124
17.2%
5 92
 
12.8%
( 69
 
9.6%
) 69
 
9.6%
4 27
 
3.7%
1 12
 
1.7%
3 9
 
1.2%
. 6
 
0.8%
8 5
 
0.7%
Other values (6) 11
 
1.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4379 | 82.0%
ASCII | 964 | 18.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
303
 
6.9%
186
 
4.2%
118
 
2.7%
118
 
2.7%
115
 
2.6%
114
 
2.6%
104
 
2.4%
103
 
2.4%
97
 
2.2%
95
 
2.2%
Other values (358) 3026
69.1%
ASCII
ValueCountFrequency (%)
297
30.8%
2 124
12.9%
5 92
 
9.5%
G 79
 
8.2%
S 77
 
8.0%
( 69
 
7.2%
) 69
 
7.2%
4 27
 
2.8%
C 18
 
1.9%
1 12
 
1.2%
Other values (39) 100
 
10.4%

업소도로명주소
Text

Distinct: 688
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.6 KiB

Length

Max length: 57
Median length: 51
Mean length: 31.207194
Min length: 17

Characters and Unicode

Total characters: 21689
Distinct characters: 267
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 681
Unique (%): 98.0%

Sample

1st row: 서울특별시 금천구 디지털로10길 9. 현대시티아울렛 가산점 7층 (가산동)
2nd row: 서울특별시 금천구 범안로12길 44. B105.106호 (독산동)
3rd row: 서울특별시 금천구 가산디지털1로 145. 에이스하이엔드타워3차 104호 (가산동)
4th row: 서울특별시 금천구 독산로 313 (독산동)
5th row: 서울특별시 금천구 가산디지털2로 179. 롯데정보통신 1층 (가산동)

Words

Value | Count | Frequency (%)
서울특별시 | 695 | 16.8%
금천구 | 695 | 16.8%
독산동 | 263 | 6.3%
시흥동 | 218 | 5.3%
1층 | 174 | 4.2%
가산동 | 160 | 3.9%
시흥대로 | 68 | 1.6%
가산디지털1로 | 47 | 1.1%
독산로 | 46 | 1.1%
101호 | 36 | 0.9%
Other values (853) | 1742 | 42.0%
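
The three dong names in the word table (독산동, 시흥동, 가산동) come from the parenthesised suffix of each address. A hedged sketch of pulling that neighbourhood out with a regular expression, assuming the dong is the first item inside the parentheses and is followed by either `.` or `)`. The pattern and the `legal_dong` helper are illustrative. Addresses without a parenthesised suffix, such as the lot-number address in the last rows of the sample, yield `None`.

```python
import re

# Assumption: the dong is the first token inside parentheses and ends with "동".
DONG_PATTERN = re.compile(r"\(([^().\s]+동)[.)]")

def legal_dong(address):
    """Return the dong name found in the parenthesised suffix, or None."""
    match = DONG_PATTERN.search(address)
    return match.group(1) if match else None

# Addresses copied from the sample rows of this report.
addresses = [
    "서울특별시 금천구 독산로 313 (독산동)",
    "서울특별시 금천구 벚꽃로6길 3. 101동 지층 B04호 (독산동. 이랜드해가든아파트)",
    "서울특별시 금천구 시흥대로47길 43 (시흥동.럭키상가)",
    "서울특별시 금천구 시흥동 910번지 2 호",
]

dongs = [legal_dong(a) for a in addresses]
print(dongs)
```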

Most occurring characters

ValueCountFrequency (%)
3719
 
17.1%
1153
 
5.3%
1 1099
 
5.1%
768
 
3.5%
746
 
3.4%
743
 
3.4%
710
 
3.3%
704
 
3.2%
703
 
3.2%
697
 
3.2%
Other values (257) 10647
49.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 12560 | 57.9%
Space Separator | 3719 | 17.1%
Decimal Number | 3467 | 16.0%
Open Punctuation | 672 | 3.1%
Close Punctuation | 672 | 3.1%
Other Punctuation | 453 | 2.1%
Uppercase Letter | 78 | 0.4%
Dash Punctuation | 65 | 0.3%
Lowercase Letter | 3 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1153
 
9.2%
768
 
6.1%
746
 
5.9%
743
 
5.9%
710
 
5.7%
704
 
5.6%
703
 
5.6%
697
 
5.5%
695
 
5.5%
695
 
5.5%
Other values (222) 4946
39.4%
Uppercase Letter
ValueCountFrequency (%)
B 17
21.8%
A 12
15.4%
G 7
9.0%
C 7
9.0%
T 6
 
7.7%
L 5
 
6.4%
I 4
 
5.1%
K 4
 
5.1%
S 3
 
3.8%
E 3
 
3.8%
Other values (7) 10
12.8%
Decimal Number
ValueCountFrequency (%)
1 1099
31.7%
2 413
 
11.9%
0 356
 
10.3%
3 333
 
9.6%
4 285
 
8.2%
5 216
 
6.2%
6 209
 
6.0%
7 198
 
5.7%
8 197
 
5.7%
9 161
 
4.6%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
b 1
33.3%
a 1
33.3%
Space Separator
ValueCountFrequency (%)
3719
100.0%
Open Punctuation
ValueCountFrequency (%)
( 672
100.0%
Close Punctuation
ValueCountFrequency (%)
) 672
100.0%
Other Punctuation
ValueCountFrequency (%)
. 453
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 65
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 12560 | 57.9%
Common | 9048 | 41.7%
Latin | 81 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1153
 
9.2%
768
 
6.1%
746
 
5.9%
743
 
5.9%
710
 
5.7%
704
 
5.6%
703
 
5.6%
697
 
5.5%
695
 
5.5%
695
 
5.5%
Other values (222) 4946
39.4%
Latin
ValueCountFrequency (%)
B 17
21.0%
A 12
14.8%
G 7
8.6%
C 7
8.6%
T 6
 
7.4%
L 5
 
6.2%
I 4
 
4.9%
K 4
 
4.9%
S 3
 
3.7%
E 3
 
3.7%
Other values (10) 13
16.0%
Common
ValueCountFrequency (%)
3719
41.1%
1 1099
 
12.1%
( 672
 
7.4%
) 672
 
7.4%
. 453
 
5.0%
2 413
 
4.6%
0 356
 
3.9%
3 333
 
3.7%
4 285
 
3.1%
5 216
 
2.4%
Other values (5) 830
 
9.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 12560 | 57.9%
ASCII | 9129 | 42.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3719
40.7%
1 1099
 
12.0%
( 672
 
7.4%
) 672
 
7.4%
. 453
 
5.0%
2 413
 
4.5%
0 356
 
3.9%
3 333
 
3.6%
4 285
 
3.1%
5 216
 
2.4%
Other values (25) 911
 
10.0%
Hangul
ValueCountFrequency (%)
1153
 
9.2%
768
 
6.1%
746
 
5.9%
743
 
5.9%
710
 
5.7%
704
 
5.6%
703
 
5.6%
697
 
5.5%
695
 
5.5%
695
 
5.5%
Other values (222) 4946
39.4%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 5.6 KiB
2023-09-15: 695

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-09-15
2nd row: 2023-09-15
3rd row: 2023-09-15
4th row: 2023-09-15
5th row: 2023-09-15

Common Values

Value | Count | Frequency (%)
2023-09-15 | 695 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-09-15 | 695 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 번호 | 업소명 | 업소도로명주소 | 데이터기준일자
0 | 1 | 씨유 가산현대아울렛점 | 서울특별시 금천구 디지털로10길 9. 현대시티아울렛 가산점 7층 (가산동) | 2023-09-15
1 | 2 | 씨유 독산롯데캐슬점 | 서울특별시 금천구 범안로12길 44. B105.106호 (독산동) | 2023-09-15
2 | 3 | 미니스톱 가산에이스점 | 서울특별시 금천구 가산디지털1로 145. 에이스하이엔드타워3차 104호 (가산동) | 2023-09-15
3 | 4 | 지에스(GS)25 독산반수점 | 서울특별시 금천구 독산로 313 (독산동) | 2023-09-15
4 | 5 | (주)코리아세븐 LDCC점 | 서울특별시 금천구 가산디지털2로 179. 롯데정보통신 1층 (가산동) | 2023-09-15
5 | 6 | 미니스톱 가산노블루체점 | 서울특별시 금천구 가산로9길 17. G밸리노블루체스위트 104동 105호 (가산동) | 2023-09-15
6 | 7 | 씨유(CU)백광점 | 서울특별시 금천구 독산로 348 (독산동) | 2023-09-15
7 | 8 | 지에스25 시흥웨스트점 | 서울특별시 금천구 시흥대로57길 5. 웨스트밸리 1층 108호 (시흥동) | 2023-09-15
8 | 9 | 지에스25(GS25)금천해가든 | 서울특별시 금천구 벚꽃로6길 3. 101동 지층 B04호 (독산동. 이랜드해가든아파트) | 2023-09-15
9 | 10 | 세븐일레븐 독산역롯데캐슬점 | 서울특별시 금천구 벚꽃로 100. 상가 지하1층 6.7호 (독산동. 독산역 롯데캐슬) | 2023-09-15

Last rows

(index) | 번호 | 업소명 | 업소도로명주소 | 데이터기준일자
685 | 686 | 제일상회 | 서울특별시 금천구 가마산로 70 (가산동) | 2023-09-15
686 | 687 | 가미정 | 서울특별시 금천구 가마산로 76 (가산동) | 2023-09-15
687 | 688 | 산천초목 | 서울특별시 금천구 문성로 45 (독산동) | 2023-09-15
688 | 689 | 현슈퍼 | 서울특별시 금천구 시흥대로84나길 2 (독산동) | 2023-09-15
689 | 690 | 리빙데코 | 서울특별시 금천구 시흥동 910번지 2 호 | 2023-09-15
690 | 691 | 현대마트 | 서울특별시 금천구 시흥대로47길 43 (시흥동.럭키상가) | 2023-09-15
691 | 692 | 동네슈퍼 | 서울특별시 금천구 범안로12가길 20 (독산동) | 2023-09-15
692 | 693 | 잉꼬부동산 | 서울특별시 금천구 독산로43가길 24-8 (시흥동) | 2023-09-15
693 | 694 | 마마방 | 서울특별시 금천구 시흥대로58길 18 (시흥동) | 2023-09-15
694 | 695 | 공주슈퍼 | 서울특별시 금천구 두산로3길 60 (가산동) | 2023-09-15