Overview

Dataset statistics

Number of variables: 4
Number of observations: 581
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 18.9 KiB
Average record size in memory: 33.2 B

Variable types

Numeric: 1
Text: 3

Dataset

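Description: Designation status of tobacco retailers in Yeonsu-gu (business name, address, etc.). Compiled from new tobacco retailer designations, businesses in normal operation, reference date, address, and so on.
Author: Yeonsu-gu, Incheon Metropolitan City (인천광역시 연수구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=3081033&srcSe=7661IVAWM27C61E190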

Alerts

연번 has unique values (Unique)

Reproduction

Analysis started: 2024-01-28 10:38:44.942740
Analysis finished: 2024-01-28 10:38:45.604515
Duration: 0.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 581
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 291
Minimum: 1
Maximum: 581
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 30
Q1: 146
Median: 291
Q3: 436
95-th percentile: 552
Maximum: 581
Range: 580
Interquartile range (IQR): 290
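
The quantile figures above can be reproduced by hand. The sketch below assumes that 연번 holds exactly the integers 1 to 581 (consistent with the minimum, maximum, distinct count, and strictly increasing order reported for this variable) and that percentiles are linearly interpolated between the observed minimum and maximum. Both are assumptions; the report itself does not state the interpolation rule.

```python
import statistics

# Assumption: the column is the plain sequence 1..581.
values = list(range(1, 582))

# method="inclusive" interpolates linearly between the observed
# minimum and maximum, treating the data as the whole population.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Splitting into 20 groups gives cut points at 5%, 10%, ..., 95%.
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

value_range = max(values) - min(values)
iqr = q3 - q1

print(p5, q1, median, q3, p95, value_range, iqr)
```

Under these assumptions the computed values agree with the table: the 5th and 95th percentiles sit 29 positions in from each end, and the IQR is Q3 minus Q1.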

Descriptive statistics

Standard deviation: 167.86453
Coefficient of variation (CV): 0.57685405
Kurtosis: -1.2
Mean: 291
Median Absolute Deviation (MAD): 145
Skewness: 0
Sum: 169071
Variance: 28178.5
Monotonicity: Strictly increasing
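
As a cross-check, most of these descriptive statistics follow directly from the sequence 1 to 581. This is a minimal sketch using only the standard library; the assumption that the column is exactly that sequence, and that the variance uses the sample (n - 1) denominator, is inferred from the reported numbers rather than stated in the report.

```python
import statistics

# Assumption: the column is the plain sequence 1..581.
values = list(range(1, 582))

total = sum(values)
mean = statistics.fmean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation

# Median absolute deviation: median of the distances from the median.
mad = statistics.median([abs(v - median) for v in values])

# Strictly increasing: every value is smaller than its successor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, variance, round(std_dev, 5), round(cv, 8), mad)
```

The coefficient of variation is simply the standard deviation divided by the mean, which is why it is dimensionless.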
Histogram with fixed size bins (bins=50)
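
To illustrate what fixed-size binning means here, the sketch below splits the range of 연번 into 50 equal-width bins and counts the values in each. It is an illustration only: the report's histogram is drawn by its plotting backend, and the assumption that the column is the sequence 1 to 581, as well as the helper name `bin_index`, are not taken from the report.

```python
from collections import Counter

# Assumption: the column is the plain sequence 1..581.
values = list(range(1, 582))
bins = 50
lo, hi = min(values), max(values)


def bin_index(value: int) -> int:
    """Map a value to one of `bins` equal-width bins over [lo, hi].

    Integer arithmetic avoids floating-point edge effects; the maximum
    value is placed in the last bin, so the last bin is closed on the right.
    """
    return min((value - lo) * bins // (hi - lo), bins - 1)


bin_counts = Counter(bin_index(v) for v in values)
print(sorted(set(bin_counts.values())))
```

Because each bin is 11.6 units wide and the values are consecutive integers, every bin ends up holding either 11 or 12 values, which is why such a histogram looks almost flat.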
Common values

Value | Count | Frequency (%)
1 | 1 | 0.2%
365 | 1 | 0.2%
385 | 1 | 0.2%
386 | 1 | 0.2%
387 | 1 | 0.2%
388 | 1 | 0.2%
389 | 1 | 0.2%
390 | 1 | 0.2%
391 | 1 | 0.2%
392 | 1 | 0.2%
Other values (571) | 571 | 98.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
581 | 1 | 0.2%
580 | 1 | 0.2%
579 | 1 | 0.2%
578 | 1 | 0.2%
577 | 1 | 0.2%
576 | 1 | 0.2%
575 | 1 | 0.2%
574 | 1 | 0.2%
573 | 1 | 0.2%
572 | 1 | 0.2%

업소명
Text

Distinct: 572
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 4.7 KiB

Length

Max length: 34
Median length: 24
Mean length: 10.149742
Min length: 2

Characters and Unicode

Total characters: 5897
Distinct characters: 397
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
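
As a rough illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one business name taken from this dataset's sample rows. The script and block breakdowns in the report need data that the standard library does not expose directly, so the Hangul check via the character name is only an approximation chosen for this example.

```python
import unicodedata
from collections import Counter

# One business name from the dataset's sample rows.
name = "지에스25 송도써밋점"

# unicodedata.category returns a two-letter general category code,
# e.g. "Lo" (Other Letter), "Nd" (Decimal Number), "Zs" (Space Separator).
category_counts = Counter(unicodedata.category(ch) for ch in name)

# Approximate script detection: Hangul syllables have names that
# start with "HANGUL". This is a shortcut, not a real script lookup.
hangul_count = sum(
    1 for ch in name if unicodedata.name(ch, "").startswith("HANGUL")
)

print(category_counts, hangul_count)
```

Hangul syllables fall under "Lo" (Other Letter), which is why that category dominates the counts for this column.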

Unique

Unique: 566
Unique (%): 97.4%
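
The difference between "Distinct" (572) and "Unique" (566) is easy to miss. The sketch below assumes the usual reading, in which distinct counts the different values present and unique counts the values that occur exactly once. The short list of names is a hypothetical miniature column, not the real data.

```python
from collections import Counter

# Hypothetical miniature column; the real 업소명 column has 581 rows.
names = ["씨유", "지엠25", "씨유", "담배멀티샵", "지엠25", "최강마트"]

counts = Counter(names)
distinct = len(counts)                               # different values present
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

# The percentages in the report are relative to the number of rows.
distinct_pct = round(572 / 581 * 100, 1)
unique_pct = round(566 / 581 * 100, 1)

print(distinct, unique, distinct_pct, unique_pct)
```

In the miniature column, 씨유 and 지엠25 each appear twice, so they count as distinct but not as unique.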

Sample

1st row: 씨유 연수대림점
2nd row: 지엠25
3rd row: 담배멀티샵
4th row: 복권명당 롯데몰점
5th row: 지에스25 송도써밋점

Value | Count | Frequency (%)
씨유 | 75 | 7.5%
gs25 | 49 | 4.9%
이마트24 | 48 | 4.8%
세븐일레븐 | 35 | 3.5%
지에스25 | 30 | 3.0%
주)코리아세븐 | 21 | 2.1%
전자담배 | 13 | 1.3%
cu | 10 | 1.0%
편의점 | 6 | 0.6%
송도점 | 5 | 0.5%
Other values (613) | 713 | 70.9%
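
The counts above are per word, not per row: the listed counts add up to 1005 tokens, and 75 of 1005 is about 7.5%. A word-frequency table of this kind can be sketched as follows, using the ten business names from the first rows of the dataset. Lowercasing and splitting on whitespace is an assumption, suggested by entries such as gs25 and cu; the report does not state its exact tokenization.

```python
from collections import Counter

# Business names from the first ten rows of the dataset.
names = [
    "씨유 연수대림점", "지엠25", "담배멀티샵", "복권명당 롯데몰점",
    "지에스25 송도써밋점", "씨유 호반8공구점", "씨유 송도호반써밋",
    "최강마트", "1공구테마", "씨유 송도형지점",
]

# Assumed tokenization: lowercase each name, then split on whitespace.
word_counts = Counter(word for name in names for word in name.lower().split())
total_words = sum(word_counts.values())

# Share of each word among all tokens, as a percentage.
frequencies = {
    word: round(count / total_words * 100, 1)
    for word, count in word_counts.items()
}

print(word_counts.most_common(3))
```

Even in this small sample the chain name 씨유 is the most frequent token, matching its position at the top of the full table.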

Most occurring characters

ValueCountFrequency (%)
424
 
7.2%
388
 
6.6%
198
 
3.4%
197
 
3.3%
2 170
 
2.9%
165
 
2.8%
158
 
2.7%
145
 
2.5%
115
 
2.0%
109
 
1.8%
Other values (387) 3828
64.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4651 | 78.9%
Space Separator | 424 | 7.2%
Decimal Number | 364 | 6.2%
Uppercase Letter | 259 | 4.4%
Close Punctuation | 85 | 1.4%
Open Punctuation | 84 | 1.4%
Lowercase Letter | 22 | 0.4%
Dash Punctuation | 3 | 0.1%
Other Punctuation | 3 | 0.1%
Math Symbol | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
388
 
8.3%
198
 
4.3%
197
 
4.2%
165
 
3.5%
158
 
3.4%
145
 
3.1%
115
 
2.5%
109
 
2.3%
106
 
2.3%
104
 
2.2%
Other values (334) 2966
63.8%
Uppercase Letter
ValueCountFrequency (%)
S 78
30.1%
G 75
29.0%
C 19
 
7.3%
U 17
 
6.6%
R 14
 
5.4%
E 10
 
3.9%
T 7
 
2.7%
L 6
 
2.3%
H 6
 
2.3%
A 5
 
1.9%
Other values (10) 22
 
8.5%
Lowercase Letter
ValueCountFrequency (%)
e 3
13.6%
m 3
13.6%
a 2
9.1%
t 2
9.1%
r 2
9.1%
k 2
9.1%
o 2
9.1%
i 1
 
4.5%
f 1
 
4.5%
l 1
 
4.5%
Other values (3) 3
13.6%
Decimal Number
ValueCountFrequency (%)
2 170
46.7%
5 109
29.9%
4 58
 
15.9%
1 14
 
3.8%
3 4
 
1.1%
0 3
 
0.8%
7 2
 
0.5%
8 2
 
0.5%
9 1
 
0.3%
6 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 83
97.6%
] 2
 
2.4%
Open Punctuation
ValueCountFrequency (%)
( 82
97.6%
[ 2
 
2.4%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
? 1
33.3%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
424
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4651 | 78.9%
Common | 965 | 16.4%
Latin | 281 | 4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
388
 
8.3%
198
 
4.3%
197
 
4.2%
165
 
3.5%
158
 
3.4%
145
 
3.1%
115
 
2.5%
109
 
2.3%
106
 
2.3%
104
 
2.2%
Other values (334) 2966
63.8%
Latin
ValueCountFrequency (%)
S 78
27.8%
G 75
26.7%
C 19
 
6.8%
U 17
 
6.0%
R 14
 
5.0%
E 10
 
3.6%
T 7
 
2.5%
L 6
 
2.1%
H 6
 
2.1%
A 5
 
1.8%
Other values (23) 44
15.7%
Common
ValueCountFrequency (%)
424
43.9%
2 170
17.6%
5 109
 
11.3%
) 83
 
8.6%
( 82
 
8.5%
4 58
 
6.0%
1 14
 
1.5%
3 4
 
0.4%
- 3
 
0.3%
0 3
 
0.3%
Other values (10) 15
 
1.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4651 | 78.9%
ASCII | 1246 | 21.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
424
34.0%
2 170
13.6%
5 109
 
8.7%
) 83
 
6.7%
( 82
 
6.6%
S 78
 
6.3%
G 75
 
6.0%
4 58
 
4.7%
C 19
 
1.5%
U 17
 
1.4%
Other values (43) 131
 
10.5%
Hangul
ValueCountFrequency (%)
388
 
8.3%
198
 
4.3%
197
 
4.2%
165
 
3.5%
158
 
3.4%
145
 
3.1%
115
 
2.5%
109
 
2.3%
106
 
2.3%
104
 
2.2%
Other values (334) 2966
63.8%

업소지번주소
Text

Distinct: 534
Distinct (%): 91.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.7 KiB

Length

Max length: 49
Median length: 41
Mean length: 26.975904
Min length: 1

Characters and Unicode

Total characters: 15673
Distinct characters: 299
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 510
Unique (%): 87.8%

Sample

1st row: 인천광역시 연수구 연수동 532 대림아파트
2nd row: 인천광역시 연수구 송도동 406 한진인천컨테이너터미널
3rd row: 인천광역시 연수구 연수동 543-9
4th row: 인천광역시 연수구 송도동 8-4 송도모아프라자
5th row: 인천광역시 연수구 송도동 312-4 호반써밋 송도

Value | Count | Frequency (%)
인천광역시 | 578 | 17.6%
연수구 | 578 | 17.6%
송도동 | 245 | 7.4%
연수동 | 104 | 3.2%
옥련동 | 66 | 2.0%
송도 | 62 | 1.9%
동춘동 | 60 | 1.8%
1호 | 54 | 1.6%
청학동 | 52 | 1.6%
선학동 | 33 | 1.0%
Other values (865) | 1461 | 44.4%

Most occurring characters

ValueCountFrequency (%)
2831
 
18.1%
717
 
4.6%
709
 
4.5%
708
 
4.5%
1 652
 
4.2%
604
 
3.9%
604
 
3.9%
600
 
3.8%
585
 
3.7%
582
 
3.7%
Other values (289) 7081
45.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9912 | 63.2%
Space Separator | 2831 | 18.1%
Decimal Number | 2557 | 16.3%
Dash Punctuation | 185 | 1.2%
Uppercase Letter | 120 | 0.8%
Other Punctuation | 21 | 0.1%
Lowercase Letter | 19 | 0.1%
Close Punctuation | 10 | 0.1%
Open Punctuation | 10 | 0.1%
Math Symbol | 6 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
717
 
7.2%
709
 
7.2%
708
 
7.1%
604
 
6.1%
604
 
6.1%
600
 
6.1%
585
 
5.9%
582
 
5.9%
580
 
5.9%
413
 
4.2%
Other values (242) 3810
38.4%
Uppercase Letter
ValueCountFrequency (%)
B 15
12.5%
D 10
 
8.3%
T 9
 
7.5%
E 9
 
7.5%
L 9
 
7.5%
C 9
 
7.5%
A 8
 
6.7%
I 6
 
5.0%
M 6
 
5.0%
U 6
 
5.0%
Other values (10) 33
27.5%
Decimal Number
ValueCountFrequency (%)
1 652
25.5%
2 339
13.3%
3 292
11.4%
5 244
 
9.5%
0 235
 
9.2%
4 202
 
7.9%
9 187
 
7.3%
6 153
 
6.0%
8 132
 
5.2%
7 121
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 5
26.3%
s 5
26.3%
t 4
21.1%
a 2
 
10.5%
i 1
 
5.3%
m 1
 
5.3%
c 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 16
76.2%
@ 3
 
14.3%
& 2
 
9.5%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
2831
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 185
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9912 | 63.2%
Common | 5620 | 35.9%
Latin | 141 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
717
 
7.2%
709
 
7.2%
708
 
7.1%
604
 
6.1%
604
 
6.1%
600
 
6.1%
585
 
5.9%
582
 
5.9%
580
 
5.9%
413
 
4.2%
Other values (242) 3810
38.4%
Latin
ValueCountFrequency (%)
B 15
 
10.6%
D 10
 
7.1%
T 9
 
6.4%
E 9
 
6.4%
L 9
 
6.4%
C 9
 
6.4%
A 8
 
5.7%
I 6
 
4.3%
M 6
 
4.3%
U 6
 
4.3%
Other values (19) 54
38.3%
Common
ValueCountFrequency (%)
2831
50.4%
1 652
 
11.6%
2 339
 
6.0%
3 292
 
5.2%
5 244
 
4.3%
0 235
 
4.2%
4 202
 
3.6%
9 187
 
3.3%
- 185
 
3.3%
6 153
 
2.7%
Other values (8) 300
 
5.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9912 | 63.2%
ASCII | 5759 | 36.7%
Number Forms | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2831
49.2%
1 652
 
11.3%
2 339
 
5.9%
3 292
 
5.1%
5 244
 
4.2%
0 235
 
4.1%
4 202
 
3.5%
9 187
 
3.2%
- 185
 
3.2%
6 153
 
2.7%
Other values (35) 439
 
7.6%
Hangul
ValueCountFrequency (%)
717
 
7.2%
709
 
7.2%
708
 
7.1%
604
 
6.1%
604
 
6.1%
600
 
6.1%
585
 
5.9%
582
 
5.9%
580
 
5.9%
413
 
4.2%
Other values (242) 3810
38.4%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

업소도로명주소
Text

Distinct: 564
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.7 KiB

Length

Max length: 70
Median length: 52
Mean length: 37.383821
Min length: 1

Characters and Unicode

Total characters: 21720
Distinct characters: 341
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 562
Unique (%): 96.7%

Sample

1st row: 인천광역시 연수구 원인재로 286. 대림아파트 상가동 111.111-1.112호 (연수동)
2nd row: 인천광역시 연수구 인천신항대로 777. 한진인천컨테이너터미널 1층 (송도동)
3rd row: 인천광역시 연수구 먼우금로251번길 1. 104호 (연수동)
4th row: 인천광역시 연수구 인천타워대로132번길 24. 송도모아프라자 105호 (송도동)
5th row: 인천광역시 연수구 랜드마크로 20. 140호 (송도동. 호반써밋 송도)

Value | Count | Frequency (%)
인천광역시 | 564 | 14.1%
연수구 | 564 | 14.1%
송도동 | 228 | 5.7%
1층 | 129 | 3.2%
연수동 | 103 | 2.6%
송도 | 61 | 1.5%
옥련동 | 60 | 1.5%
동춘동 | 52 | 1.3%
상가동 | 49 | 1.2%
청학동 | 43 | 1.1%
Other values (998) | 2137 | 53.6%

Most occurring characters

ValueCountFrequency (%)
3494
 
16.1%
1 1204
 
5.5%
785
 
3.6%
. 709
 
3.3%
698
 
3.2%
698
 
3.2%
646
 
3.0%
634
 
2.9%
614
 
2.8%
587
 
2.7%
Other values (331) 11651
53.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 12627 | 58.1%
Space Separator | 3494 | 16.1%
Decimal Number | 3464 | 15.9%
Other Punctuation | 715 | 3.3%
Close Punctuation | 579 | 2.7%
Open Punctuation | 579 | 2.7%
Uppercase Letter | 147 | 0.7%
Dash Punctuation | 63 | 0.3%
Math Symbol | 31 | 0.1%
Lowercase Letter | 19 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
785
 
6.2%
698
 
5.5%
698
 
5.5%
646
 
5.1%
634
 
5.0%
614
 
4.9%
587
 
4.6%
573
 
4.5%
568
 
4.5%
566
 
4.5%
Other values (284) 6258
49.6%
Uppercase Letter
ValueCountFrequency (%)
B 24
16.3%
C 16
10.9%
A 15
 
10.2%
D 12
 
8.2%
E 9
 
6.1%
L 7
 
4.8%
T 7
 
4.8%
F 7
 
4.8%
I 6
 
4.1%
S 6
 
4.1%
Other values (10) 38
25.9%
Decimal Number
ValueCountFrequency (%)
1 1204
34.8%
2 445
 
12.8%
0 440
 
12.7%
3 287
 
8.3%
4 223
 
6.4%
5 195
 
5.6%
8 179
 
5.2%
6 176
 
5.1%
7 162
 
4.7%
9 153
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
e 5
26.3%
s 5
26.3%
t 4
21.1%
a 2
 
10.5%
c 1
 
5.3%
i 1
 
5.3%
m 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
. 709
99.2%
& 3
 
0.4%
@ 3
 
0.4%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
3494
100.0%
Close Punctuation
ValueCountFrequency (%)
) 579
100.0%
Open Punctuation
ValueCountFrequency (%)
( 579
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%
Math Symbol
ValueCountFrequency (%)
~ 31
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 12627 | 58.1%
Common | 8925 | 41.1%
Latin | 168 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
785
 
6.2%
698
 
5.5%
698
 
5.5%
646
 
5.1%
634
 
5.0%
614
 
4.9%
587
 
4.6%
573
 
4.5%
568
 
4.5%
566
 
4.5%
Other values (284) 6258
49.6%
Latin
ValueCountFrequency (%)
B 24
14.3%
C 16
 
9.5%
A 15
 
8.9%
D 12
 
7.1%
E 9
 
5.4%
L 7
 
4.2%
T 7
 
4.2%
F 7
 
4.2%
I 6
 
3.6%
S 6
 
3.6%
Other values (19) 59
35.1%
Common
ValueCountFrequency (%)
3494
39.1%
1 1204
 
13.5%
. 709
 
7.9%
) 579
 
6.5%
( 579
 
6.5%
2 445
 
5.0%
0 440
 
4.9%
3 287
 
3.2%
4 223
 
2.5%
5 195
 
2.2%
Other values (8) 770
 
8.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 12627 | 58.1%
ASCII | 9091 | 41.9%
Number Forms | 2 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3494
38.4%
1 1204
 
13.2%
. 709
 
7.8%
) 579
 
6.4%
( 579
 
6.4%
2 445
 
4.9%
0 440
 
4.8%
3 287
 
3.2%
4 223
 
2.5%
5 195
 
2.1%
Other values (35) 936
 
10.3%
Hangul
ValueCountFrequency (%)
785
 
6.2%
698
 
5.5%
698
 
5.5%
646
 
5.1%
634
 
5.0%
614
 
4.9%
587
 
4.6%
573
 
4.5%
568
 
4.5%
566
 
4.5%
Other values (284) 6258
49.6%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

Interactions

[Figure: interaction plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

index | 연번 | 업소명 | 업소지번주소 | 업소도로명주소
0 | 1 | 씨유 연수대림점 | 인천광역시 연수구 연수동 532 대림아파트 | 인천광역시 연수구 원인재로 286. 대림아파트 상가동 111.111-1.112호 (연수동)
1 | 2 | 지엠25 | 인천광역시 연수구 송도동 406 한진인천컨테이너터미널 | 인천광역시 연수구 인천신항대로 777. 한진인천컨테이너터미널 1층 (송도동)
2 | 3 | 담배멀티샵 | 인천광역시 연수구 연수동 543-9 | 인천광역시 연수구 먼우금로251번길 1. 104호 (연수동)
3 | 4 | 복권명당 롯데몰점 | 인천광역시 연수구 송도동 8-4 송도모아프라자 | 인천광역시 연수구 인천타워대로132번길 24. 송도모아프라자 105호 (송도동)
4 | 5 | 지에스25 송도써밋점 | 인천광역시 연수구 송도동 312-4 호반써밋 송도 | 인천광역시 연수구 랜드마크로 20. 140호 (송도동. 호반써밋 송도)
5 | 6 | 씨유 호반8공구점 | 인천광역시 연수구 송도동 312-4 호반써밋 송도 | 인천광역시 연수구 랜드마크로 20. 158~159호 (송도동. 호반써밋 송도)
6 | 7 | 씨유 송도호반써밋 | 인천광역시 연수구 송도동 312-4 호반써밋 송도 | 인천광역시 연수구 랜드마크로 20. 114~115호 (송도동. 호반써밋 송도)
7 | 8 | 최강마트 | 인천광역시 연수구 연수동 577 영남스포츠센터 | 인천광역시 연수구 새말로107번길 16. 영남스포츠센터 1층 (연수동)
8 | 9 | 1공구테마 | 인천광역시 연수구 송도동 21-38 더 마란츠타워 | 인천광역시 연수구 신송로 153. 더 마란츠타워 106호 (송도동)
9 | 10 | 씨유 송도형지점 | 인천광역시 연수구 송도동 11-2 형지글로벌 패션복합센터 | 인천광역시 연수구 하모니로177번길 49. 형지글로벌 패션복합센터 (송도동)

Last rows

index | 연번 | 업소명 | 업소지번주소 | 업소도로명주소
571 | 572 | 이마트24R 연수우성점 | 인천광역시 연수구 연수동 634번지 연수우성2차아파트 상가동 103-6호 | 인천광역시 연수구 원인재로 180. 상가동 103-6호 (연수동. 연수우성2차아파트)
572 | 573 | (주)코리아세븐 | 인천광역시 연수구 옥련동 552-7호 |
573 | 574 | 풍림상사 | 인천광역시 연수구 연수동 582번지 | 인천광역시 연수구 함박뫼로 250 (연수동)
574 | 575 | 연수신발 | 인천광역시 연수구 연수동 호 410동 상가 |
575 | 576 | 세븐일레븐 인천연수동남점 | 인천광역시 연수구 동춘2동 943번지 동남아파트 상가동 131호 | 인천광역시 연수구 먼우금로 19. 상가동 131호 (동춘동. 동남아파트)
576 | 577 | (주)코리아세븐 인천연수점 | 인천광역시 연수구 연수동 631호 | 인천광역시 연수구 샘말로8번길 4 (연수동)
577 | 578 | 장군슈퍼 | 인천광역시 연수구 옥련동 580번지 9 호 | 인천광역시 연수구 청량로 104 (옥련동)
578 | 579 | 남인천농협 | 인천광역시 연수구 옥련동 334번지 1 호 | 인천광역시 연수구 한나루로 188 (옥련동)
579 | 580 | 경북슈퍼 | 인천광역시 연수구 옥련동 548번지 1 호 | 인천광역시 연수구 능허대로 195 (옥련동)
580 | 581 | 대한슈퍼 | 인천광역시 연수구 동춘동 940호 대림3차아파트상가 103 | 인천광역시 연수구 먼우금로83번길 49. 103호 (동춘동.대림3차아파트상가)