Overview

Dataset statistics

Number of variables4
Number of observations6947
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory224.0 KiB
Average record size in memory33.0 B

Variable types

Numeric1
Text3

Dataset

Description복권 및 복권기금법에 따른 전국의 온라인복권 판매점 주소로 판매점명, 도로명주소, 일반주소로 데이터를 제공합니다.
Author기획재정부
URLhttps://www.data.go.kr/data/15086355/fileData.do

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:43:01.079825
Analysis finished2023-12-12 07:43:02.282645
Duration1.2 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct6947
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3474
Minimum1
Maximum6947
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size61.2 KiB
2023-12-12T16:43:02.366440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile348.3
Q11737.5
median3474
Q35210.5
95-th percentile6599.7
Maximum6947
Range6946
Interquartile range (IQR)3473

Descriptive statistics

Standard deviation2005.5705
Coefficient of variation (CV)0.57730872
Kurtosis-1.2
Mean3474
Median Absolute Deviation (MAD)1737
Skewness0
Sum24133878
Variance4022313
MonotonicityStrictly increasing
2023-12-12T16:43:02.630980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
4629 1
 
< 0.1%
4640 1
 
< 0.1%
4639 1
 
< 0.1%
4638 1
 
< 0.1%
4637 1
 
< 0.1%
4636 1
 
< 0.1%
4635 1
 
< 0.1%
4634 1
 
< 0.1%
4633 1
 
< 0.1%
Other values (6937) 6937
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
6947 1
< 0.1%
6946 1
< 0.1%
6945 1
< 0.1%
6944 1
< 0.1%
6943 1
< 0.1%
6942 1
< 0.1%
6941 1
< 0.1%
6940 1
< 0.1%
6939 1
< 0.1%
6938 1
< 0.1%

상호
Text

Distinct5231
Distinct (%)75.3%
Missing0
Missing (%)0.0%
Memory size54.4 KiB
2023-12-12T16:43:02.946729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length6.0695264
Min length1

Characters and Unicode

Total characters42165
Distinct characters765
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4840 ?
Unique (%)69.7%

Sample

1st row가로판매점제55호
2nd row아싸 대박 복권
3rd row루이방
4th row드림복권판매점
5th row불모산로또탐험대
ValueCountFrequency (%)
복권방 130
 
1.7%
로또 87
 
1.1%
행운복권방 75
 
1.0%
로또복권 72
 
0.9%
로또복권방 67
 
0.9%
복권명당 66
 
0.9%
복권나라 64
 
0.8%
복권판매점 60
 
0.8%
로또명당 52
 
0.7%
노다지복권방 52
 
0.7%
Other values (5306) 6933
90.5%
2023-12-12T16:43:03.451179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2640
 
6.3%
2444
 
5.8%
1884
 
4.5%
1811
 
4.3%
1718
 
4.1%
1266
 
3.0%
749
 
1.8%
) 741
 
1.8%
( 741
 
1.8%
711
 
1.7%
Other values (755) 27460
65.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37128
88.1%
Uppercase Letter 1348
 
3.2%
Decimal Number 1289
 
3.1%
Close Punctuation 741
 
1.8%
Open Punctuation 741
 
1.8%
Space Separator 711
 
1.7%
Lowercase Letter 137
 
0.3%
Other Punctuation 52
 
0.1%
Dash Punctuation 14
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2640
 
7.1%
2444
 
6.6%
1884
 
5.1%
1811
 
4.9%
1718
 
4.6%
1266
 
3.4%
749
 
2.0%
606
 
1.6%
568
 
1.5%
539
 
1.5%
Other values (685) 22903
61.7%
Uppercase Letter
ValueCountFrequency (%)
S 352
26.1%
G 352
26.1%
C 197
14.6%
U 174
12.9%
O 35
 
2.6%
T 32
 
2.4%
L 32
 
2.4%
A 22
 
1.6%
K 21
 
1.6%
N 20
 
1.5%
Other values (17) 111
 
8.2%
Lowercase Letter
ValueCountFrequency (%)
o 20
14.6%
t 16
11.7%
e 11
 
8.0%
a 11
 
8.0%
s 10
 
7.3%
g 7
 
5.1%
n 6
 
4.4%
u 6
 
4.4%
l 6
 
4.4%
i 5
 
3.6%
Other values (11) 39
28.5%
Decimal Number
ValueCountFrequency (%)
2 518
40.2%
5 421
32.7%
4 135
 
10.5%
1 92
 
7.1%
3 38
 
2.9%
6 29
 
2.2%
0 16
 
1.2%
7 16
 
1.2%
8 16
 
1.2%
9 8
 
0.6%
Other Punctuation
ValueCountFrequency (%)
. 19
36.5%
& 18
34.6%
/ 11
21.2%
! 3
 
5.8%
# 1
 
1.9%
Math Symbol
ValueCountFrequency (%)
+ 1
50.0%
~ 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 741
100.0%
Open Punctuation
ValueCountFrequency (%)
( 741
100.0%
Space Separator
ValueCountFrequency (%)
711
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37120
88.0%
Common 3550
 
8.4%
Latin 1485
 
3.5%
Han 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2640
 
7.1%
2444
 
6.6%
1884
 
5.1%
1811
 
4.9%
1718
 
4.6%
1266
 
3.4%
749
 
2.0%
606
 
1.6%
568
 
1.5%
539
 
1.5%
Other values (677) 22895
61.7%
Latin
ValueCountFrequency (%)
S 352
23.7%
G 352
23.7%
C 197
13.3%
U 174
11.7%
O 35
 
2.4%
T 32
 
2.2%
L 32
 
2.2%
A 22
 
1.5%
K 21
 
1.4%
N 20
 
1.3%
Other values (38) 248
16.7%
Common
ValueCountFrequency (%)
) 741
20.9%
( 741
20.9%
711
20.0%
2 518
14.6%
5 421
11.9%
4 135
 
3.8%
1 92
 
2.6%
3 38
 
1.1%
6 29
 
0.8%
. 19
 
0.5%
Other values (11) 105
 
3.0%
Han
ValueCountFrequency (%)
2
20.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37118
88.0%
ASCII 5033
 
11.9%
CJK 8
 
< 0.1%
None 4
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2640
 
7.1%
2444
 
6.6%
1884
 
5.1%
1811
 
4.9%
1718
 
4.6%
1266
 
3.4%
749
 
2.0%
606
 
1.6%
568
 
1.5%
539
 
1.5%
Other values (676) 22893
61.7%
ASCII
ValueCountFrequency (%)
) 741
14.7%
( 741
14.7%
711
14.1%
2 518
10.3%
5 421
8.4%
S 352
7.0%
G 352
7.0%
C 197
 
3.9%
U 174
 
3.5%
4 135
 
2.7%
Other values (57) 691
13.7%
CJK Compat Ideographs
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
CJK
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Distinct6934
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size54.4 KiB
2023-12-12T16:43:03.820237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length37
Mean length17.000864
Min length9

Characters and Unicode

Total characters118105
Distinct characters662
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6924 ?
Unique (%)99.7%

Sample

1st row서울 용산구 청파로 286
2nd row서울 은평구 진흥로13길 13-4
3rd row경기 과천시 중앙로 135-1
4th row전북 전주시 완산구 밤나무2길 2
5th row경남 김해시 금관대로 187
ValueCountFrequency (%)
경기 1701
 
5.6%
서울 1300
 
4.3%
경남 472
 
1.6%
부산 445
 
1.5%
인천 385
 
1.3%
충남 349
 
1.1%
경북 331
 
1.1%
대구 319
 
1.1%
전북 252
 
0.8%
전남 233
 
0.8%
Other values (7532) 24585
80.9%
2023-12-12T16:43:04.323956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23425
 
19.8%
6584
 
5.6%
4833
 
4.1%
1 4789
 
4.1%
3719
 
3.1%
2 3112
 
2.6%
2772
 
2.3%
3 2464
 
2.1%
2232
 
1.9%
4 2026
 
1.7%
Other values (652) 62149
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 71752
60.8%
Space Separator 23425
 
19.8%
Decimal Number 21788
 
18.4%
Dash Punctuation 959
 
0.8%
Uppercase Letter 100
 
0.1%
Other Punctuation 33
 
< 0.1%
Lowercase Letter 31
 
< 0.1%
Close Punctuation 6
 
< 0.1%
Open Punctuation 6
 
< 0.1%
Letter Number 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6584
 
9.2%
4833
 
6.7%
3719
 
5.2%
2772
 
3.9%
2232
 
3.1%
2011
 
2.8%
1953
 
2.7%
1919
 
2.7%
1829
 
2.5%
1825
 
2.5%
Other values (597) 42075
58.6%
Uppercase Letter
ValueCountFrequency (%)
S 19
19.0%
K 14
14.0%
I 8
 
8.0%
C 7
 
7.0%
G 6
 
6.0%
T 5
 
5.0%
H 5
 
5.0%
L 5
 
5.0%
E 4
 
4.0%
U 4
 
4.0%
Other values (12) 23
23.0%
Lowercase Letter
ValueCountFrequency (%)
e 6
19.4%
a 4
12.9%
i 4
12.9%
m 3
9.7%
y 2
 
6.5%
l 2
 
6.5%
r 2
 
6.5%
s 2
 
6.5%
k 1
 
3.2%
d 1
 
3.2%
Other values (4) 4
12.9%
Decimal Number
ValueCountFrequency (%)
1 4789
22.0%
2 3112
14.3%
3 2464
11.3%
4 2026
9.3%
5 1854
 
8.5%
6 1649
 
7.6%
7 1563
 
7.2%
0 1478
 
6.8%
8 1434
 
6.6%
9 1419
 
6.5%
Other Punctuation
ValueCountFrequency (%)
, 15
45.5%
. 9
27.3%
· 9
27.3%
Letter Number
ValueCountFrequency (%)
3
60.0%
2
40.0%
Space Separator
ValueCountFrequency (%)
23425
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 959
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 71752
60.8%
Common 46217
39.1%
Latin 136
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6584
 
9.2%
4833
 
6.7%
3719
 
5.2%
2772
 
3.9%
2232
 
3.1%
2011
 
2.8%
1953
 
2.7%
1919
 
2.7%
1829
 
2.5%
1825
 
2.5%
Other values (597) 42075
58.6%
Latin
ValueCountFrequency (%)
S 19
 
14.0%
K 14
 
10.3%
I 8
 
5.9%
C 7
 
5.1%
e 6
 
4.4%
G 6
 
4.4%
T 5
 
3.7%
H 5
 
3.7%
L 5
 
3.7%
E 4
 
2.9%
Other values (28) 57
41.9%
Common
ValueCountFrequency (%)
23425
50.7%
1 4789
 
10.4%
2 3112
 
6.7%
3 2464
 
5.3%
4 2026
 
4.4%
5 1854
 
4.0%
6 1649
 
3.6%
7 1563
 
3.4%
0 1478
 
3.2%
8 1434
 
3.1%
Other values (7) 2423
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 71752
60.8%
ASCII 46339
39.2%
None 9
 
< 0.1%
Number Forms 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23425
50.6%
1 4789
 
10.3%
2 3112
 
6.7%
3 2464
 
5.3%
4 2026
 
4.4%
5 1854
 
4.0%
6 1649
 
3.6%
7 1563
 
3.4%
0 1478
 
3.2%
8 1434
 
3.1%
Other values (42) 2545
 
5.5%
Hangul
ValueCountFrequency (%)
6584
 
9.2%
4833
 
6.7%
3719
 
5.2%
2772
 
3.9%
2232
 
3.1%
2011
 
2.8%
1953
 
2.7%
1919
 
2.7%
1829
 
2.5%
1825
 
2.5%
Other values (597) 42075
58.6%
None
ValueCountFrequency (%)
· 9
100.0%
Number Forms
ValueCountFrequency (%)
3
60.0%
2
40.0%
Distinct6933
Distinct (%)99.8%
Missing0
Missing (%)0.0%
Memory size54.4 KiB
2023-12-12T16:43:04.669281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length18.078595
Min length10

Characters and Unicode

Total characters125592
Distinct characters336
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6923 ?
Unique (%)99.7%

Sample

1st row서울 용산구 청파동2가 120-35
2nd row서울 은평구 대조동 33-2
3rd row경기 과천시 중앙동 40-1
4th row전북 전주시 완산구 효자동1가 625-10
5th row경남 김해시 대청동 800-11
ValueCountFrequency (%)
경기 1701
 
5.6%
서울 1300
 
4.3%
경남 472
 
1.6%
부산 445
 
1.5%
인천 385
 
1.3%
충남 349
 
1.2%
경북 331
 
1.1%
대구 319
 
1.1%
전북 252
 
0.8%
전남 233
 
0.8%
Other values (9264) 24330
80.8%
2023-12-12T16:43:05.184748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23170
 
18.4%
6637
 
5.3%
1 5968
 
4.8%
- 5728
 
4.6%
4806
 
3.8%
2 3749
 
3.0%
3565
 
2.8%
3 3176
 
2.5%
4 2856
 
2.3%
5 2690
 
2.1%
Other values (326) 63247
50.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67283
53.6%
Decimal Number 29406
23.4%
Space Separator 23170
 
18.4%
Dash Punctuation 5728
 
4.6%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6637
 
9.9%
4806
 
7.1%
3565
 
5.3%
2610
 
3.9%
2537
 
3.8%
2411
 
3.6%
2071
 
3.1%
1964
 
2.9%
1887
 
2.8%
1798
 
2.7%
Other values (312) 36997
55.0%
Decimal Number
ValueCountFrequency (%)
1 5968
20.3%
2 3749
12.7%
3 3176
10.8%
4 2856
9.7%
5 2690
9.1%
6 2513
8.5%
7 2309
 
7.9%
8 2108
 
7.2%
0 2033
 
6.9%
9 2004
 
6.8%
Other Punctuation
ValueCountFrequency (%)
, 4
80.0%
. 1
 
20.0%
Space Separator
ValueCountFrequency (%)
23170
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5728
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67283
53.6%
Common 58309
46.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6637
 
9.9%
4806
 
7.1%
3565
 
5.3%
2610
 
3.9%
2537
 
3.8%
2411
 
3.6%
2071
 
3.1%
1964
 
2.9%
1887
 
2.8%
1798
 
2.7%
Other values (312) 36997
55.0%
Common
ValueCountFrequency (%)
23170
39.7%
1 5968
 
10.2%
- 5728
 
9.8%
2 3749
 
6.4%
3 3176
 
5.4%
4 2856
 
4.9%
5 2690
 
4.6%
6 2513
 
4.3%
7 2309
 
4.0%
8 2108
 
3.6%
Other values (4) 4042
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67283
53.6%
ASCII 58309
46.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23170
39.7%
1 5968
 
10.2%
- 5728
 
9.8%
2 3749
 
6.4%
3 3176
 
5.4%
4 2856
 
4.9%
5 2690
 
4.6%
6 2513
 
4.3%
7 2309
 
4.0%
8 2108
 
3.6%
Other values (4) 4042
 
6.9%
Hangul
ValueCountFrequency (%)
6637
 
9.9%
4806
 
7.1%
3565
 
5.3%
2610
 
3.9%
2537
 
3.8%
2411
 
3.6%
2071
 
3.1%
1964
 
2.9%
1887
 
2.8%
1798
 
2.7%
Other values (312) 36997
55.0%

Interactions

2023-12-12T16:43:02.028918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T16:43:02.158460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:43:02.245748image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호상호도로명주소지번주소
01가로판매점제55호서울 용산구 청파로 286서울 용산구 청파동2가 120-35
12아싸 대박 복권서울 은평구 진흥로13길 13-4서울 은평구 대조동 33-2
23루이방경기 과천시 중앙로 135-1경기 과천시 중앙동 40-1
34드림복권판매점전북 전주시 완산구 밤나무2길 2전북 전주시 완산구 효자동1가 625-10
45불모산로또탐험대경남 김해시 금관대로 187경남 김해시 대청동 800-11
56단지정육점서울 노원구 섬밭로 287서울 노원구 중계동 504-1
67인생로또식물가게서울 종로구 삼일대로30길 21서울 종로구 낙원동 58-1
78충신 행운복권방서울 종로구 율곡로 253서울 종로구 충신동 27-17
89삼천리 하이퍼마켓서울 중랑구 겸재로36길 25서울 중랑구 면목동 506-42
910로또명당대구 북구 관음로 51대구 북구 관음동 1369
번호상호도로명주소지번주소
69376938CU(진주동문점)경남 진주시 망경로298번길 4경남 진주시 강남동 127
69386939GS25(서귀중문점)제주 서귀포시 천제연로 210제주 서귀포시 중문동 1964-4
69396940GS25(노형런던점)제주 제주시 다랑곶6길 51제주 제주시 노형동 1045-15
69406941GS25(노형현대점)제주 제주시 원노형로 84제주 제주시 노형동 723-1
69416942GS25(노형이화점)제주 제주시 원노형로 3제주 제주시 노형동 906-7
69426943GS25(제주도청점)제주 제주시 신대로 63제주 제주시 연동 301-10
69436944GS25(제주동교점)제주 제주시 동문로 68제주 제주시 일도이동 993-5
69446945GS25(제주삼화점)제주 제주시 화삼북로2길 32제주 제주시 화북일동 1023-2
69456946GS25(제주탑동점)제주 제주시 중앙로 21제주 제주시 건입동 1417
69466947GS25(노형대림점)제주 제주시 수덕로 75제주 제주시 노형동 2511-5