Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells3407
Missing cells (%)8.5%
Duplicate rows15
Duplicate rows (%)0.1%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Text3
Categorical1

Dataset

Description광주광역시 아동급식카드 가맹점 현황으로 가맹점명, 업종명, 소재지도로명주소, 전화번호 데이터를 제공합니다.
Author광주광역시
URLhttps://www.data.go.kr/data/15100167/fileData.do

Alerts

Dataset has 15 (0.1%) duplicate rowsDuplicates
업종명 is highly imbalanced (50.0%)Imbalance
전화번호 has 3399 (34.0%) missing valuesMissing

Reproduction

Analysis started2024-03-14 20:22:45.736312
Analysis finished2024-03-14 20:22:48.564113
Duration2.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9200
Distinct (%)92.0%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-15T05:22:49.457872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length30
Mean length6.9535814
Min length1

Characters and Unicode

Total characters69508
Distinct characters1055
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8674 ?
Unique (%)86.8%

Sample

1st row난타5000(문흥점)
2nd row에이원
3rd row뎅뎅이네덮밥&짜글이
4th row명랑시대쌀핫도그화정힐스테이트
5th row담은보쌈칼국수
ValueCountFrequency (%)
세븐일레븐 182
 
1.4%
gs25 121
 
1.0%
이마트24 102
 
0.8%
씨유(cu 85
 
0.7%
주식회사 84
 
0.7%
지에스(gs)25 53
 
0.4%
상무점 45
 
0.4%
주)코리아세븐 42
 
0.3%
수완점 41
 
0.3%
첨단점 38
 
0.3%
Other values (9359) 11887
93.7%
2024-03-15T05:22:50.823513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3064
 
4.4%
2689
 
3.9%
1567
 
2.3%
1310
 
1.9%
1224
 
1.8%
( 1042
 
1.5%
) 1040
 
1.5%
860
 
1.2%
781
 
1.1%
769
 
1.1%
Other values (1045) 55162
79.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59942
86.2%
Space Separator 2689
 
3.9%
Uppercase Letter 2007
 
2.9%
Decimal Number 1666
 
2.4%
Open Punctuation 1042
 
1.5%
Close Punctuation 1040
 
1.5%
Lowercase Letter 913
 
1.3%
Other Punctuation 159
 
0.2%
Dash Punctuation 33
 
< 0.1%
Modifier Symbol 9
 
< 0.1%
Other values (3) 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3064
 
5.1%
1567
 
2.6%
1310
 
2.2%
1224
 
2.0%
860
 
1.4%
781
 
1.3%
769
 
1.3%
714
 
1.2%
682
 
1.1%
639
 
1.1%
Other values (966) 48332
80.6%
Uppercase Letter
ValueCountFrequency (%)
S 282
14.1%
G 261
13.0%
C 231
11.5%
U 222
11.1%
B 112
 
5.6%
E 95
 
4.7%
T 92
 
4.6%
A 91
 
4.5%
O 63
 
3.1%
N 61
 
3.0%
Other values (16) 497
24.8%
Lowercase Letter
ValueCountFrequency (%)
e 125
13.7%
a 94
 
10.3%
o 77
 
8.4%
n 71
 
7.8%
i 59
 
6.5%
r 51
 
5.6%
t 48
 
5.3%
s 46
 
5.0%
m 38
 
4.2%
h 35
 
3.8%
Other values (16) 269
29.5%
Decimal Number
ValueCountFrequency (%)
2 532
31.9%
5 321
19.3%
0 174
 
10.4%
4 161
 
9.7%
1 155
 
9.3%
9 89
 
5.3%
8 74
 
4.4%
3 64
 
3.8%
6 49
 
2.9%
7 47
 
2.8%
Other Punctuation
ValueCountFrequency (%)
& 110
69.2%
. 40
 
25.2%
, 3
 
1.9%
! 2
 
1.3%
: 2
 
1.3%
· 1
 
0.6%
# 1
 
0.6%
Math Symbol
ValueCountFrequency (%)
= 3
50.0%
+ 2
33.3%
× 1
 
16.7%
Space Separator
ValueCountFrequency (%)
2689
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1042
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1040
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Modifier Symbol
ValueCountFrequency (%)
´ 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59933
86.2%
Common 6646
 
9.6%
Latin 2920
 
4.2%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3064
 
5.1%
1567
 
2.6%
1310
 
2.2%
1224
 
2.0%
860
 
1.4%
781
 
1.3%
769
 
1.3%
714
 
1.2%
682
 
1.1%
639
 
1.1%
Other values (959) 48323
80.6%
Latin
ValueCountFrequency (%)
S 282
 
9.7%
G 261
 
8.9%
C 231
 
7.9%
U 222
 
7.6%
e 125
 
4.3%
B 112
 
3.8%
E 95
 
3.3%
a 94
 
3.2%
T 92
 
3.2%
A 91
 
3.1%
Other values (42) 1315
45.0%
Common
ValueCountFrequency (%)
2689
40.5%
( 1042
 
15.7%
) 1040
 
15.6%
2 532
 
8.0%
5 321
 
4.8%
0 174
 
2.6%
4 161
 
2.4%
1 155
 
2.3%
& 110
 
1.7%
9 89
 
1.3%
Other values (17) 333
 
5.0%
Han
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59933
86.2%
ASCII 9554
 
13.7%
None 11
 
< 0.1%
CJK 9
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3064
 
5.1%
1567
 
2.6%
1310
 
2.2%
1224
 
2.0%
860
 
1.4%
781
 
1.3%
769
 
1.3%
714
 
1.2%
682
 
1.1%
639
 
1.1%
Other values (959) 48323
80.6%
ASCII
ValueCountFrequency (%)
2689
28.1%
( 1042
 
10.9%
) 1040
 
10.9%
2 532
 
5.6%
5 321
 
3.4%
S 282
 
3.0%
G 261
 
2.7%
C 231
 
2.4%
U 222
 
2.3%
0 174
 
1.8%
Other values (65) 2760
28.9%
None
ValueCountFrequency (%)
´ 9
81.8%
· 1
 
9.1%
× 1
 
9.1%
CJK
ValueCountFrequency (%)
3
33.3%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
1
 
11.1%
Punctuation
ValueCountFrequency (%)
1
100.0%

업종명
Categorical

IMBALANCE 

Distinct15
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
한식
5596 
일반대중음식
2232 
편의점
892 
중식
 
325
패스트푸드
 
320
Other values (10)
635 

Length

Max length8
Median length2
Mean length3.1145
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row패스트푸드
2nd row한식
3rd row한식
4th row일반대중음식
5th row한식

Common Values

ValueCountFrequency (%)
한식 5596
56.0%
일반대중음식 2232
 
22.3%
편의점 892
 
8.9%
중식 325
 
3.2%
패스트푸드 320
 
3.2%
제과점 270
 
2.7%
일식 180
 
1.8%
양식 156
 
1.6%
식품잡화 10
 
0.1%
농협(마트) 7
 
0.1%
Other values (5) 12
 
0.1%

Length

2024-03-15T05:22:51.078351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 5596
56.0%
일반대중음식 2232
 
22.3%
편의점 892
 
8.9%
중식 325
 
3.2%
패스트푸드 320
 
3.2%
제과점 270
 
2.7%
일식 180
 
1.8%
양식 156
 
1.6%
식품잡화 10
 
0.1%
농협(마트 7
 
0.1%
Other values (5) 12
 
0.1%

전화번호
Text

MISSING 

Distinct6357
Distinct (%)96.3%
Missing3399
Missing (%)34.0%
Memory size156.2 KiB
2024-03-15T05:22:52.140625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.002121
Min length12

Characters and Unicode

Total characters79226
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6234 ?
Unique (%)94.4%

Sample

1st row062-262-7000
2nd row062-366-8596
3rd row062-381-7734
4th row062-373-5919
5th row062-367-9993
ValueCountFrequency (%)
062-443-7375 24
 
0.4%
062-514-0580 20
 
0.3%
062-371-3471 16
 
0.2%
062-371-3470 15
 
0.2%
062-514-8899 13
 
0.2%
062-463-3090 12
 
0.2%
062-267-2009 11
 
0.2%
062-512-0209 9
 
0.1%
062-385-3181 6
 
0.1%
062-522-3191 5
 
0.1%
Other values (6353) 6477
98.0%
2024-03-15T05:22:53.622300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 13209
16.7%
- 13195
16.7%
6 10836
13.7%
0 10229
12.9%
5 6072
7.7%
9 5150
 
6.5%
3 5135
 
6.5%
7 4286
 
5.4%
1 4088
 
5.2%
4 3532
 
4.5%
Other values (4) 3494
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 66010
83.3%
Dash Punctuation 13195
 
16.7%
Open Punctuation 7
 
< 0.1%
Close Punctuation 7
 
< 0.1%
Space Separator 7
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 13209
20.0%
6 10836
16.4%
0 10229
15.5%
5 6072
9.2%
9 5150
 
7.8%
3 5135
 
7.8%
7 4286
 
6.5%
1 4088
 
6.2%
4 3532
 
5.4%
8 3473
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 13195
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 79226
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 13209
16.7%
- 13195
16.7%
6 10836
13.7%
0 10229
12.9%
5 6072
7.7%
9 5150
 
6.5%
3 5135
 
6.5%
7 4286
 
5.4%
1 4088
 
5.2%
4 3532
 
4.5%
Other values (4) 3494
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79226
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 13209
16.7%
- 13195
16.7%
6 10836
13.7%
0 10229
12.9%
5 6072
7.7%
9 5150
 
6.5%
3 5135
 
6.5%
7 4286
 
5.4%
1 4088
 
5.2%
4 3532
 
4.5%
Other values (4) 3494
 
4.4%
Distinct9723
Distinct (%)97.3%
Missing4
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-15T05:22:55.067810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length56
Mean length24.927871
Min length12

Characters and Unicode

Total characters249179
Distinct characters498
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9483 ?
Unique (%)94.9%

Sample

1st row광주 북구 문흥2동 1011-1번지 중흥아파트상가동 4호
2nd row광주광역시 남구 효덕로 291 1동 2층 204호
3rd row광주광역시 광산구 첨단내촌로36번길 26 1층
4th row광주광역시 서구 화정로 256 1층 (화정동)
5th row광주광역시 서구 월드컵4강로229번길 3-1 1층
ValueCountFrequency (%)
광주광역시 6720
 
12.3%
1층 4777
 
8.7%
광주 3287
 
6.0%
광산구 3001
 
5.5%
북구 2602
 
4.8%
서구 2067
 
3.8%
남구 1238
 
2.3%
동구 1106
 
2.0%
668
 
1.2%
2층 343
 
0.6%
Other values (5395) 28856
52.8%
2024-03-15T05:22:56.819915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47458
19.0%
20037
 
8.0%
1 16752
 
6.7%
10278
 
4.1%
10106
 
4.1%
9464
 
3.8%
7440
 
3.0%
6890
 
2.8%
6724
 
2.7%
2 6668
 
2.7%
Other values (488) 107362
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138354
55.5%
Decimal Number 50414
 
20.2%
Space Separator 47458
 
19.0%
Close Punctuation 4588
 
1.8%
Open Punctuation 4587
 
1.8%
Dash Punctuation 2813
 
1.1%
Other Punctuation 690
 
0.3%
Uppercase Letter 227
 
0.1%
Math Symbol 26
 
< 0.1%
Lowercase Letter 22
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20037
 
14.5%
10278
 
7.4%
10106
 
7.3%
9464
 
6.8%
7440
 
5.4%
6890
 
5.0%
6724
 
4.9%
6270
 
4.5%
5083
 
3.7%
4789
 
3.5%
Other values (440) 51273
37.1%
Uppercase Letter
ValueCountFrequency (%)
B 74
32.6%
A 58
25.6%
S 23
 
10.1%
C 12
 
5.3%
L 9
 
4.0%
K 9
 
4.0%
E 7
 
3.1%
D 6
 
2.6%
R 5
 
2.2%
H 5
 
2.2%
Other values (10) 19
 
8.4%
Decimal Number
ValueCountFrequency (%)
1 16752
33.2%
2 6668
 
13.2%
0 4710
 
9.3%
3 4346
 
8.6%
4 3490
 
6.9%
5 3411
 
6.8%
6 3078
 
6.1%
7 2837
 
5.6%
8 2618
 
5.2%
9 2504
 
5.0%
Lowercase Letter
ValueCountFrequency (%)
t 4
18.2%
b 4
18.2%
e 4
18.2%
h 3
13.6%
a 3
13.6%
p 1
 
4.5%
k 1
 
4.5%
s 1
 
4.5%
c 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
. 682
98.8%
, 7
 
1.0%
& 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 13
50.0%
= 13
50.0%
Space Separator
ValueCountFrequency (%)
47458
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4588
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4587
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2813
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138354
55.5%
Common 110576
44.4%
Latin 249
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20037
 
14.5%
10278
 
7.4%
10106
 
7.3%
9464
 
6.8%
7440
 
5.4%
6890
 
5.0%
6724
 
4.9%
6270
 
4.5%
5083
 
3.7%
4789
 
3.5%
Other values (440) 51273
37.1%
Latin
ValueCountFrequency (%)
B 74
29.7%
A 58
23.3%
S 23
 
9.2%
C 12
 
4.8%
L 9
 
3.6%
K 9
 
3.6%
E 7
 
2.8%
D 6
 
2.4%
R 5
 
2.0%
H 5
 
2.0%
Other values (19) 41
16.5%
Common
ValueCountFrequency (%)
47458
42.9%
1 16752
 
15.1%
2 6668
 
6.0%
0 4710
 
4.3%
) 4588
 
4.1%
( 4587
 
4.1%
3 4346
 
3.9%
4 3490
 
3.2%
5 3411
 
3.1%
6 3078
 
2.8%
Other values (9) 11488
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138353
55.5%
ASCII 110825
44.5%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
47458
42.8%
1 16752
 
15.1%
2 6668
 
6.0%
0 4710
 
4.2%
) 4588
 
4.1%
( 4587
 
4.1%
3 4346
 
3.9%
4 3490
 
3.1%
5 3411
 
3.1%
6 3078
 
2.8%
Other values (38) 11737
 
10.6%
Hangul
ValueCountFrequency (%)
20037
 
14.5%
10278
 
7.4%
10106
 
7.3%
9464
 
6.8%
7440
 
5.4%
6890
 
5.0%
6724
 
4.9%
6270
 
4.5%
5083
 
3.7%
4789
 
3.5%
Other values (439) 51272
37.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Missing values

2024-03-15T05:22:47.591994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T05:22:47.902544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-15T05:22:48.392695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

가맹점명업종명전화번호가맹점주소
2600난타5000(문흥점)패스트푸드062-262-7000광주 북구 문흥2동 1011-1번지 중흥아파트상가동 4호
9984에이원한식<NA>광주광역시 남구 효덕로 291 1동 2층 204호
3594뎅뎅이네덮밥&짜글이한식<NA>광주광역시 광산구 첨단내촌로36번길 26 1층
5284명랑시대쌀핫도그화정힐스테이트일반대중음식062-366-8596광주광역시 서구 화정로 256 1층 (화정동)
3269담은보쌈칼국수한식062-381-7734광주광역시 서구 월드컵4강로229번길 3-1 1층
15760해성정일반대중음식062-373-5919광주 서구 유덕로 17 (유촌동)
10733왕뼈사랑한식062-367-9993광주광역시 서구 무진대로 937 .
8120소문난집(찐빵 만두)한식062-953-1981광주 광산구 비아중앙로 13-1 (비아동)
12653주식회사한식062-676-3676광주광역시 남구 대남대로237번길 29-1 1층
6739북어전문점한식062-267-1782광주 북구 대천로 16 1층 (오치동)
가맹점명업종명전화번호가맹점주소
12546조아조아한식062-651-2281광주 남구 서동로 14 (서동)
4592롯데리아광주운천점패스트푸드062-385-5055광주광역시 서구 상무대로 879 1층
3705돈가네한식062-372-1833광주 서구 내방로246번길 4 (쌍촌동)
5079매곡식당한식062-527-4242광주광역시 북구 서하로 119 1층
3020다락금한식062-576-8292광주광역시 북구 송해로 36 1층
15686항아리맛집한식062-385-8266광주광역시 서구 시청서편로4번길 8 1층(치평동)
3652도야족발보쌈한식062-951-1779광주광역시 광산구 목련로153번안길 32 .
4713마라미녀중식062-972-9874광주광역시 광산구 첨단중앙로 96 118호
7680섬진강 계절음식 전문점한식062-374-4119광주광역시 서구 신촌길 20 1층
5545무등정한식062-376-9555광주 서구 상무대로 915-6 1층 (쌍촌동)

Duplicate rows

Most frequently occurring

가맹점명업종명전화번호가맹점주소# duplicates
14<NA><NA><NA><NA>4
12주식회사에이치티에이치한식062-383-1592광주광역시 서구 내방로 39-1 .3
0굽기의기술한식<NA>광주광역시 광산구 왕버들로265번길 5 1층2
1도시락에 정성을 담아한식<NA>광주광역시 북구 우치로27번길 24 1층2
2맛깔참죽운남점한식062-531-6288광주광역시 광산구 임방울대로 134 1층2
3수진초밥일식062-571-2228광주광역시 북구 첨단연신로108번길 83 (신용동)2
4순한한우 축산물 종합판매장한식062-574-4492광주광역시 북구 빛고을대로 771 1층2
5스위티움 (주)일반대중음식<NA>광주광역시 북구 용봉로 77 1층 용봉문화관2
6씨유 쌍촌성우점편의점062-382-1261광주 서구 마륵복개로 168 (쌍촌동)2
7얌샘김밥(매월점)한식062-603-0099광주광역시 서구 매월2로 53 23동 106호(광주산업용재유통센터)2