Overview

Dataset statistics

Number of variables: 3
Number of observations: 2954
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 28
Duplicate rows (%): 0.9%
Total size in memory: 69.4 KiB
Average record size in memory: 24.0 B
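
The percentage and per-record figures above follow from the raw counts. The short check below is only a sketch that reuses the rounded values printed in this report, so the last digit may differ slightly from what the profiler computed from exact byte counts.

```python
# Figures copied from the dataset statistics in this report.
n_rows = 2954
n_duplicate_rows = 28
total_size_kib = 69.4  # already rounded in the report

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```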

Variable types

Text: 3

Dataset

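Description: Data on meal card (급식카드) affiliated merchants provided by Ulju-gun, Ulsan Metropolitan City (울산광역시 울주군), including merchant name, address, and telephone number.
Author: Ulju-gun, Ulsan Metropolitan City (울산광역시 울주군)
URL: https://www.data.go.kr/data/15036697/fileData.do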

Alerts

Dataset has 28 (0.9%) duplicate rows [Duplicates]

Reproduction

Analysis started: 2023-12-12 21:26:33.364768
Analysis finished: 2023-12-12 21:26:34.114861
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

가맹점명 (merchant name)
Text

Distinct: 2755
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 23.2 KiB

Length

Max length: 23
Median length: 20
Mean length: 6.5287745
Min length: 2
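
As a rough illustration of how such length statistics can be derived, the sketch below measures string length in Unicode code points for the five sample values this report lists for the column. It is not the profiler's own implementation, and five rows will not reproduce the full-column figures.

```python
import unicodedata
from statistics import mean, median

# The five sample values listed for this column in the report.
names = [
    "반올림피자샵 울산 선암청량점",
    "황성주두유 범서대리점",
    "카페051 언양교동점",
    "주식회사 일곱달팽이",
    "영꼬치엔칭따오",
]

# Normalize to NFC so each Hangul syllable counts as one code point.
lengths = [len(unicodedata.normalize("NFC", name)) for name in names]

print("Max length:", max(lengths))
print("Median length:", median(lengths))
print("Mean length:", mean(lengths))
print("Min length:", min(lengths))
```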

Characters and Unicode

Total characters: 19286
Distinct characters: 736
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
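
A minimal sketch of this kind of analysis, using Python's standard `unicodedata` module to count general categories. The input string is a hypothetical merchant name, and the mapping from two-letter codes to the long names used in this report covers only the categories that occur in the example.

```python
import unicodedata
from collections import Counter

# Hypothetical value, modelled on the merchant names in this dataset.
text = unicodedata.normalize("NFC", "지에스(gs)25 언양점")

# Long names for the general-category codes that appear in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

# unicodedata.category returns the two-letter general category per code point.
counts = Counter(unicodedata.category(ch) for ch in text)

for code, count in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```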

Unique

Unique: 2594
Unique (%): 87.8%
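
The report lists "Distinct" and "Unique" separately. Assuming "Distinct" means the number of different values and "Unique" means the number of values that occur exactly once (which is consistent with Unique being the smaller figure here), the difference can be sketched as follows with hypothetical values.

```python
from collections import Counter

# Hypothetical column values; "a-mart" appears twice.
values = ["a-mart", "b-cafe", "a-mart", "c-bakery"]

counts = Counter(values)

# Distinct: how many different values exist.
n_distinct = len(counts)

# Unique: how many values occur exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print("Distinct:", n_distinct)
print("Unique:", n_unique)
```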

Sample

1st row: 반올림피자샵 울산 선암청량점
2nd row: 황성주두유 범서대리점
3rd row: 카페051 언양교동점
4th row: 주식회사 일곱달팽이
5th row: 영꼬치엔칭따오
Value | Count | Frequency (%)
언양점 | 29 | 0.9%
이마트24 | 27 | 0.8%
덕신점 | 16 | 0.5%
울산언양점 | 15 | 0.4%
지에스(gs)25 | 14 | 0.4%
천상점 | 11 | 0.3%
세븐일레븐 | 9 | 0.3%
남창점 | 9 | 0.3%
범서점 | 8 | 0.2%
구영점 | 8 | 0.2%
Other values (2835) | 3206 | 95.6%
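
A word-frequency table like the one above can be approximated by splitting each value on whitespace and counting the tokens. The sketch below assumes lowercase, whitespace-delimited tokens (the lowercase `gs` in the table suggests lowercasing, but the profiler's exact tokenization is not confirmed here), and the input names are hypothetical.

```python
from collections import Counter

# Hypothetical merchant names for illustration.
names = ["이마트24 언양점", "세븐일레븐 언양점", "GS25 덕신점"]

# Split on whitespace and lowercase each token before counting.
words = Counter(word.lower() for name in names for word in name.split())
total = sum(words.values())

for word, count in words.most_common(3):
    print(f"{word} | {count} | {100 * count / total:.1f}%")
```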

Most occurring characters

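Value | Count | Frequency (%)
(not shown) | 663 | 3.4%
(not shown) | 403 | 2.1%
(space) | 398 | 2.1%
(not shown) | 318 | 1.6%
(not shown) | 313 | 1.6%
(not shown) | 301 | 1.6%
(not shown) | 299 | 1.6%
(not shown) | 269 | 1.4%
(not shown) | 266 | 1.4%
(not shown) | 249 | 1.3%
Other values (726) | 15807 | 82.0%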

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 17765 | 92.1%
Space Separator | 398 | 2.1%
Decimal Number | 333 | 1.7%
Uppercase Letter | 296 | 1.5%
Open Punctuation | 175 | 0.9%
Close Punctuation | 175 | 0.9%
Lowercase Letter | 80 | 0.4%
Other Punctuation | 59 | 0.3%
Math Symbol | 3 | < 0.1%
Connector Punctuation | 1 | < 0.1%

Most frequent character per category

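Other Letter
Value | Count | Frequency (%)
(not shown) | 663 | 3.7%
(not shown) | 403 | 2.3%
(not shown) | 318 | 1.8%
(not shown) | 313 | 1.8%
(not shown) | 301 | 1.7%
(not shown) | 299 | 1.7%
(not shown) | 269 | 1.5%
(not shown) | 266 | 1.5%
(not shown) | 249 | 1.4%
(not shown) | 230 | 1.3%
Other values (658) | 14454 | 81.4%
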
Uppercase Letter
Value | Count | Frequency (%)
S | 69 | 23.3%
G | 63 | 21.3%
C | 32 | 10.8%
U | 22 | 7.4%
B | 19 | 6.4%
H | 11 | 3.7%
O | 11 | 3.7%
I | 9 | 3.0%
R | 8 | 2.7%
T | 7 | 2.4%
Other values (14) | 45 | 15.2%

Lowercase Letter
Value | Count | Frequency (%)
a | 12 | 15.0%
e | 10 | 12.5%
n | 8 | 10.0%
l | 6 | 7.5%
t | 5 | 6.2%
s | 5 | 6.2%
o | 4 | 5.0%
m | 3 | 3.8%
i | 3 | 3.8%
y | 3 | 3.8%
Other values (11) | 21 | 26.2%

Decimal Number
Value | Count | Frequency (%)
2 | 120 | 36.0%
5 | 81 | 24.3%
4 | 42 | 12.6%
1 | 25 | 7.5%
0 | 25 | 7.5%
9 | 13 | 3.9%
3 | 11 | 3.3%
6 | 9 | 2.7%
7 | 6 | 1.8%
8 | 1 | 0.3%

Other Punctuation
Value | Count | Frequency (%)
& | 17 | 28.8%
/ | 17 | 28.8%
. | 12 | 20.3%
, | 8 | 13.6%
! | 3 | 5.1%
' | 2 | 3.4%

Math Symbol
Value | Count | Frequency (%)
+ | 2 | 66.7%
~ | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 398 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 175 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 175 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 17762 | 92.1%
Common | 1145 | 5.9%
Latin | 376 | 1.9%
Han | 3 | < 0.1%

Most frequent character per script

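Hangul
Value | Count | Frequency (%)
(not shown) | 663 | 3.7%
(not shown) | 403 | 2.3%
(not shown) | 318 | 1.8%
(not shown) | 313 | 1.8%
(not shown) | 301 | 1.7%
(not shown) | 299 | 1.7%
(not shown) | 269 | 1.5%
(not shown) | 266 | 1.5%
(not shown) | 249 | 1.4%
(not shown) | 230 | 1.3%
Other values (656) | 14451 | 81.4%

Latin
Value | Count | Frequency (%)
S | 69 | 18.4%
G | 63 | 16.8%
C | 32 | 8.5%
U | 22 | 5.9%
B | 19 | 5.1%
a | 12 | 3.2%
H | 11 | 2.9%
O | 11 | 2.9%
e | 10 | 2.7%
I | 9 | 2.4%
Other values (35) | 118 | 31.4%

Common
Value | Count | Frequency (%)
(space) | 398 | 34.8%
( | 175 | 15.3%
) | 175 | 15.3%
2 | 120 | 10.5%
5 | 81 | 7.1%
4 | 42 | 3.7%
1 | 25 | 2.2%
0 | 25 | 2.2%
& | 17 | 1.5%
/ | 17 | 1.5%
Other values (13) | 70 | 6.1%

Han
Value | Count | Frequency (%)
(not shown) | 2 | 66.7%
(not shown) | 1 | 33.3%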

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 17762 | 92.1%
ASCII | 1521 | 7.9%
CJK | 3 | < 0.1%

Most frequent character per block

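Hangul
Value | Count | Frequency (%)
(not shown) | 663 | 3.7%
(not shown) | 403 | 2.3%
(not shown) | 318 | 1.8%
(not shown) | 313 | 1.8%
(not shown) | 301 | 1.7%
(not shown) | 299 | 1.7%
(not shown) | 269 | 1.5%
(not shown) | 266 | 1.5%
(not shown) | 249 | 1.4%
(not shown) | 230 | 1.3%
Other values (656) | 14451 | 81.4%

ASCII
Value | Count | Frequency (%)
(space) | 398 | 26.2%
( | 175 | 11.5%
) | 175 | 11.5%
2 | 120 | 7.9%
5 | 81 | 5.3%
S | 69 | 4.5%
G | 63 | 4.1%
4 | 42 | 2.8%
C | 32 | 2.1%
1 | 25 | 1.6%
Other values (58) | 341 | 22.4%

CJK
Value | Count | Frequency (%)
(not shown) | 2 | 66.7%
(not shown) | 1 | 33.3%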

주소 (address)
Text

Distinct: 2801
Distinct (%): 94.8%
Missing: 0
Missing (%): 0.0%
Memory size: 23.2 KiB

Length

Max length: 65
Median length: 51
Mean length: 21.900474
Min length: 15

Characters and Unicode

Total characters: 64694
Distinct characters: 345
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2671
Unique (%): 90.4%

Sample

1st row: 울산 울주군 청량읍 삼정로 811 , B1층 101호 (울산덕하신일해피트리)
2nd row: 울산 울주군 언양읍 반천강변길 51 102동 1108호 (현대아파트)
3rd row: 울산 울주군 삼남읍 남상평3길 50 1층
4th row: 울산 울주군 서생면 나사해안길 109 2동
5th row: 울산 울주군 온산읍 덕남로 47 1층
Value | Count | Frequency (%)
울산 | 2954 | 17.9%
울주군 | 2954 | 17.9%
1층 | 832 | 5.0%
범서읍 | 553 | 3.4%
언양읍 | 445 | 2.7%
온산읍 | 420 | 2.5%
온양읍 | 326 | 2.0%
삼남면 | 252 | 1.5%
서생면 | 197 | 1.2%
상북면 | 177 | 1.1%
Other values (2136) | 7380 | 44.8%

Most occurring characters

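Value | Count | Frequency (%)
(space) | 13536 | 20.9%
(not shown) | 5956 | 9.2%
(not shown) | 3522 | 5.4%
1 | 3400 | 5.3%
(not shown) | 2970 | 4.6%
(not shown) | 2954 | 4.6%
(not shown) | 1950 | 3.0%
(not shown) | 1582 | 2.4%
2 | 1506 | 2.3%
3 | 1136 | 1.8%
Other values (335) | 26182 | 40.5%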

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 37603 | 58.1%
Space Separator | 13536 | 20.9%
Decimal Number | 11307 | 17.5%
Dash Punctuation | 909 | 1.4%
Other Punctuation | 711 | 1.1%
Open Punctuation | 237 | 0.4%
Close Punctuation | 236 | 0.4%
Uppercase Letter | 154 | 0.2%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 5956 | 15.8%
(not shown) | 3522 | 9.4%
(not shown) | 2970 | 7.9%
(not shown) | 2954 | 7.9%
(not shown) | 1950 | 5.2%
(not shown) | 1582 | 4.2%
(not shown) | 1026 | 2.7%
(not shown) | 946 | 2.5%
(not shown) | 946 | 2.5%
(not shown) | 934 | 2.5%
Other values (303) | 14817 | 39.4%

Uppercase Letter
Value | Count | Frequency (%)
B | 66 | 42.9%
A | 34 | 22.1%
L | 24 | 15.6%
N | 10 | 6.5%
G | 4 | 2.6%
S | 3 | 1.9%
D | 3 | 1.9%
M | 2 | 1.3%
P | 2 | 1.3%
E | 2 | 1.3%
Other values (3) | 4 | 2.6%

Decimal Number
Value | Count | Frequency (%)
1 | 3400 | 30.1%
2 | 1506 | 13.3%
3 | 1136 | 10.0%
0 | 880 | 7.8%
4 | 860 | 7.6%
5 | 847 | 7.5%
7 | 717 | 6.3%
6 | 702 | 6.2%
8 | 674 | 6.0%
9 | 585 | 5.2%

Other Punctuation
Value | Count | Frequency (%)
, | 703 | 98.9%
. | 5 | 0.7%
/ | 2 | 0.3%
& | 1 | 0.1%

Space Separator
Value | Count | Frequency (%)
(space) | 13536 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 909 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 237 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 236 | 100.0%

Lowercase Letter
Value | Count | Frequency (%)
b | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 37603 | 58.1%
Common | 26936 | 41.6%
Latin | 155 | 0.2%

Most frequent character per script

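Hangul
Value | Count | Frequency (%)
(not shown) | 5956 | 15.8%
(not shown) | 3522 | 9.4%
(not shown) | 2970 | 7.9%
(not shown) | 2954 | 7.9%
(not shown) | 1950 | 5.2%
(not shown) | 1582 | 4.2%
(not shown) | 1026 | 2.7%
(not shown) | 946 | 2.5%
(not shown) | 946 | 2.5%
(not shown) | 934 | 2.5%
Other values (303) | 14817 | 39.4%

Common
Value | Count | Frequency (%)
(space) | 13536 | 50.3%
1 | 3400 | 12.6%
2 | 1506 | 5.6%
3 | 1136 | 4.2%
- | 909 | 3.4%
0 | 880 | 3.3%
4 | 860 | 3.2%
5 | 847 | 3.1%
7 | 717 | 2.7%
, | 703 | 2.6%
Other values (8) | 2442 | 9.1%

Latin
Value | Count | Frequency (%)
B | 66 | 42.6%
A | 34 | 21.9%
L | 24 | 15.5%
N | 10 | 6.5%
G | 4 | 2.6%
S | 3 | 1.9%
D | 3 | 1.9%
M | 2 | 1.3%
P | 2 | 1.3%
E | 2 | 1.3%
Other values (4) | 5 | 3.2%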

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 37603 | 58.1%
ASCII | 27091 | 41.9%

Most frequent character per block

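ASCII
Value | Count | Frequency (%)
(space) | 13536 | 50.0%
1 | 3400 | 12.6%
2 | 1506 | 5.6%
3 | 1136 | 4.2%
- | 909 | 3.4%
0 | 880 | 3.2%
4 | 860 | 3.2%
5 | 847 | 3.1%
7 | 717 | 2.6%
, | 703 | 2.6%
Other values (22) | 2597 | 9.6%

Hangul
Value | Count | Frequency (%)
(not shown) | 5956 | 15.8%
(not shown) | 3522 | 9.4%
(not shown) | 2970 | 7.9%
(not shown) | 2954 | 7.9%
(not shown) | 1950 | 5.2%
(not shown) | 1582 | 4.2%
(not shown) | 1026 | 2.7%
(not shown) | 946 | 2.5%
(not shown) | 946 | 2.5%
(not shown) | 934 | 2.5%
Other values (303) | 14817 | 39.4%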

전화번호 (phone number)
Text

Distinct: 2370
Distinct (%): 80.2%
Missing: 0
Missing (%): 0.0%
Memory size: 23.2 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.046378
Min length: 11

Characters and Unicode

Total characters: 35585
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2247
Unique (%): 76.1%

Sample

1st row: 052-227-3733
2nd row: 052-000-0000
3rd row: 052-111-1111
4th row: 052-700-8468
5th row: 070-4220-6848
Value | Count | Frequency (%)
02-0000-0000 | 149 | 5.0%
052-000-0000 | 98 | 3.3%
052-111-1111 | 79 | 2.7%
052-1111-1111 | 31 | 1.0%
052-1577-0711 | 28 | 0.9%
02-1577-8007 | 27 | 0.9%
080-080-3663 | 17 | 0.6%
052-0000-0000 | 13 | 0.4%
051-643-0607 | 9 | 0.3%
052-1577-9621 | 8 | 0.3%
Other values (2360) | 2495 | 84.5%

Most occurring characters

Value | Count | Frequency (%)
2 | 6826 | 19.2%
0 | 6591 | 18.5%
- | 5908 | 16.6%
5 | 4263 | 12.0%
1 | 2299 | 6.5%
3 | 1928 | 5.4%
8 | 1601 | 4.5%
4 | 1591 | 4.5%
7 | 1580 | 4.4%
6 | 1577 | 4.4%
Other values (2) | 1421 | 4.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 29676 | 83.4%
Dash Punctuation | 5908 | 16.6%
Modifier Symbol | 1 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 6826 | 23.0%
0 | 6591 | 22.2%
5 | 4263 | 14.4%
1 | 2299 | 7.7%
3 | 1928 | 6.5%
8 | 1601 | 5.4%
4 | 1591 | 5.4%
7 | 1580 | 5.3%
6 | 1577 | 5.3%
9 | 1420 | 4.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 5908 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 35585 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 6826 | 19.2%
0 | 6591 | 18.5%
- | 5908 | 16.6%
5 | 4263 | 12.0%
1 | 2299 | 6.5%
3 | 1928 | 5.4%
8 | 1601 | 4.5%
4 | 1591 | 4.5%
7 | 1580 | 4.4%
6 | 1577 | 4.4%
Other values (2) | 1421 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 35585 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 6826 | 19.2%
0 | 6591 | 18.5%
- | 5908 | 16.6%
5 | 4263 | 12.0%
1 | 2299 | 6.5%
3 | 1928 | 5.4%
8 | 1601 | 4.5%
4 | 1591 | 4.5%
7 | 1580 | 4.4%
6 | 1577 | 4.4%
Other values (2) | 1421 | 4.0%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
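
The per-column nullity that these plots visualize can be sketched as a simple count of missing values per column. The rows below are hypothetical, and treating both `None` and the empty string as missing is an assumption made for this example; in this dataset every column reports 0 missing values.

```python
# Column names as they appear in this report.
columns = ["가맹점명", "주소", "전화번호"]

# Hypothetical rows; the second one lacks a phone number.
rows = [
    {"가맹점명": "A마트", "주소": "울산 울주군 범서읍", "전화번호": "052-000-0000"},
    {"가맹점명": "B식당", "주소": "울산 울주군 언양읍", "전화번호": None},
    {"가맹점명": "C카페", "주소": "", "전화번호": "052-111-1111"},
]

# Assumption: None and "" both count as missing.
missing = {
    col: sum(1 for row in rows if row.get(col) in (None, ""))
    for col in columns
}

for col, count in missing.items():
    print(f"{col}: {count} missing ({100 * count / len(rows):.1f}%)")
```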

Sample

First rows

Index | 가맹점명 | 주소 | 전화번호
0 | 반올림피자샵 울산 선암청량점 | 울산 울주군 청량읍 삼정로 811 , B1층 101호 (울산덕하신일해피트리) | 052-227-3733
1 | 황성주두유 범서대리점 | 울산 울주군 언양읍 반천강변길 51 102동 1108호 (현대아파트) | 052-000-0000
2 | 카페051 언양교동점 | 울산 울주군 삼남읍 남상평3길 50 1층 | 052-111-1111
3 | 주식회사 일곱달팽이 | 울산 울주군 서생면 나사해안길 109 2동 | 052-700-8468
4 | 영꼬치엔칭따오 | 울산 울주군 온산읍 덕남로 47 1층 | 070-4220-6848
5 | 씨스페이스 비클래시간절곶점 | 울산 울주군 서생면 평동1길 12-3 | 052-1111-1111
6 | 다있다무인아이스크림24시 | 울산 울주군 삼남읍 울산역로 274 , 301동 212호 (울산역 신도시 동문굿모닝힐) | 052-111-1111
7 | 피자야 | 울산 울주군 범서읍 굴화길 64 | 052-277-3083
8 | 안녕,덕하리 | 울산 울주군 청량읍 상남길 75 , 1층 | 051-305-3302
9 | 자가제빵 선명희피자 울산언양점 | 울산 울주군 언양읍 헌양길 38 | 052-254-4777

Last rows

Index | 가맹점명 | 주소 | 전화번호
2944 | 세븐일레븐울산울주대복점 | 울산 울주군 웅촌면 삼동로 1620 , A동 1층 | 052-1577-0711
2945 | 씨유온양스타힐스점 | 울산 울주군 온양읍 대안리 173-10번지 | 080-080-3663
2946 | 청량농업협동조합 | 울산 울주군 청량면 상남리 586번지 | 052-268-6994
2947 | 서생농업협동조합 | 울산 울주군 서생면 해맞이로 871 | 052-239-2264
2948 | 윈한솔마트 | 울산 울주군 언양읍 북문8길 18 | 052-249-5151
2949 | 서생농업협동조합 | 울산 울주군 서생면 해맞이로 871 | 052-239-2264
2950 | 온양농업협동조합 | 울산 울주군 온양읍 남창3길 11 | 052-238-4411
2951 | 울산원예농협율리사업소 | 울산 울주군 청량면 율리98-3 | 052-224-7210
2952 | 온산농업협동조합 | 울산 울주군 온산읍 덕신리 230-10 | 052-238-2311
2953 | 성동슈퍼 | 울산 울주군 서생면 진하리 82-3번지 | 052-239-9024

Duplicate rows

Most frequently occurring

Index | 가맹점명 | 주소 | 전화번호 | # duplicates
9 | 서생농업협동조합 | 울산 울주군 서생면 해맞이로 871 | 052-239-2264 | 4
1 | 가마솥시골옛날통닭 | 울산 울주군 온산읍 신경10길 8 | 052-239-9899 | 3
19 | 유명천지한우숯불구이 | 울산 울주군 두서면 활천리 290-5번지 | 052-262-7183 | 3
0 | BHC덕신점 | 울산 울주군 온산읍 덕신2길 14 | 052-238-8252 | 2
2 | 궁중떡집 | 울산 울주군 언양읍 장터2길 27 | 052-254-8987 | 2
3 | 대관령식당 | 울산 울주군 온산읍 덕신리 1285-4번지 | 052-238-6424 | 2
4 | 대복농원식당 | 울산 울주군 웅촌면 대복리 442-3번지 | 052-268-0486 | 2
5 | 라라김밥 | 울산 울주군 웅촌면 곡천동문길 9 ,1층 | 052-225-0909 | 2
6 | 맛있는김밥 | 울산 울주군 온양읍 태화8길 57 | 052-238-0277 | 2
7 | 본향 | 울산 울주군 범서읍 대방골길 17-12 | 052-211-5129 | 2
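
A duplicate-row listing of this kind can be sketched by counting identical (가맹점명, 주소, 전화번호) tuples and keeping those that occur more than once. The three rows below are taken from the last rows shown in this report; the approach is an illustration, not necessarily how the profiler computes its figures.

```python
from collections import Counter

# Rows 2947 to 2949 from the sample above; the first and third are identical.
rows = [
    ("서생농업협동조합", "울산 울주군 서생면 해맞이로 871", "052-239-2264"),
    ("윈한솔마트", "울산 울주군 언양읍 북문8길 18", "052-249-5151"),
    ("서생농업협동조합", "울산 울주군 서생면 해맞이로 871", "052-239-2264"),
]

# Count how often each full row occurs, then keep rows seen more than once.
row_counts = Counter(rows)
duplicated = {row: n for row, n in row_counts.items() if n > 1}

# Print in descending order of occurrence, like the table above.
for row, n in sorted(duplicated.items(), key=lambda item: -item[1]):
    print(" | ".join(row), "|", n)
```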