Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory390.6 KiB
Average record size in memory40.0 B

Variable types

Categorical2
Text2

Dataset

Description한국문화예술위원회에서 제공하는 문화누리카드 홈페이지입니다.
Author한국문화예술위원회
URLhttps://www.data.go.kr/data/15045194/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 07:18:05.990813
Analysis finished2023-12-12 07:18:07.639144
Duration1.65 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경기
1784 
서울
1438 
강원
756 
경북
735 
전북
652 
Other values (12)
4635 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기
2nd row강원
3rd row경북
4th row강원
5th row전남

Common Values

ValueCountFrequency (%)
경기 1784
17.8%
서울 1438
14.4%
강원 756
 
7.6%
경북 735
 
7.3%
전북 652
 
6.5%
경남 639
 
6.4%
부산 560
 
5.6%
전남 559
 
5.6%
충남 458
 
4.6%
광주 408
 
4.1%
Other values (7) 2011
20.1%

Length

2023-12-12T16:18:07.727798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 1784
17.8%
서울 1438
14.4%
강원 756
 
7.6%
경북 735
 
7.3%
전북 652
 
6.5%
경남 639
 
6.4%
부산 560
 
5.6%
전남 559
 
5.6%
충남 458
 
4.6%
광주 408
 
4.1%
Other values (7) 2011
20.1%
Distinct9579
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T16:18:08.123582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length8.4583
Min length1

Characters and Unicode

Total characters84583
Distinct characters983
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9282 ?
Unique (%)92.8%

Sample

1st row용강볼링쎈타
2nd row명성칼라
3rd row불국사
4th row㈜다다엔터테인먼트
5th row나라사진관
ValueCountFrequency (%)
삼천리자전거 384
 
2.8%
주식회사 169
 
1.2%
정보화마을 115
 
0.8%
cgv 87
 
0.6%
스튜디오 79
 
0.6%
놀숲 60
 
0.4%
메가박스 46
 
0.3%
43
 
0.3%
롯데컬처웍스㈜롯데시네마 42
 
0.3%
터미널 37
 
0.3%
Other values (10693) 12665
92.3%
2023-12-12T16:18:08.706996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3763
 
4.4%
) 2217
 
2.6%
( 2202
 
2.6%
2075
 
2.5%
1832
 
2.2%
1595
 
1.9%
1471
 
1.7%
1342
 
1.6%
1165
 
1.4%
1142
 
1.4%
Other values (973) 65779
77.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 73243
86.6%
Space Separator 3763
 
4.4%
Close Punctuation 2217
 
2.6%
Open Punctuation 2202
 
2.6%
Uppercase Letter 1807
 
2.1%
Lowercase Letter 672
 
0.8%
Decimal Number 329
 
0.4%
Other Symbol 167
 
0.2%
Other Punctuation 143
 
0.2%
Dash Punctuation 28
 
< 0.1%
Other values (2) 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2075
 
2.8%
1832
 
2.5%
1595
 
2.2%
1471
 
2.0%
1342
 
1.8%
1165
 
1.6%
1142
 
1.6%
1122
 
1.5%
951
 
1.3%
895
 
1.2%
Other values (892) 59653
81.4%
Uppercase Letter
ValueCountFrequency (%)
C 162
 
9.0%
G 158
 
8.7%
O 149
 
8.2%
S 134
 
7.4%
T 126
 
7.0%
M 114
 
6.3%
V 113
 
6.3%
B 85
 
4.7%
A 84
 
4.6%
K 74
 
4.1%
Other values (16) 608
33.6%
Lowercase Letter
ValueCountFrequency (%)
o 81
12.1%
e 76
11.3%
i 57
 
8.5%
a 56
 
8.3%
t 54
 
8.0%
l 40
 
6.0%
n 37
 
5.5%
r 37
 
5.5%
s 33
 
4.9%
d 31
 
4.6%
Other values (15) 170
25.3%
Other Punctuation
ValueCountFrequency (%)
. 49
34.3%
& 40
28.0%
, 33
23.1%
/ 7
 
4.9%
' 6
 
4.2%
: 2
 
1.4%
· 2
 
1.4%
1
 
0.7%
* 1
 
0.7%
! 1
 
0.7%
Decimal Number
ValueCountFrequency (%)
2 84
25.5%
1 82
24.9%
0 34
10.3%
4 31
 
9.4%
3 29
 
8.8%
7 21
 
6.4%
5 18
 
5.5%
9 11
 
3.3%
8 11
 
3.3%
6 8
 
2.4%
Math Symbol
ValueCountFrequency (%)
> 5
45.5%
< 5
45.5%
+ 1
 
9.1%
Space Separator
ValueCountFrequency (%)
3763
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2217
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2202
100.0%
Other Symbol
ValueCountFrequency (%)
167
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 28
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 73404
86.8%
Common 8694
 
10.3%
Latin 2479
 
2.9%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2075
 
2.8%
1832
 
2.5%
1595
 
2.2%
1471
 
2.0%
1342
 
1.8%
1165
 
1.6%
1142
 
1.6%
1122
 
1.5%
951
 
1.3%
895
 
1.2%
Other values (889) 59814
81.5%
Latin
ValueCountFrequency (%)
C 162
 
6.5%
G 158
 
6.4%
O 149
 
6.0%
S 134
 
5.4%
T 126
 
5.1%
M 114
 
4.6%
V 113
 
4.6%
B 85
 
3.4%
A 84
 
3.4%
o 81
 
3.3%
Other values (41) 1273
51.4%
Common
ValueCountFrequency (%)
3763
43.3%
) 2217
25.5%
( 2202
25.3%
2 84
 
1.0%
1 82
 
0.9%
. 49
 
0.6%
& 40
 
0.5%
0 34
 
0.4%
, 33
 
0.4%
4 31
 
0.4%
Other values (19) 159
 
1.8%
Han
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 73237
86.6%
ASCII 11170
 
13.2%
None 169
 
0.2%
CJK 6
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3763
33.7%
) 2217
19.8%
( 2202
19.7%
C 162
 
1.5%
G 158
 
1.4%
O 149
 
1.3%
S 134
 
1.2%
T 126
 
1.1%
M 114
 
1.0%
V 113
 
1.0%
Other values (68) 2032
18.2%
Hangul
ValueCountFrequency (%)
2075
 
2.8%
1832
 
2.5%
1595
 
2.2%
1471
 
2.0%
1342
 
1.8%
1165
 
1.6%
1142
 
1.6%
1122
 
1.5%
951
 
1.3%
895
 
1.2%
Other values (888) 59647
81.4%
None
ValueCountFrequency (%)
167
98.8%
· 2
 
1.2%
CJK
ValueCountFrequency (%)
2
33.3%
2
33.3%
1
16.7%
1
16.7%
Punctuation
ValueCountFrequency (%)
1
100.0%

분류
Categorical

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
미술
1777 
숙박
1621 
도서
1574 
체육시설
1061 
체육용품
817 
Other values (8)
3150 

Length

Max length5
Median length2
Mean length2.7503
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row체육시설
2nd row미술
3rd row관광지
4th row공연
5th row미술

Common Values

ValueCountFrequency (%)
미술 1777
17.8%
숙박 1621
16.2%
도서 1574
15.7%
체육시설 1061
10.6%
체육용품 817
8.2%
문화체험 692
 
6.9%
교통수단 626
 
6.3%
관광지 584
 
5.8%
여행사 473
 
4.7%
영상 287
 
2.9%
Other values (3) 488
 
4.9%

Length

2023-12-12T16:18:08.861451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
미술 1777
17.8%
숙박 1621
16.2%
도서 1574
15.7%
체육시설 1061
10.6%
체육용품 817
8.2%
문화체험 692
 
6.9%
교통수단 626
 
6.3%
관광지 584
 
5.8%
여행사 473
 
4.7%
영상 287
 
2.9%
Other values (3) 488
 
4.9%

주소
Text

Distinct9950
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T16:18:09.221444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length53
Mean length24.1634
Min length11

Characters and Unicode

Total characters241634
Distinct characters807
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9904 ?
Unique (%)99.0%

Sample

1st row경기 안성시 죽산면 죽산리 435-2 301
2nd row강원도 속초시 중앙로 148-2
3rd row경상북도 경주시 불국로 385 불국사
4th row강원도 춘천시 금강로68-12 엠백화첨 별관4층 (조양동)
5th row전남 영광군 영광읍 물무로 108
ValueCountFrequency (%)
경기 997
 
1.9%
경기도 791
 
1.5%
서울 790
 
1.5%
1층 646
 
1.2%
서울특별시 644
 
1.2%
경북 527
 
1.0%
강원 480
 
0.9%
2층 463
 
0.9%
경남 435
 
0.8%
전북 411
 
0.8%
Other values (15670) 46676
88.3%
2023-12-12T16:18:09.815191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
55002
22.8%
1 9697
 
4.0%
7508
 
3.1%
7036
 
2.9%
2 6068
 
2.5%
5923
 
2.5%
4560
 
1.9%
3 4441
 
1.8%
3663
 
1.5%
4 3592
 
1.5%
Other values (797) 134144
55.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138329
57.2%
Space Separator 55002
 
22.8%
Decimal Number 40885
 
16.9%
Dash Punctuation 2935
 
1.2%
Open Punctuation 1554
 
0.6%
Close Punctuation 1552
 
0.6%
Other Punctuation 701
 
0.3%
Uppercase Letter 520
 
0.2%
Lowercase Letter 75
 
< 0.1%
Math Symbol 74
 
< 0.1%
Other values (2) 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7508
 
5.4%
7036
 
5.1%
5923
 
4.3%
4560
 
3.3%
3663
 
2.6%
3362
 
2.4%
3094
 
2.2%
3019
 
2.2%
2879
 
2.1%
2788
 
2.0%
Other values (726) 94497
68.3%
Uppercase Letter
ValueCountFrequency (%)
B 123
23.7%
A 56
10.8%
C 50
9.6%
S 38
 
7.3%
G 35
 
6.7%
F 27
 
5.2%
K 24
 
4.6%
L 19
 
3.7%
E 16
 
3.1%
V 14
 
2.7%
Other values (16) 118
22.7%
Lowercase Letter
ValueCountFrequency (%)
e 16
21.3%
s 8
10.7%
a 7
9.3%
l 6
 
8.0%
t 6
 
8.0%
b 5
 
6.7%
k 4
 
5.3%
o 3
 
4.0%
u 3
 
4.0%
n 3
 
4.0%
Other values (9) 14
18.7%
Decimal Number
ValueCountFrequency (%)
1 9697
23.7%
2 6068
14.8%
3 4441
10.9%
4 3592
 
8.8%
0 3563
 
8.7%
5 3310
 
8.1%
6 2913
 
7.1%
7 2667
 
6.5%
8 2355
 
5.8%
9 2279
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 642
91.6%
. 38
 
5.4%
/ 7
 
1.0%
& 5
 
0.7%
· 5
 
0.7%
: 3
 
0.4%
@ 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 73
98.6%
+ 1
 
1.4%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Space Separator
ValueCountFrequency (%)
55002
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2935
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1554
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1552
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138332
57.2%
Common 102703
42.5%
Latin 598
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7508
 
5.4%
7036
 
5.1%
5923
 
4.3%
4560
 
3.3%
3663
 
2.6%
3362
 
2.4%
3094
 
2.2%
3019
 
2.2%
2879
 
2.1%
2788
 
2.0%
Other values (726) 94500
68.3%
Latin
ValueCountFrequency (%)
B 123
20.6%
A 56
 
9.4%
C 50
 
8.4%
S 38
 
6.4%
G 35
 
5.9%
F 27
 
4.5%
K 24
 
4.0%
L 19
 
3.2%
e 16
 
2.7%
E 16
 
2.7%
Other values (37) 194
32.4%
Common
ValueCountFrequency (%)
55002
53.6%
1 9697
 
9.4%
2 6068
 
5.9%
3 4441
 
4.3%
4 3592
 
3.5%
0 3563
 
3.5%
5 3310
 
3.2%
- 2935
 
2.9%
6 2913
 
2.8%
7 2667
 
2.6%
Other values (13) 8515
 
8.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 138328
57.2%
ASCII 103293
42.7%
None 9
 
< 0.1%
Number Forms 3
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
55002
53.2%
1 9697
 
9.4%
2 6068
 
5.9%
3 4441
 
4.3%
4 3592
 
3.5%
0 3563
 
3.4%
5 3310
 
3.2%
- 2935
 
2.8%
6 2913
 
2.8%
7 2667
 
2.6%
Other values (57) 9105
 
8.8%
Hangul
ValueCountFrequency (%)
7508
 
5.4%
7036
 
5.1%
5923
 
4.3%
4560
 
3.3%
3663
 
2.6%
3362
 
2.4%
3094
 
2.2%
3019
 
2.2%
2879
 
2.1%
2788
 
2.0%
Other values (725) 94496
68.3%
None
ValueCountFrequency (%)
· 5
55.6%
4
44.4%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
100.0%

Correlations

2023-12-12T16:18:09.921576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역분류
지역1.0000.286
분류0.2861.000
2023-12-12T16:18:10.004337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역분류
지역1.0000.104
분류0.1041.000
2023-12-12T16:18:10.098069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역분류
지역1.0000.104
분류0.1041.000

Missing values

2023-12-12T16:18:07.215989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:18:07.584508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역가맹점명분류주소
8260경기용강볼링쎈타체육시설경기 안성시 죽산면 죽산리 435-2 301
12667강원명성칼라미술강원도 속초시 중앙로 148-2
20576경북불국사관광지경상북도 경주시 불국로 385 불국사
10085강원㈜다다엔터테인먼트공연강원도 춘천시 금강로68-12 엠백화첨 별관4층 (조양동)
9593전남나라사진관미술전남 영광군 영광읍 물무로 108
19383인천CGV 인천도화영상인천광역시 미추홀구 숙골로88번길 12, 지하1층(도화동)
289부산조광포토피아미술부산 동구 범일동 830-269 3층
1966부산맘모스사진관미술부산 사하구 다대동 120-1
10162서울포토바이홍대점미술서울 마포구 와우산로27길 50 (서교동)
14597세종이도스포츠(이도스포츠산업)체육용품세종특별자치시 나성동 한누리대로 312 108호
지역가맹점명분류주소
5187경기가평역교통수단경기 가평군 가평읍 달전리 567번지
8746경남젠모텔숙박경남 창원시 마산회원구 양덕북18길 34
23355경남한복랑문화체험경상남도 사천시 주공로 25 지산왕수학교실
13673경남고현터미널교통수단경남 거제시 고현동 979-2 고현터미널
22641경남마운틴헬스클럽체육시설경상남도 양산시 덕계로 91 부산은행 4층
106전북풍남관광호텔숙박전북 전주시 완산구 전주객사2길 45-7
6811경북경주시근로자종합복지관문화체험경상북도 경주시 현곡면 용담로 423-23
14248경기동탄서점도서경기도 화성시 동탄순환대로 127-9 우성애비뉴타워 2층 동탄서점
19792광주심포니악기사음악광주광역시 남구 봉선로 127 솔뫼 REX-K1 310호, 311호
5362강원강릉 시외버스 터미널교통수단강원 강릉시 홍제동 992-1

Duplicate rows

Most frequently occurring

지역가맹점명분류주소# duplicates
0전남순천 북부 정류소교통수단전남 순천시 중앙로 1482