Overview

Dataset statistics

Number of variables5
Number of observations79
Missing cells49
Missing cells (%)12.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.2 KiB
Average record size in memory41.7 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description충청남도 공주시 문화유통업현황에 대한 데이터로 (구분, 상호, 영업소소재지, 전화번호) 등의 항목을 제공합니다,
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=419&beforeMenuCd=DOM_000000201001001000&publicdatapk=3084499

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 49 (62.0%) missing valuesMissing
상호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 23:04:02.634936
Analysis finished2024-01-09 23:04:03.083592
Duration0.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct3
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size764.0 B
노래연습장
55 
인터넷게임시설제공업
23 
영화관
 
1

Length

Max length10
Median length5
Mean length6.4303797
Min length3

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row인터넷게임시설제공업
2nd row인터넷게임시설제공업
3rd row인터넷게임시설제공업
4th row인터넷게임시설제공업
5th row인터넷게임시설제공업

Common Values

ValueCountFrequency (%)
노래연습장 55
69.6%
인터넷게임시설제공업 23
29.1%
영화관 1
 
1.3%

Length

2024-01-10T08:04:03.157607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T08:04:03.259619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노래연습장 55
69.6%
인터넷게임시설제공업 23
29.1%
영화관 1
 
1.3%

상호
Text

UNIQUE 

Distinct79
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size764.0 B
2024-01-10T08:04:03.501202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length7.8860759
Min length4

Characters and Unicode

Total characters623
Distinct characters165
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique79 ?
Unique (%)100.0%

Sample

1st rowLIBERTY PC방
2nd row슈퍼맨PC방
3rd row컴스쿨pc방
4th row게임존pc방
5th rowEONpc방
ValueCountFrequency (%)
노래연습장 12
 
10.9%
pc방 4
 
3.6%
pc 3
 
2.7%
슈퍼스타k 2
 
1.8%
슈퍼스타 1
 
0.9%
쏠래 1
 
0.9%
21세기 1
 
0.9%
아우성노래연습장 1
 
0.9%
궁노래연습장 1
 
0.9%
볼가리노래연습장 1
 
0.9%
Other values (83) 83
75.5%
2024-01-10T08:04:03.894643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54
 
8.7%
54
 
8.7%
50
 
8.0%
48
 
7.7%
48
 
7.7%
31
 
5.0%
C 18
 
2.9%
P 17
 
2.7%
16
 
2.6%
8
 
1.3%
Other values (155) 279
44.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 495
79.5%
Uppercase Letter 59
 
9.5%
Space Separator 31
 
5.0%
Lowercase Letter 26
 
4.2%
Decimal Number 8
 
1.3%
Other Punctuation 4
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
54
 
10.9%
54
 
10.9%
50
 
10.1%
48
 
9.7%
48
 
9.7%
16
 
3.2%
8
 
1.6%
6
 
1.2%
6
 
1.2%
6
 
1.2%
Other values (124) 199
40.2%
Uppercase Letter
ValueCountFrequency (%)
C 18
30.5%
P 17
28.8%
L 4
 
6.8%
K 3
 
5.1%
S 3
 
5.1%
E 3
 
5.1%
I 2
 
3.4%
G 1
 
1.7%
R 1
 
1.7%
T 1
 
1.7%
Other values (6) 6
 
10.2%
Lowercase Letter
ValueCountFrequency (%)
c 7
26.9%
p 6
23.1%
n 3
11.5%
o 3
11.5%
i 2
 
7.7%
e 2
 
7.7%
a 1
 
3.8%
f 1
 
3.8%
t 1
 
3.8%
Decimal Number
ValueCountFrequency (%)
2 5
62.5%
1 2
 
25.0%
3 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
& 3
75.0%
. 1
 
25.0%
Space Separator
ValueCountFrequency (%)
31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 495
79.5%
Latin 85
 
13.6%
Common 43
 
6.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
54
 
10.9%
54
 
10.9%
50
 
10.1%
48
 
9.7%
48
 
9.7%
16
 
3.2%
8
 
1.6%
6
 
1.2%
6
 
1.2%
6
 
1.2%
Other values (124) 199
40.2%
Latin
ValueCountFrequency (%)
C 18
21.2%
P 17
20.0%
c 7
 
8.2%
p 6
 
7.1%
L 4
 
4.7%
n 3
 
3.5%
o 3
 
3.5%
K 3
 
3.5%
S 3
 
3.5%
E 3
 
3.5%
Other values (15) 18
21.2%
Common
ValueCountFrequency (%)
31
72.1%
2 5
 
11.6%
& 3
 
7.0%
1 2
 
4.7%
3 1
 
2.3%
. 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 495
79.5%
ASCII 128
 
20.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
54
 
10.9%
54
 
10.9%
50
 
10.1%
48
 
9.7%
48
 
9.7%
16
 
3.2%
8
 
1.6%
6
 
1.2%
6
 
1.2%
6
 
1.2%
Other values (124) 199
40.2%
ASCII
ValueCountFrequency (%)
31
24.2%
C 18
14.1%
P 17
13.3%
c 7
 
5.5%
p 6
 
4.7%
2 5
 
3.9%
L 4
 
3.1%
& 3
 
2.3%
n 3
 
2.3%
o 3
 
2.3%
Other values (21) 31
24.2%
Distinct76
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size764.0 B
2024-01-10T08:04:04.195267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length30
Mean length24.556962
Min length18

Characters and Unicode

Total characters1940
Distinct characters89
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)92.4%

Sample

1st row충청남도 공주시 번영1로 154 (신관동)
2nd row충청남도 공주시 매산동길 36 (신관동)
3rd row충청남도 공주시 국고개길 3-1 (중동)
4th row충청남도 공주시 무안길 17 (금학동)
5th row충청남도 공주시 공주대학로 59 (신관동, 대창빌딩)
ValueCountFrequency (%)
충청남도 79
18.5%
공주시 79
18.5%
신관동 43
 
10.1%
번영2로 11
 
2.6%
2층 11
 
2.6%
반포면 9
 
2.1%
번영1로 8
 
1.9%
흑수골길 8
 
1.9%
금성동 7
 
1.6%
1층 6
 
1.4%
Other values (114) 166
38.9%
2024-01-10T08:04:04.627106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
348
17.9%
85
 
4.4%
85
 
4.4%
84
 
4.3%
1 82
 
4.2%
79
 
4.1%
79
 
4.1%
79
 
4.1%
79
 
4.1%
72
 
3.7%
Other values (79) 868
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1122
57.8%
Space Separator 348
 
17.9%
Decimal Number 283
 
14.6%
Open Punctuation 63
 
3.2%
Close Punctuation 63
 
3.2%
Dash Punctuation 35
 
1.8%
Other Punctuation 26
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
 
7.6%
85
 
7.6%
84
 
7.5%
79
 
7.0%
79
 
7.0%
79
 
7.0%
79
 
7.0%
72
 
6.4%
48
 
4.3%
46
 
4.1%
Other values (64) 386
34.4%
Decimal Number
ValueCountFrequency (%)
1 82
29.0%
2 48
17.0%
3 32
 
11.3%
7 26
 
9.2%
8 24
 
8.5%
5 21
 
7.4%
4 18
 
6.4%
6 15
 
5.3%
9 10
 
3.5%
0 7
 
2.5%
Space Separator
ValueCountFrequency (%)
348
100.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35
100.0%
Other Punctuation
ValueCountFrequency (%)
, 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1122
57.8%
Common 818
42.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
 
7.6%
85
 
7.6%
84
 
7.5%
79
 
7.0%
79
 
7.0%
79
 
7.0%
79
 
7.0%
72
 
6.4%
48
 
4.3%
46
 
4.1%
Other values (64) 386
34.4%
Common
ValueCountFrequency (%)
348
42.5%
1 82
 
10.0%
( 63
 
7.7%
) 63
 
7.7%
2 48
 
5.9%
- 35
 
4.3%
3 32
 
3.9%
, 26
 
3.2%
7 26
 
3.2%
8 24
 
2.9%
Other values (5) 71
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1122
57.8%
ASCII 818
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
348
42.5%
1 82
 
10.0%
( 63
 
7.7%
) 63
 
7.7%
2 48
 
5.9%
- 35
 
4.3%
3 32
 
3.9%
, 26
 
3.2%
7 26
 
3.2%
8 24
 
2.9%
Other values (5) 71
 
8.7%
Hangul
ValueCountFrequency (%)
85
 
7.6%
85
 
7.6%
84
 
7.5%
79
 
7.0%
79
 
7.0%
79
 
7.0%
79
 
7.0%
72
 
6.4%
48
 
4.3%
46
 
4.1%
Other values (64) 386
34.4%

전화번호
Text

MISSING 

Distinct29
Distinct (%)96.7%
Missing49
Missing (%)62.0%
Memory size764.0 B
2024-01-10T08:04:04.832434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)93.3%

Sample

1st row041-854-0035
2nd row041-857-8624
3rd row041-856-6060
4th row041-825-9379
5th row041-825-3976
ValueCountFrequency (%)
041-825-4700 2
 
6.7%
041-825-0022 1
 
3.3%
041-854-3821 1
 
3.3%
041-825-7007 1
 
3.3%
041-855-5339 1
 
3.3%
041-857-9400 1
 
3.3%
041-881-2711 1
 
3.3%
041-855-0668 1
 
3.3%
041-854-2255 1
 
3.3%
041-858-0999 1
 
3.3%
Other values (19) 19
63.3%
2024-01-10T08:04:05.248214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 60
16.7%
0 58
16.1%
4 45
12.5%
8 45
12.5%
1 41
11.4%
5 36
10.0%
2 18
 
5.0%
6 17
 
4.7%
7 15
 
4.2%
9 14
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 300
83.3%
Dash Punctuation 60
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 58
19.3%
4 45
15.0%
8 45
15.0%
1 41
13.7%
5 36
12.0%
2 18
 
6.0%
6 17
 
5.7%
7 15
 
5.0%
9 14
 
4.7%
3 11
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 60
16.7%
0 58
16.1%
4 45
12.5%
8 45
12.5%
1 41
11.4%
5 36
10.0%
2 18
 
5.0%
6 17
 
4.7%
7 15
 
4.2%
9 14
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 60
16.7%
0 58
16.1%
4 45
12.5%
8 45
12.5%
1 41
11.4%
5 36
10.0%
2 18
 
5.0%
6 17
 
4.7%
7 15
 
4.2%
9 14
 
3.9%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size764.0 B
Minimum2020-09-03 00:00:00
Maximum2020-09-03 00:00:00
2024-01-10T08:04:05.363365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T08:04:05.458134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2024-01-10T08:04:05.559609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분상호영업소소재지(도로명)전화번호
구분1.0001.0000.959NaN
상호1.0001.0001.0001.000
영업소소재지(도로명)0.9591.0001.0001.000
전화번호NaN1.0001.0001.000

Missing values

2024-01-10T08:04:02.955319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T08:04:03.046079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분상호영업소소재지(도로명)전화번호데이터기준일자
0인터넷게임시설제공업LIBERTY PC방충청남도 공주시 번영1로 154 (신관동)<NA>2020-09-03
1인터넷게임시설제공업슈퍼맨PC방충청남도 공주시 매산동길 36 (신관동)<NA>2020-09-03
2인터넷게임시설제공업컴스쿨pc방충청남도 공주시 국고개길 3-1 (중동)<NA>2020-09-03
3인터넷게임시설제공업게임존pc방충청남도 공주시 무안길 17 (금학동)<NA>2020-09-03
4인터넷게임시설제공업EONpc방충청남도 공주시 공주대학로 59 (신관동, 대창빌딩)<NA>2020-09-03
5인터넷게임시설제공업프렌드pc방충청남도 공주시 금성길 20 (금성동)<NA>2020-09-03
6인터넷게임시설제공업힐링PC방충청남도 공주시 신관로 59 (신관동)<NA>2020-09-03
7인터넷게임시설제공업IL PC방충청남도 공주시 신금1길 62-17, 1층 (신관동)<NA>2020-09-03
8인터넷게임시설제공업노리터 PC방충청남도 공주시 흑수골길 38, 2층 (신관동)<NA>2020-09-03
9인터넷게임시설제공업CLASS PC충청남도 공주시 무령로 232 (중동)<NA>2020-09-03
구분상호영업소소재지(도로명)전화번호데이터기준일자
69노래연습장신파라오 노래연습장충청남도 공주시 계룡면 영규대사로 67-1, 1층<NA>2020-09-03
70노래연습장코인잼노래연습장충청남도 공주시 공주대학로 77-1 (신관동)<NA>2020-09-03
71노래연습장까치노래연습장충청남도 공주시 산성시장5길 78-12 (금성동)041-855-32352020-09-03
72노래연습장게임1번지&코인노래연습장충청남도 공주시 흑수골길 10 (신관동)<NA>2020-09-03
73노래연습장슈퍼스타K 코인노래연습장충청남도 공주시 번영3로 69, 1층 (신관동)<NA>2020-09-03
74노래연습장G.coin 노래연습장충청남도 공주시 번영2로 78-1, 1층 (신관동)<NA>2020-09-03
75노래연습장핫플코인노래연습장충청남도 공주시 흑수골길 33, 2층 (신관동)<NA>2020-09-03
76노래연습장황금노래연습장충청남도 공주시 반포면 임금봉길 11<NA>2020-09-03
77노래연습장슈퍼스타K 시즌2충청남도 공주시 무령로 229, 2층 (산성동)<NA>2020-09-03
78노래연습장원멀티게임장&코인노래연습장충청남도 공주시 흑수골길 41, 지하호 (신관동)<NA>2020-09-03