Overview

Dataset statistics

Number of variables5
Number of observations1346
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory54.0 KiB
Average record size in memory41.1 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description충청남도 소재 노래연습장 현황을 연번, 15개 시군별구분, 상호명, 형태 순으로 나열하여 범도민 공공데이터로 개방합니다.
Author충청남도
URLhttps://www.data.go.kr/data/15039275/fileData.do

Alerts

연번 is highly overall correlated with 시군별High correlation
시군별 is highly overall correlated with 연번High correlation
형태 is highly imbalanced (68.2%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-15 02:49:31.635981
Analysis finished2024-03-15 02:49:34.821924
Duration3.19 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1346
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean673.5
Minimum1
Maximum1346
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.0 KiB
2024-03-15T11:49:35.073449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile68.25
Q1337.25
median673.5
Q31009.75
95-th percentile1278.75
Maximum1346
Range1345
Interquartile range (IQR)672.5

Descriptive statistics

Standard deviation388.70104
Coefficient of variation (CV)0.57713592
Kurtosis-1.2
Mean673.5
Median Absolute Deviation (MAD)336.5
Skewness0
Sum906531
Variance151088.5
MonotonicityStrictly increasing
2024-03-15T11:49:35.442403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
896 1
 
0.1%
904 1
 
0.1%
903 1
 
0.1%
902 1
 
0.1%
901 1
 
0.1%
900 1
 
0.1%
899 1
 
0.1%
898 1
 
0.1%
897 1
 
0.1%
Other values (1336) 1336
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1346 1
0.1%
1345 1
0.1%
1344 1
0.1%
1343 1
0.1%
1342 1
0.1%
1341 1
0.1%
1340 1
0.1%
1339 1
0.1%
1338 1
0.1%
1337 1
0.1%

시군별
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size10.6 KiB
천안시 서북구
254 
천안시 동남구
142 
아산시
127 
당진시
126 
예산군
121 
Other values (11)
576 

Length

Max length7
Median length3
Mean length4.1768202
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row천안시 동남구
2nd row천안시 동남구
3rd row천안시 동남구
4th row천안시 동남구
5th row천안시 동남구

Common Values

ValueCountFrequency (%)
천안시 서북구 254
18.9%
천안시 동남구 142
10.5%
아산시 127
9.4%
당진시 126
9.4%
예산군 121
9.0%
태안군 114
8.5%
보령시 101
 
7.5%
서산시 89
 
6.6%
논산시 82
 
6.1%
홍성군 65
 
4.8%
Other values (6) 125
9.3%

Length

2024-03-15T11:49:36.016604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
천안시 396
22.7%
서북구 254
14.6%
동남구 142
 
8.2%
아산시 127
 
7.3%
당진시 126
 
7.2%
예산군 121
 
6.9%
태안군 114
 
6.5%
보령시 101
 
5.8%
서산시 89
 
5.1%
논산시 82
 
4.7%
Other values (7) 190
10.9%
Distinct989
Distinct (%)73.5%
Missing0
Missing (%)0.0%
Memory size10.6 KiB
2024-03-15T11:49:37.007203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length20
Mean length7.9658247
Min length3

Characters and Unicode

Total characters10722
Distinct characters545
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique785 ?
Unique (%)58.3%

Sample

1st row산장노래연습장
2nd row쌈바노래연습장
3rd row꾼노래연습장
4th row바이브 코인 노래연습장
5th row애플노래연습장
ValueCountFrequency (%)
노래연습장 225
 
13.7%
코인노래연습장 19
 
1.2%
스타노래연습장 13
 
0.8%
팡팡노래연습장 12
 
0.7%
궁노래연습장 11
 
0.7%
세븐스타코인노래연습장 9
 
0.5%
vip노래연습장 7
 
0.4%
오렌지노래연습장 7
 
0.4%
썸노래연습장 7
 
0.4%
노래방 7
 
0.4%
Other values (1000) 1331
80.8%
2024-03-15T11:49:38.405071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1351
 
12.6%
1351
 
12.6%
1245
 
11.6%
1242
 
11.6%
1234
 
11.5%
302
 
2.8%
137
 
1.3%
137
 
1.3%
136
 
1.3%
116
 
1.1%
Other values (535) 3471
32.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10112
94.3%
Space Separator 302
 
2.8%
Uppercase Letter 190
 
1.8%
Decimal Number 40
 
0.4%
Lowercase Letter 38
 
0.4%
Other Punctuation 14
 
0.1%
Open Punctuation 10
 
0.1%
Close Punctuation 10
 
0.1%
Dash Punctuation 5
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1351
13.4%
1351
13.4%
1245
12.3%
1242
12.3%
1234
 
12.2%
137
 
1.4%
137
 
1.4%
136
 
1.3%
116
 
1.1%
76
 
0.8%
Other values (476) 3087
30.5%
Uppercase Letter
ValueCountFrequency (%)
K 23
12.1%
S 15
 
7.9%
O 14
 
7.4%
I 13
 
6.8%
P 13
 
6.8%
M 13
 
6.8%
N 12
 
6.3%
V 11
 
5.8%
A 11
 
5.8%
B 10
 
5.3%
Other values (16) 55
28.9%
Lowercase Letter
ValueCountFrequency (%)
o 7
18.4%
p 5
13.2%
a 4
10.5%
c 3
7.9%
i 3
7.9%
e 3
7.9%
r 3
7.9%
b 3
7.9%
n 2
 
5.3%
w 1
 
2.6%
Other values (4) 4
10.5%
Decimal Number
ValueCountFrequency (%)
2 14
35.0%
1 8
20.0%
3 5
 
12.5%
5 3
 
7.5%
0 3
 
7.5%
6 2
 
5.0%
4 2
 
5.0%
7 1
 
2.5%
9 1
 
2.5%
8 1
 
2.5%
Other Punctuation
ValueCountFrequency (%)
. 9
64.3%
& 3
 
21.4%
! 1
 
7.1%
# 1
 
7.1%
Space Separator
ValueCountFrequency (%)
302
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10111
94.3%
Common 382
 
3.6%
Latin 228
 
2.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1351
13.4%
1351
13.4%
1245
12.3%
1242
12.3%
1234
 
12.2%
137
 
1.4%
137
 
1.4%
136
 
1.3%
116
 
1.1%
76
 
0.8%
Other values (475) 3086
30.5%
Latin
ValueCountFrequency (%)
K 23
 
10.1%
S 15
 
6.6%
O 14
 
6.1%
I 13
 
5.7%
P 13
 
5.7%
M 13
 
5.7%
N 12
 
5.3%
V 11
 
4.8%
A 11
 
4.8%
B 10
 
4.4%
Other values (30) 93
40.8%
Common
ValueCountFrequency (%)
302
79.1%
2 14
 
3.7%
( 10
 
2.6%
) 10
 
2.6%
. 9
 
2.4%
1 8
 
2.1%
3 5
 
1.3%
- 5
 
1.3%
5 3
 
0.8%
0 3
 
0.8%
Other values (9) 13
 
3.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10111
94.3%
ASCII 610
 
5.7%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1351
13.4%
1351
13.4%
1245
12.3%
1242
12.3%
1234
 
12.2%
137
 
1.4%
137
 
1.4%
136
 
1.3%
116
 
1.1%
76
 
0.8%
Other values (475) 3086
30.5%
ASCII
ValueCountFrequency (%)
302
49.5%
K 23
 
3.8%
S 15
 
2.5%
2 14
 
2.3%
O 14
 
2.3%
I 13
 
2.1%
P 13
 
2.1%
M 13
 
2.1%
N 12
 
2.0%
V 11
 
1.8%
Other values (49) 180
29.5%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct1230
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size10.6 KiB
2024-03-15T11:49:39.474380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length45
Mean length25.653789
Min length16

Characters and Unicode

Total characters34530
Distinct characters395
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1137 ?
Unique (%)84.5%

Sample

1st row충청남도 천안시 동남구 각원사길 119 (안서동)
2nd row충청남도 천안시 동남구 각원사길 201 (안서동)
3rd row충청남도 천안시 동남구 각원사길 48, 2층 (안서동)
4th row충청남도 천안시 동남구 각원사길 54, 203호 (안서동, 상명웰빙프라자)
5th row충청남도 천안시 동남구 각원사길 56, 4층 (안서동)
ValueCountFrequency (%)
충청남도 1345
 
17.6%
천안시 396
 
5.2%
서북구 254
 
3.3%
동남구 142
 
1.9%
아산시 127
 
1.7%
당진시 126
 
1.6%
예산군 121
 
1.6%
태안군 114
 
1.5%
2층 108
 
1.4%
보령시 101
 
1.3%
Other values (1544) 4822
63.0%
2024-03-15T11:49:41.089124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6393
 
18.5%
1535
 
4.4%
1405
 
4.1%
1380
 
4.0%
1363
 
3.9%
1 1174
 
3.4%
1029
 
3.0%
901
 
2.6%
841
 
2.4%
2 817
 
2.4%
Other values (385) 17692
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21071
61.0%
Space Separator 6393
 
18.5%
Decimal Number 4962
 
14.4%
Close Punctuation 656
 
1.9%
Open Punctuation 656
 
1.9%
Other Punctuation 447
 
1.3%
Dash Punctuation 332
 
1.0%
Uppercase Letter 12
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1535
 
7.3%
1405
 
6.7%
1380
 
6.5%
1363
 
6.5%
1029
 
4.9%
901
 
4.3%
841
 
4.0%
677
 
3.2%
646
 
3.1%
615
 
2.9%
Other values (359) 10679
50.7%
Decimal Number
ValueCountFrequency (%)
1 1174
23.7%
2 817
16.5%
3 559
11.3%
4 440
 
8.9%
5 381
 
7.7%
0 381
 
7.7%
6 334
 
6.7%
7 311
 
6.3%
8 296
 
6.0%
9 269
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
P 3
25.0%
C 2
16.7%
B 2
16.7%
A 1
 
8.3%
I 1
 
8.3%
V 1
 
8.3%
G 1
 
8.3%
L 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
, 444
99.3%
. 2
 
0.4%
& 1
 
0.2%
Space Separator
ValueCountFrequency (%)
6393
100.0%
Close Punctuation
ValueCountFrequency (%)
) 656
100.0%
Open Punctuation
ValueCountFrequency (%)
( 656
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 332
100.0%
Lowercase Letter
ValueCountFrequency (%)
j 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21071
61.0%
Common 13446
38.9%
Latin 13
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1535
 
7.3%
1405
 
6.7%
1380
 
6.5%
1363
 
6.5%
1029
 
4.9%
901
 
4.3%
841
 
4.0%
677
 
3.2%
646
 
3.1%
615
 
2.9%
Other values (359) 10679
50.7%
Common
ValueCountFrequency (%)
6393
47.5%
1 1174
 
8.7%
2 817
 
6.1%
) 656
 
4.9%
( 656
 
4.9%
3 559
 
4.2%
, 444
 
3.3%
4 440
 
3.3%
5 381
 
2.8%
0 381
 
2.8%
Other values (7) 1545
 
11.5%
Latin
ValueCountFrequency (%)
P 3
23.1%
C 2
15.4%
B 2
15.4%
A 1
 
7.7%
I 1
 
7.7%
V 1
 
7.7%
j 1
 
7.7%
G 1
 
7.7%
L 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21071
61.0%
ASCII 13459
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6393
47.5%
1 1174
 
8.7%
2 817
 
6.1%
) 656
 
4.9%
( 656
 
4.9%
3 559
 
4.2%
, 444
 
3.3%
4 440
 
3.3%
5 381
 
2.8%
0 381
 
2.8%
Other values (16) 1558
 
11.6%
Hangul
ValueCountFrequency (%)
1535
 
7.3%
1405
 
6.7%
1380
 
6.5%
1363
 
6.5%
1029
 
4.9%
901
 
4.3%
841
 
4.0%
677
 
3.2%
646
 
3.1%
615
 
2.9%
Other values (359) 10679
50.7%

형태
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.6 KiB
노래연습장
1206 
동전노래연습장
135 
혼합(노래연습장+동전노래연습장)
 
5

Length

Max length17
Median length5
Mean length5.2451709
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row노래연습장
2nd row노래연습장
3rd row노래연습장
4th row동전노래연습장
5th row노래연습장

Common Values

ValueCountFrequency (%)
노래연습장 1206
89.6%
동전노래연습장 135
 
10.0%
혼합(노래연습장+동전노래연습장) 5
 
0.4%

Length

2024-03-15T11:49:41.524422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T11:49:41.876207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
노래연습장 1206
89.6%
동전노래연습장 135
 
10.0%
혼합(노래연습장+동전노래연습장 5
 
0.4%

Interactions

2024-03-15T11:49:34.248430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T11:49:42.151052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군별형태
연번1.0000.9550.288
시군별0.9551.0000.275
형태0.2880.2751.000
2024-03-15T11:49:42.400015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
형태시군별
형태1.0000.154
시군별0.1541.000
2024-03-15T11:49:42.713785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시군별형태
연번1.0000.8000.179
시군별0.8001.0000.154
형태0.1790.1541.000

Missing values

2024-03-15T11:49:34.570807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T11:49:34.754772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시군별상호명소재지형태
01천안시 동남구산장노래연습장충청남도 천안시 동남구 각원사길 119 (안서동)노래연습장
12천안시 동남구쌈바노래연습장충청남도 천안시 동남구 각원사길 201 (안서동)노래연습장
23천안시 동남구꾼노래연습장충청남도 천안시 동남구 각원사길 48, 2층 (안서동)노래연습장
34천안시 동남구바이브 코인 노래연습장충청남도 천안시 동남구 각원사길 54, 203호 (안서동, 상명웰빙프라자)동전노래연습장
45천안시 동남구애플노래연습장충청남도 천안시 동남구 각원사길 56, 4층 (안서동)노래연습장
56천안시 동남구매드락코인노래연습장충청남도 천안시 동남구 각원사길 66, 2층 (안서동)동전노래연습장
67천안시 동남구팡팡노래연습장충청남도 천안시 동남구 고재10길 3 (원성동)노래연습장
78천안시 동남구극동노래연습장충청남도 천안시 동남구 고재17길 5 (원성동)노래연습장
89천안시 동남구필드노래연습장충청남도 천안시 동남구 고재20길 14, 2층 (원성동)노래연습장
910천안시 동남구탈출노래연습장충청남도 천안시 동남구 광덕면 신흥리1길 168노래연습장
연번시군별상호명소재지형태
13361337태안군축제 노래연습장충청남도 태안군 태안읍 독샘로 19노래연습장
13371338태안군궁노래연습장충청남도 태안군 태안읍 동문2길 25노래연습장
13381339태안군쏘나타노래연습장충청남도 태안군 태안읍 동문1길 26노래연습장
13391340태안군썬비치노래연습장충청남도 태안군 안면읍 방포1길 16 (지하1층)노래연습장
13401341태안군소원노래연습장충청남도 태안군 소원면 서해로 667노래연습장
13411342태안군로꼬동전노래연습장충청남도 태안군 태안읍 동문3길 3노래연습장
13421343태안군코니코인노래연습장충청남도 태안군 남면 곰섬로 236-220노래연습장
13431344태안군레인보우 코인 노래연습장충청남도 태안군 태안읍 동문2길 3, 3층노래연습장
13441345태안군해피존 노래연습장충청남도 태안군 원북면 옥파로 1166노래연습장
13451346태안군벗과뱃나루 노래연습장충청남도 태안군 남면 마검포길 201-42노래연습장