Overview

Dataset statistics

Number of variables: 4
Number of observations: 90
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.0 KiB
Average record size in memory: 34.5 B

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

Description: Status of record and music video production and distribution businesses, 2013
Author: Gwangju Metropolitan City (광주광역시)
URL: https://www.data.go.kr/data/3076354/fileData.do

Alerts

연번 (serial number) is highly overall correlated with 업종 (business type) [High correlation]
업종 is highly overall correlated with 연번 [High correlation]
업종 is highly imbalanced (69.0%) [Imbalance]
연번 has unique values [Unique]
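
The 69.0% imbalance figure for 업종 is consistent with one minus the normalized Shannon entropy of the class counts reported for that variable (85 and 5). The sketch below is an assumption about how such a score can be derived, not a quotation of the profiler's code; the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the
    class distribution and k is the number of classes.
    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single class is reported as "no imbalance"
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts for 업종 taken from this report: 제작업 = 85, 배급업 = 5.
score = imbalance_score([85, 5])
print(f"{score:.1%}")
```

Under this definition a 50/50 split scores 0, so a value of roughly 0.69 reflects how strongly the 제작업 class dominates.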

Reproduction

Analysis started: 2023-12-12 05:52:40.741475
Analysis finished: 2023-12-12 05:52:41.538563
Duration: 0.8 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 90
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 45.5
Minimum: 1
Maximum: 90
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 942.0 B

Quantile statistics

Minimum: 1
5-th percentile: 5.45
Q1: 23.25
Median: 45.5
Q3: 67.75
95-th percentile: 85.55
Maximum: 90
Range: 89
Interquartile range (IQR): 44.5

Descriptive statistics

Standard deviation: 26.124701
Coefficient of variation (CV): 0.57416925
Kurtosis: -1.2
Mean: 45.5
Median Absolute Deviation (MAD): 22.5
Skewness: 0
Sum: 4095
Variance: 682.5
Monotonicity: Strictly increasing
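
Because 연번 is unique, strictly increasing, and runs from 1 to 90 with a sum of 4095, the quantile and descriptive statistics can be checked against the integers 1 through 90. The sketch below assumes the column is exactly that sequence and that quantiles use linear interpolation; `quantile` is a hypothetical helper, not part of any library.

```python
import statistics

values = list(range(1, 91))  # assumption: 연번 is exactly 1, 2, ..., 90

def quantile(sorted_vals, q):
    """Linearly interpolated quantile at position q * (n - 1)."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])

mean = statistics.mean(values)            # 45.5
variance = statistics.variance(values)    # sample variance (n - 1 divisor): 682.5
std = statistics.stdev(values)            # about 26.1247
cv = std / mean                           # about 0.5742
median = statistics.median(values)        # 45.5
q1 = quantile(values, 0.25)               # 23.25
q3 = quantile(values, 0.75)               # 67.75
iqr = q3 - q1                             # 44.5
# Median absolute deviation: median of |x - median|.
mad = statistics.median(abs(v - median) for v in values)  # 22.5

print(mean, variance, round(std, 6), round(cv, 6), q1, q3, iqr, mad)
```

The agreement with the reported values suggests the variance uses the n - 1 divisor; the symmetric spacing of the values also explains the skewness of 0.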
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 1.1%
69 | 1 | 1.1%
67 | 1 | 1.1%
66 | 1 | 1.1%
65 | 1 | 1.1%
64 | 1 | 1.1%
63 | 1 | 1.1%
62 | 1 | 1.1%
61 | 1 | 1.1%
60 | 1 | 1.1%
Other values (80) | 80 | 88.9%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.1%
2 | 1 | 1.1%
3 | 1 | 1.1%
4 | 1 | 1.1%
5 | 1 | 1.1%
6 | 1 | 1.1%
7 | 1 | 1.1%
8 | 1 | 1.1%
9 | 1 | 1.1%
10 | 1 | 1.1%

Maximum 10 values
Value | Count | Frequency (%)
90 | 1 | 1.1%
89 | 1 | 1.1%
88 | 1 | 1.1%
87 | 1 | 1.1%
86 | 1 | 1.1%
85 | 1 | 1.1%
84 | 1 | 1.1%
83 | 1 | 1.1%
82 | 1 | 1.1%
81 | 1 | 1.1%

업종 (business type)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

제작업 (production): 85
배급업 (distribution): 5

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 배급업
2nd row: 배급업
3rd row: 배급업
4th row: 배급업
5th row: 배급업

Common Values

Value | Count | Frequency (%)
제작업 | 85 | 94.4%
배급업 | 5 | 5.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
제작업 | 85 | 94.4%
배급업 | 5 | 5.6%

상호명 (business name)
Text

Distinct: 74
Distinct (%): 82.2%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Length

Max length: 32
Median length: 7
Mean length: 7.8222222
Min length: 3

Characters and Unicode

Total characters: 704
Distinct characters: 160
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
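
As an illustration of those character properties, the sketch below tallies the Unicode general category of each character in one sample value from this column using Python's standard `unicodedata` module. It is only meant to show where labels such as "Other Letter" or "Space Separator" come from, and it makes no claim about how the profiler computes its tables.

```python
import unicodedata
from collections import Counter

# Sample value taken from the 상호명 column in this report.
name = "엠엔 트레코(MN TREKO)"

# Two-letter general category codes and the long names used in the report.
LONG_NAMES = {
    "Lo": "Other Letter",        # Hangul syllables fall here
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}

categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(LONG_NAMES.get(code, code), count)
```

For this 16-character string the tally is 7 uppercase letters, 5 Hangul syllables (Other Letter), 2 spaces, and one opening and one closing parenthesis.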

Unique

Unique: 64
Unique (%): 71.1%

Sample

1st row: 제이램
2nd row: 유한회사 제이이사컴퍼니
3rd row: 엠엔 트레코(MN TREKO)
4th row: AN'T SOUND CONTENTS
5th row: (주)디자인허브

Value | Count | Frequency (%)
놀자노래뮤비방 | 6 | 5.6%
꼬셔봐노래뮤비방 | 3 | 2.8%
투투노래뮤비방 | 3 | 2.8%
센트럴노래뮤비방 | 2 | 1.9%
sound | 2 | 1.9%
제이이사컴퍼니 | 2 | 1.9%
유한회사 | 2 | 1.9%
contents | 2 | 1.9%
엔젤노래뮤비방 | 2 | 1.9%
제이램 | 2 | 1.9%
Other values (78) | 81 | 75.7%

Most occurring characters

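Value | Count | Frequency (%)
(missing) | 71 | 10.1%
(missing) | 71 | 10.1%
(missing) | 69 | 9.8%
(missing) | 68 | 9.7%
(missing) | 66 | 9.4%
(space) | 17 | 2.4%
(missing) | 11 | 1.6%
N | 9 | 1.3%
(missing) | 8 | 1.1%
T | 8 | 1.1%
Other values (150) | 306 | 43.5%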

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 612 | 86.9%
Uppercase Letter | 42 | 6.0%
Space Separator | 17 | 2.4%
Lowercase Letter | 14 | 2.0%
Open Punctuation | 7 | 1.0%
Close Punctuation | 7 | 1.0%
Other Punctuation | 5 | 0.7%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 71 | 11.6%
(missing) | 71 | 11.6%
(missing) | 69 | 11.3%
(missing) | 68 | 11.1%
(missing) | 66 | 10.8%
(missing) | 11 | 1.8%
(missing) | 8 | 1.3%
(missing) | 7 | 1.1%
(missing) | 7 | 1.1%
(missing) | 7 | 1.1%
Other values (122) | 227 | 37.1%

Uppercase Letter
Value | Count | Frequency (%)
N | 9 | 21.4%
T | 8 | 19.0%
O | 6 | 14.3%
S | 4 | 9.5%
E | 3 | 7.1%
C | 2 | 4.8%
D | 2 | 4.8%
U | 2 | 4.8%
K | 2 | 4.8%
A | 2 | 4.8%
Other values (2) | 2 | 4.8%

Lowercase Letter
Value | Count | Frequency (%)
o | 2 | 14.3%
n | 2 | 14.3%
e | 2 | 14.3%
g | 1 | 7.1%
i | 1 | 7.1%
k | 1 | 7.1%
h | 1 | 7.1%
t | 1 | 7.1%
s | 1 | 7.1%
m | 1 | 7.1%

Other Punctuation
Value | Count | Frequency (%)
. | 4 | 80.0%
' | 1 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 17 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 7 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 7 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 612 | 86.9%
Latin | 56 | 8.0%
Common | 36 | 5.1%

Most frequent character per script

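Hangul
Value | Count | Frequency (%)
(missing) | 71 | 11.6%
(missing) | 71 | 11.6%
(missing) | 69 | 11.3%
(missing) | 68 | 11.1%
(missing) | 66 | 10.8%
(missing) | 11 | 1.8%
(missing) | 8 | 1.3%
(missing) | 7 | 1.1%
(missing) | 7 | 1.1%
(missing) | 7 | 1.1%
Other values (122) | 227 | 37.1%

Latin
Value | Count | Frequency (%)
N | 9 | 16.1%
T | 8 | 14.3%
O | 6 | 10.7%
S | 4 | 7.1%
E | 3 | 5.4%
C | 2 | 3.6%
D | 2 | 3.6%
U | 2 | 3.6%
o | 2 | 3.6%
n | 2 | 3.6%
Other values (13) | 16 | 28.6%

Common
Value | Count | Frequency (%)
(space) | 17 | 47.2%
( | 7 | 19.4%
) | 7 | 19.4%
. | 4 | 11.1%
' | 1 | 2.8%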

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 612 | 86.9%
ASCII | 92 | 13.1%

Most frequent character per block

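Hangul
Value | Count | Frequency (%)
(missing) | 71 | 11.6%
(missing) | 71 | 11.6%
(missing) | 69 | 11.3%
(missing) | 68 | 11.1%
(missing) | 66 | 10.8%
(missing) | 11 | 1.8%
(missing) | 8 | 1.3%
(missing) | 7 | 1.1%
(missing) | 7 | 1.1%
(missing) | 7 | 1.1%
Other values (122) | 227 | 37.1%

ASCII
Value | Count | Frequency (%)
(space) | 17 | 18.5%
N | 9 | 9.8%
T | 8 | 8.7%
( | 7 | 7.6%
) | 7 | 7.6%
O | 6 | 6.5%
S | 4 | 4.3%
. | 4 | 4.3%
E | 3 | 3.3%
C | 2 | 2.2%
Other values (18) | 25 | 27.2%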

영업소주소 (business address)
Text

Distinct: 87
Distinct (%): 96.7%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Length

Max length: 32
Median length: 31
Mean length: 24.355556
Min length: 20

Characters and Unicode

Total characters: 2192
Distinct characters: 112
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 84
Unique (%): 93.3%

Sample

1st row: 광주광역시 북구 중흥동 683번지 14호
2nd row: 광주광역시 서구 화정동 23번지 16호
3rd row: 광주광역시 서구 화정동 400번지 16호 103동
4th row: 광주광역시 남구 사동 177번지 39호 광주영상예술센터
5th row: 광주광역시 동구 금남로2가 20-2번지 무등빌딩 10층

Value | Count | Frequency (%)
광주광역시 | 90 | 19.7%
북구 | 29 | 6.4%
광산구 | 25 | 5.5%
서구 | 18 | 3.9%
0호 | 12 | 2.6%
남구 | 12 | 2.6%
용봉동 | 11 | 2.4%
3호 | 8 | 1.8%
동구 | 6 | 1.3%
2호 | 6 | 1.3%
Other values (165) | 239 | 52.4%
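
The word counts above suggest that addresses are split on whitespace: 광주광역시 appears once per observation (90 times), and the five district tokens add up to 29 + 25 + 18 + 12 + 6 = 90. Assuming every address follows the "city district neighbourhood ..." pattern seen in the sample rows, the district can be read off as the second token. The helper `district_of` is hypothetical and the sketch only uses the five sample addresses shown in this report.

```python
from collections import Counter

# Sample addresses copied from the 영업소주소 column in this report.
addresses = [
    "광주광역시 북구 중흥동 683번지 14호",
    "광주광역시 서구 화정동 23번지 16호",
    "광주광역시 서구 화정동 400번지 16호 103동",
    "광주광역시 남구 사동 177번지 39호 광주영상예술센터",
    "광주광역시 동구 금남로2가 20-2번지 무등빌딩 10층",
]

def district_of(address):
    """Return the second whitespace-separated token, assumed to be the district (구)."""
    tokens = address.split()
    return tokens[1] if len(tokens) > 1 else None

district_counts = Counter(district_of(a) for a in addresses)

# District token counts as reported in the word table of this report.
reported = {"북구": 29, "광산구": 25, "서구": 18, "남구": 12, "동구": 6}
reported_total = sum(reported.values())  # equals the number of observations

print(district_counts)
print(reported_total)
```

Applied to the full column, the same split would be expected to reproduce the district counts in the table, provided no address deviates from the assumed pattern.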

Most occurring characters

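Value | Count | Frequency (%)
(space) | 527 | 24.0%
(missing) | 207 | 9.4%
(missing) | 100 | 4.6%
(missing) | 95 | 4.3%
(missing) | 90 | 4.1%
(missing) | 90 | 4.1%
(missing) | 90 | 4.1%
(missing) | 90 | 4.1%
(missing) | 90 | 4.1%
1 | 86 | 3.9%
Other values (102) | 727 | 33.2%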

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1244 | 56.8%
Space Separator | 527 | 24.0%
Decimal Number | 413 | 18.8%
Uppercase Letter | 3 | 0.1%
Dash Punctuation | 2 | 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 207 | 16.6%
(missing) | 100 | 8.0%
(missing) | 95 | 7.6%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 78 | 6.3%
(missing) | 33 | 2.7%
Other values (85) | 281 | 22.6%

Decimal Number
Value | Count | Frequency (%)
1 | 86 | 20.8%
3 | 49 | 11.9%
0 | 47 | 11.4%
8 | 38 | 9.2%
2 | 37 | 9.0%
9 | 33 | 8.0%
5 | 33 | 8.0%
7 | 31 | 7.5%
4 | 31 | 7.5%
6 | 28 | 6.8%

Uppercase Letter
Value | Count | Frequency (%)
A | 2 | 66.7%
B | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 527 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1244 | 56.8%
Common | 945 | 43.1%
Latin | 3 | 0.1%

Most frequent character per script

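Hangul
Value | Count | Frequency (%)
(missing) | 207 | 16.6%
(missing) | 100 | 8.0%
(missing) | 95 | 7.6%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 78 | 6.3%
(missing) | 33 | 2.7%
Other values (85) | 281 | 22.6%

Common
Value | Count | Frequency (%)
(space) | 527 | 55.8%
1 | 86 | 9.1%
3 | 49 | 5.2%
0 | 47 | 5.0%
8 | 38 | 4.0%
2 | 37 | 3.9%
9 | 33 | 3.5%
5 | 33 | 3.5%
7 | 31 | 3.3%
4 | 31 | 3.3%
Other values (5) | 33 | 3.5%

Latin
Value | Count | Frequency (%)
A | 2 | 66.7%
B | 1 | 33.3%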

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1244 | 56.8%
ASCII | 948 | 43.2%

Most frequent character per block

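ASCII
Value | Count | Frequency (%)
(space) | 527 | 55.6%
1 | 86 | 9.1%
3 | 49 | 5.2%
0 | 47 | 5.0%
8 | 38 | 4.0%
2 | 37 | 3.9%
9 | 33 | 3.5%
5 | 33 | 3.5%
7 | 31 | 3.3%
4 | 31 | 3.3%
Other values (7) | 36 | 3.8%

Hangul
Value | Count | Frequency (%)
(missing) | 207 | 16.6%
(missing) | 100 | 8.0%
(missing) | 95 | 7.6%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 90 | 7.2%
(missing) | 78 | 6.3%
(missing) | 33 | 2.7%
Other values (85) | 281 | 22.6%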

Interactions


Correlations

Variable | 연번 | 업종 | 상호명 | 영업소주소
연번 | 1.000 | 0.858 | 0.610 | 0.938
업종 | 0.858 | 1.000 | 0.000 | 0.000
상호명 | 0.610 | 0.000 | 1.000 | 0.998
영업소주소 | 0.938 | 0.000 | 0.998 | 1.000

Variable | 연번 | 업종
연번 | 1.000 | 0.658
업종 | 0.658 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 업종 | 상호명 | 영업소주소
0 | 1 | 배급업 | 제이램 | 광주광역시 북구 중흥동 683번지 14호
1 | 2 | 배급업 | 유한회사 제이이사컴퍼니 | 광주광역시 서구 화정동 23번지 16호
2 | 3 | 배급업 | 엠엔 트레코(MN TREKO) | 광주광역시 서구 화정동 400번지 16호 103동
3 | 4 | 배급업 | AN'T SOUND CONTENTS | 광주광역시 남구 사동 177번지 39호 광주영상예술센터
4 | 5 | 배급업 | (주)디자인허브 | 광주광역시 동구 금남로2가 20-2번지 무등빌딩 10층
5 | 6 | 제작업 | (주)전남일보 | 광주광역시 동구 대의동 39번지 1호
6 | 7 | 제작업 | 투투노래뮤비방 | 광주광역시 동구 산수동 522번지 80호
7 | 8 | 제작업 | 더키스톤 | 광주광역시 동구 중앙로160번길 청소년삶디자인센터 304호
8 | 9 | 제작업 | (주)시민의소리 | 광주광역시 동구 학동 71-3번지 영동빌딩
9 | 10 | 제작업 | 케이팝노래뮤비방 | 광주광역시 동구 학동 873번지 3호

Last rows
Index | 연번 | 업종 | 상호명 | 영업소주소
80 | 81 | 제작업 | 엔젤뮤비연습장 | 광주광역시 광산구 월곡동 547번지 5호
81 | 82 | 제작업 | 꼬셔봐노래뮤비방 | 광주광역시 광산구 월곡동 679번지 12호
82 | 83 | 제작업 | 오렌지뮤비노래방 | 광주광역시 광산구 월곡동 696번지 2호
83 | 84 | 제작업 | 노라조 노래영상제작실 | 광주광역시 광산구 장덕동 1467번지
84 | 85 | 제작업 | 체리노래뮤비방 | 광주광역시 광산구 장덕동 1677번지 0호
85 | 86 | 제작업 | 칸노래뮤비방 | 광주광역시 광산구 하남동 713번지
86 | 87 | 제작업 | 앵두노래뮤비방 | 광주광역시 광산구 하남동 779번지
87 | 88 | 제작업 | 우수노래뮤비방 | 광주광역시 광산구 하남동 780번지 0호
88 | 89 | 제작업 | 탑노래뮤비방 | 광주광역시 광산구 하남동 788번지
89 | 90 | 제작업 | 킹덤노래뮤비방 | 광주광역시 광산구 흑석동 607번지 0호