Overview

Dataset statistics

Number of variables4
Number of observations45
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory35.9 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description경상남도 중소기업협동조합(실크, 가구, 아스콘, 수퍼마켓, 공업 등) 현황과 관련된 자료로써, 조합명,주소,휴면여부에 관한 데이터를 제공합니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15068316

Alerts

연 번 is highly overall correlated with 휴면 여부High correlation
휴면 여부 is highly overall correlated with 연 번High correlation
휴면 여부 is highly imbalanced (56.7%)Imbalance
연 번 has unique valuesUnique
조합명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-08-15 04:26:46.129987
Analysis finished2023-08-15 04:26:47.117416
Duration0.99 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연 번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean23
Minimum1
Maximum45
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size537.0 B
2023-08-15T13:26:47.489922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.2
Q112
median23
Q334
95-th percentile42.8
Maximum45
Range44
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.133926
Coefficient of variation (CV)0.57104024
Kurtosis-1.2
Mean23
Median Absolute Deviation (MAD)11
Skewness0
Sum1035
Variance172.5
MonotonicityStrictly increasing
2023-08-15T13:26:48.142067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
1 1
 
2.2%
35 1
 
2.2%
26 1
 
2.2%
27 1
 
2.2%
28 1
 
2.2%
29 1
 
2.2%
30 1
 
2.2%
31 1
 
2.2%
32 1
 
2.2%
33 1
 
2.2%
Other values (35) 35
77.8%
ValueCountFrequency (%)
1 1
2.2%
2 1
2.2%
3 1
2.2%
4 1
2.2%
5 1
2.2%
6 1
2.2%
7 1
2.2%
8 1
2.2%
9 1
2.2%
10 1
2.2%
ValueCountFrequency (%)
45 1
2.2%
44 1
2.2%
43 1
2.2%
42 1
2.2%
41 1
2.2%
40 1
2.2%
39 1
2.2%
38 1
2.2%
37 1
2.2%
36 1
2.2%

조합명
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-08-15T13:26:48.670312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length13.266667
Min length8

Characters and Unicode

Total characters597
Distinct characters101
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row경남가스판매업협동조합
2nd row경남거제수퍼마켓협동조합
3rd row경남공예협동조합
4th row경남니트공업협동조합
5th row경남레미콘공업협동조합
ValueCountFrequency (%)
경남가스판매업협동조합 1
 
2.2%
경남서부패션인터넷사업협동조합 1
 
2.2%
경남거제시급식사업협동조합 1
 
2.2%
경남공동물류사업협동조합 1
 
2.2%
경남김해부산강서생활용품유통사업조합 1
 
2.2%
경남레미콘사업협동조합 1
 
2.2%
경남서부급식사업협동조합 1
 
2.2%
경남자동차재활용사업협동조합 1
 
2.2%
경남중부레미콘사업협동조합 1
 
2.2%
경남중부의류판매사업협동조합 1
 
2.2%
Other values (35) 35
77.8%
2023-08-15T13:26:49.430541image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48
 
8.0%
46
 
7.7%
45
 
7.5%
44
 
7.4%
42
 
7.0%
37
 
6.2%
36
 
6.0%
23
 
3.9%
18
 
3.0%
18
 
3.0%
Other values (91) 240
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 597
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
8.0%
46
 
7.7%
45
 
7.5%
44
 
7.4%
42
 
7.0%
37
 
6.2%
36
 
6.0%
23
 
3.9%
18
 
3.0%
18
 
3.0%
Other values (91) 240
40.2%

Most occurring scripts

ValueCountFrequency (%)
Hangul 597
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
8.0%
46
 
7.7%
45
 
7.5%
44
 
7.4%
42
 
7.0%
37
 
6.2%
36
 
6.0%
23
 
3.9%
18
 
3.0%
18
 
3.0%
Other values (91) 240
40.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 597
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
48
 
8.0%
46
 
7.7%
45
 
7.5%
44
 
7.4%
42
 
7.0%
37
 
6.2%
36
 
6.0%
23
 
3.9%
18
 
3.0%
18
 
3.0%
Other values (91) 240
40.2%

주소
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-08-15T13:26:49.833293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length32
Mean length25.222222
Min length15

Characters and Unicode

Total characters1135
Distinct characters141
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row경상남도 창원시 마산합포구 천하장사로 42-1 5층
2nd row경상남도 거제시 연초면 효촌길 124
3rd row경상남도 창원시 성산구 용지로111번길 3
4th row경상남도 창원시 마산회원구 회원동로 25
5th row경상남도 창원시 의창구 태복산로 7 번길 3
ValueCountFrequency (%)
경상남도 44
 
17.5%
창원시 24
 
9.5%
의창구 12
 
4.8%
진주시 9
 
3.6%
성산구 5
 
2.0%
마산합포구 4
 
1.6%
중앙대로 3
 
1.2%
김해시 3
 
1.2%
62 2
 
0.8%
남산로1번길 2
 
0.8%
Other values (127) 144
57.1%
2023-08-15T13:26:50.523769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
207
 
18.2%
52
 
4.6%
47
 
4.1%
45
 
4.0%
45
 
4.0%
45
 
4.0%
1 45
 
4.0%
39
 
3.4%
35
 
3.1%
30
 
2.6%
Other values (131) 545
48.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 729
64.2%
Space Separator 207
 
18.2%
Decimal Number 184
 
16.2%
Dash Punctuation 9
 
0.8%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
7.1%
47
 
6.4%
45
 
6.2%
45
 
6.2%
45
 
6.2%
39
 
5.3%
35
 
4.8%
30
 
4.1%
25
 
3.4%
24
 
3.3%
Other values (117) 342
46.9%
Decimal Number
ValueCountFrequency (%)
1 45
24.5%
2 25
13.6%
3 23
12.5%
4 17
 
9.2%
0 15
 
8.2%
8 14
 
7.6%
5 13
 
7.1%
9 11
 
6.0%
6 11
 
6.0%
7 10
 
5.4%
Space Separator
ValueCountFrequency (%)
207
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 729
64.2%
Common 406
35.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
7.1%
47
 
6.4%
45
 
6.2%
45
 
6.2%
45
 
6.2%
39
 
5.3%
35
 
4.8%
30
 
4.1%
25
 
3.4%
24
 
3.3%
Other values (117) 342
46.9%
Common
ValueCountFrequency (%)
207
51.0%
1 45
 
11.1%
2 25
 
6.2%
3 23
 
5.7%
4 17
 
4.2%
0 15
 
3.7%
8 14
 
3.4%
5 13
 
3.2%
9 11
 
2.7%
6 11
 
2.7%
Other values (4) 25
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 729
64.2%
ASCII 406
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
207
51.0%
1 45
 
11.1%
2 25
 
6.2%
3 23
 
5.7%
4 17
 
4.2%
0 15
 
3.7%
8 14
 
3.4%
5 13
 
3.2%
9 11
 
2.7%
6 11
 
2.7%
Other values (4) 25
 
6.2%
Hangul
ValueCountFrequency (%)
52
 
7.1%
47
 
6.4%
45
 
6.2%
45
 
6.2%
45
 
6.2%
39
 
5.3%
35
 
4.8%
30
 
4.1%
25
 
3.4%
24
 
3.3%
Other values (117) 342
46.9%

휴면 여부
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
<NA>
41 
휴면
 
4

Length

Max length4
Median length4
Mean length3.8222222
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 41
91.1%
휴면 4
 
8.9%

Length

2023-08-15T13:26:50.815684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-08-15T13:26:51.072183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 41
91.1%
휴면 4
 
8.9%

Interactions

2023-08-15T13:26:46.484519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-08-15T13:26:51.207016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연 번조합명주소
연 번1.0001.0001.000
조합명1.0001.0001.000
주소1.0001.0001.000
2023-08-15T13:26:51.456528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연 번휴면 여부
연 번1.0001.000
휴면 여부1.0001.000

Missing values

2023-08-15T13:26:46.710567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-08-15T13:26:46.907469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연 번조합명주소휴면 여부
01경남가스판매업협동조합경상남도 창원시 마산합포구 천하장사로 42-1 5층<NA>
12경남거제수퍼마켓협동조합경상남도 거제시 연초면 효촌길 124<NA>
23경남공예협동조합경상남도 창원시 성산구 용지로111번길 3<NA>
34경남니트공업협동조합경상남도 창원시 마산회원구 회원동로 25<NA>
45경남레미콘공업협동조합경상남도 창원시 의창구 태복산로 7 번길 3<NA>
56경남서부의류판매업협동조합경상남도 진주시 장대로 39휴면
67경남재활용업협동조합경상남도 창원시 마산합포구 몽고정길 119<NA>
78경남직물진주실크공업협동조합경상남도 진주시 문산읍 월아산로996번길 43<NA>
89경남콘크리트공업협동조합경상남도 창원시 의창구 도계두리길6번길 28<NA>
910부산울산경남아스콘공업협동조합경상남도 창원시 의창구 남산로1번길 62<NA>
연 번조합명주소휴면 여부
3536마산어시장활어사업협동조합경상남도 창원시 마산합포구 어시장8길 8 (신포동2가)<NA>
3637밀양자동차부품소재공단사업협동조합경상남도 밀양시 삼랑진읍 용전산업단지길 54<NA>
3738밀양하남기계소재공단사업협동조합경상남도 밀양시 하남읍 온천로 1842 2층<NA>
3839부울경신기술사업협동조합경상남도 김해시 주촌면 골든루트로 80-16 중소기업비즈니스센터 413호<NA>
3940부울경아스콘사업협동조합경상남도 진주시 사들로 26 행복빌딩 401호<NA>
4041양산시재생용사업협동조합경남 양산시 상북면 양산대로 1815<NA>
4142진해마천주물공단사업협동조합경상남도 창원시 진해구 남의로 32<NA>
4243창녕성산자동차부품사업협동조합경상남도 창녕군 성산면 후천공단길 32<NA>
4344통영선박기관수리공업사업협동조합경상남도 통영시 멘데산업길 79-12<NA>
4445통영중앙시장사업협동조합경상남도 통영시 중앙로 160<NA>