Overview

Dataset statistics

Number of variables5
Number of observations22
Missing cells2
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 KiB
Average record size in memory47.0 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description경상남도 내 정수기 관련업체 현황 데이터로 정수기 관련 업체명, 업체소재지, 제품명, 모델명 항목에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/3083310/fileData.do

Alerts

소재지 is highly overall correlated with 업체명High correlation
업체명 is highly overall correlated with 소재지High correlation
제품명 has 1 (4.5%) missing valuesMissing
모델명 has 1 (4.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:35:07.659777
Analysis finished2023-12-11 23:35:08.123558
Duration0.46 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.5
Minimum1
Maximum22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-12T08:35:08.174479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.05
Q16.25
median11.5
Q316.75
95-th percentile20.95
Maximum22
Range21
Interquartile range (IQR)10.5

Descriptive statistics

Standard deviation6.4935866
Coefficient of variation (CV)0.5646597
Kurtosis-1.2
Mean11.5
Median Absolute Deviation (MAD)5.5
Skewness0
Sum253
Variance42.166667
MonotonicityStrictly increasing
2023-12-12T08:35:08.272927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
1 1
 
4.5%
13 1
 
4.5%
22 1
 
4.5%
21 1
 
4.5%
20 1
 
4.5%
19 1
 
4.5%
18 1
 
4.5%
17 1
 
4.5%
16 1
 
4.5%
15 1
 
4.5%
Other values (12) 12
54.5%
ValueCountFrequency (%)
1 1
4.5%
2 1
4.5%
3 1
4.5%
4 1
4.5%
5 1
4.5%
6 1
4.5%
7 1
4.5%
8 1
4.5%
9 1
4.5%
10 1
4.5%
ValueCountFrequency (%)
22 1
4.5%
21 1
4.5%
20 1
4.5%
19 1
4.5%
18 1
4.5%
17 1
4.5%
16 1
4.5%
15 1
4.5%
14 1
4.5%
13 1
4.5%

업체명
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)31.8%
Missing0
Missing (%)0.0%
Memory size308.0 B
(주)진텍
13 
LG전자(주) 창원1공장
(주)태영이앤티
(주)유한시스템
 
1
(주)ACE전자
 
1
Other values (2)

Length

Max length13
Median length5
Mean length6.5909091
Min length4

Unique

Unique4 ?
Unique (%)18.2%

Sample

1st row(주)유한시스템
2nd row(주)ACE전자
3rd row(주)태영이앤티
4th row(주)태영이앤티
5th row(주)진텍

Common Values

ValueCountFrequency (%)
(주)진텍 13
59.1%
LG전자(주) 창원1공장 3
 
13.6%
(주)태영이앤티 2
 
9.1%
(주)유한시스템 1
 
4.5%
(주)ACE전자 1
 
4.5%
티엠시㈜ 1
 
4.5%
유림기업㈜ 1
 
4.5%

Length

2023-12-12T08:35:08.398241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:35:08.487839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주)진텍 13
52.0%
lg전자(주 3
 
12.0%
창원1공장 3
 
12.0%
주)태영이앤티 2
 
8.0%
주)유한시스템 1
 
4.0%
주)ace전자 1
 
4.0%
티엠시㈜ 1
 
4.0%
유림기업㈜ 1
 
4.0%

소재지
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)31.8%
Missing0
Missing (%)0.0%
Memory size308.0 B
경상남도 진영읍 하계로155번길 48
13 
경상남도 창원시 성산구 성산패총로 170
경상남도 양산시 웅상대로 908
경상남도 김해시 생림대로 49-1
 
1
경상남도 창원시 마산합포구 진북면 진북산업로 613
 
1
Other values (2)

Length

Max length28
Median length20
Mean length20.636364
Min length17

Unique

Unique4 ?
Unique (%)18.2%

Sample

1st row경상남도 김해시 생림대로 49-1
2nd row경상남도 창원시 마산합포구 진북면 진북산업로 613
3rd row경상남도 양산시 웅상대로 908
4th row경상남도 양산시 웅상대로 908
5th row경상남도 진영읍 하계로155번길 48

Common Values

ValueCountFrequency (%)
경상남도 진영읍 하계로155번길 48 13
59.1%
경상남도 창원시 성산구 성산패총로 170 3
 
13.6%
경상남도 양산시 웅상대로 908 2
 
9.1%
경상남도 김해시 생림대로 49-1 1
 
4.5%
경상남도 창원시 마산합포구 진북면 진북산업로 613 1
 
4.5%
경상남도 함안군 칠북면 삼칠로 1826 1
 
4.5%
경상남도 창원시 마산회원구 내서읍 광려천남로 59 1
 
4.5%

Length

2023-12-12T08:35:08.597751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T08:35:08.697402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경상남도 22
22.9%
하계로155번길 13
13.5%
48 13
13.5%
진영읍 13
13.5%
창원시 5
 
5.2%
성산구 3
 
3.1%
성산패총로 3
 
3.1%
170 3
 
3.1%
908 2
 
2.1%
웅상대로 2
 
2.1%
Other values (16) 17
17.7%

제품명
Text

MISSING 

Distinct21
Distinct (%)100.0%
Missing1
Missing (%)4.5%
Memory size308.0 B
2023-12-12T08:35:08.873235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length10
Mean length8.8095238
Min length5

Characters and Unicode

Total characters185
Distinct characters64
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row가족사랑냉온정수기
2nd row스퀘어정수기
3rd row레틴파워트정수기
4th rowUNIKUL냉온정수기
5th row현대렌탈서비스정수기
ValueCountFrequency (%)
정수기 4
 
12.9%
부착된 2
 
6.5%
냉장고에 2
 
6.5%
가족사랑냉온정수기 1
 
3.2%
현대렌탈케어냉온정수기 1
 
3.2%
코스타 1
 
3.2%
1
 
3.2%
lg정수기 1
 
3.2%
간이 1
 
3.2%
유버스정수기 1
 
3.2%
Other values (16) 16
51.6%
2023-12-12T08:35:09.144799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
11.9%
22
 
11.9%
21
 
11.4%
10
 
5.4%
9
 
4.9%
8
 
4.3%
7
 
3.8%
4
 
2.2%
3
 
1.6%
3
 
1.6%
Other values (54) 76
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 161
87.0%
Space Separator 10
 
5.4%
Uppercase Letter 10
 
5.4%
Decimal Number 3
 
1.6%
Dash Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
13.7%
22
 
13.7%
21
 
13.0%
9
 
5.6%
8
 
5.0%
7
 
4.3%
4
 
2.5%
3
 
1.9%
3
 
1.9%
2
 
1.2%
Other values (41) 60
37.3%
Uppercase Letter
ValueCountFrequency (%)
L 2
20.0%
U 2
20.0%
R 1
10.0%
G 1
10.0%
O 1
10.0%
N 1
10.0%
I 1
10.0%
K 1
10.0%
Decimal Number
ValueCountFrequency (%)
1 1
33.3%
5 1
33.3%
0 1
33.3%
Space Separator
ValueCountFrequency (%)
10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 161
87.0%
Common 14
 
7.6%
Latin 10
 
5.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
13.7%
22
 
13.7%
21
 
13.0%
9
 
5.6%
8
 
5.0%
7
 
4.3%
4
 
2.5%
3
 
1.9%
3
 
1.9%
2
 
1.2%
Other values (41) 60
37.3%
Latin
ValueCountFrequency (%)
L 2
20.0%
U 2
20.0%
R 1
10.0%
G 1
10.0%
O 1
10.0%
N 1
10.0%
I 1
10.0%
K 1
10.0%
Common
ValueCountFrequency (%)
10
71.4%
1 1
 
7.1%
5 1
 
7.1%
0 1
 
7.1%
- 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 161
87.0%
ASCII 24
 
13.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
13.7%
22
 
13.7%
21
 
13.0%
9
 
5.6%
8
 
5.0%
7
 
4.3%
4
 
2.5%
3
 
1.9%
3
 
1.9%
2
 
1.2%
Other values (41) 60
37.3%
ASCII
ValueCountFrequency (%)
10
41.7%
L 2
 
8.3%
U 2
 
8.3%
R 1
 
4.2%
1 1
 
4.2%
5 1
 
4.2%
0 1
 
4.2%
G 1
 
4.2%
O 1
 
4.2%
- 1
 
4.2%
Other values (3) 3
 
12.5%

모델명
Text

MISSING 

Distinct21
Distinct (%)100.0%
Missing1
Missing (%)4.5%
Memory size308.0 B
2023-12-12T08:35:09.339016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length10.380952
Min length4

Characters and Unicode

Total characters218
Distinct characters41
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st rowACE-2000
2nd rowTY-2004
3rd rowTY-2009
4th rowJCP-4011 외 5종
5th rowMLP-300H 외 1종
ValueCountFrequency (%)
1종 5
 
14.3%
4
 
11.4%
ace-2000 1
 
2.9%
jcp-8020 1
 
2.9%
khan 1
 
2.9%
57종 1
 
2.9%
ws400gw 1
 
2.9%
6종 1
 
2.9%
lt700(s 1
 
2.9%
adq736939 1
 
2.9%
Other values (18) 18
51.4%
2023-12-12T08:35:09.644815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 26
 
11.9%
- 15
 
6.9%
15
 
6.9%
1 14
 
6.4%
W 11
 
5.0%
2 11
 
5.0%
10
 
4.6%
10
 
4.6%
P 10
 
4.6%
3 7
 
3.2%
Other values (31) 89
40.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 83
38.1%
Uppercase Letter 80
36.7%
Other Letter 20
 
9.2%
Dash Punctuation 15
 
6.9%
Space Separator 15
 
6.9%
Lowercase Letter 3
 
1.4%
Close Punctuation 1
 
0.5%
Open Punctuation 1
 
0.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
W 11
13.8%
P 10
12.5%
S 6
 
7.5%
C 6
 
7.5%
A 6
 
7.5%
K 5
 
6.2%
R 3
 
3.8%
U 3
 
3.8%
N 3
 
3.8%
H 3
 
3.8%
Other values (12) 24
30.0%
Decimal Number
ValueCountFrequency (%)
0 26
31.3%
1 14
16.9%
2 11
13.3%
3 7
 
8.4%
5 6
 
7.2%
4 5
 
6.0%
7 5
 
6.0%
9 4
 
4.8%
6 3
 
3.6%
8 2
 
2.4%
Lowercase Letter
ValueCountFrequency (%)
n 1
33.3%
a 1
33.3%
h 1
33.3%
Other Letter
ValueCountFrequency (%)
10
50.0%
10
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 115
52.8%
Latin 83
38.1%
Hangul 20
 
9.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
W 11
13.3%
P 10
 
12.0%
S 6
 
7.2%
C 6
 
7.2%
A 6
 
7.2%
K 5
 
6.0%
R 3
 
3.6%
U 3
 
3.6%
N 3
 
3.6%
H 3
 
3.6%
Other values (15) 27
32.5%
Common
ValueCountFrequency (%)
0 26
22.6%
- 15
13.0%
15
13.0%
1 14
12.2%
2 11
9.6%
3 7
 
6.1%
5 6
 
5.2%
4 5
 
4.3%
7 5
 
4.3%
9 4
 
3.5%
Other values (4) 7
 
6.1%
Hangul
ValueCountFrequency (%)
10
50.0%
10
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 198
90.8%
Hangul 20
 
9.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 26
 
13.1%
- 15
 
7.6%
15
 
7.6%
1 14
 
7.1%
W 11
 
5.6%
2 11
 
5.6%
P 10
 
5.1%
3 7
 
3.5%
S 6
 
3.0%
5 6
 
3.0%
Other values (29) 77
38.9%
Hangul
ValueCountFrequency (%)
10
50.0%
10
50.0%

Interactions

2023-12-12T08:35:07.860910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T08:35:09.835691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명소재지제품명모델명
연번1.0000.1940.1941.0001.000
업체명0.1941.0001.0001.0001.000
소재지0.1941.0001.0001.0001.000
제품명1.0001.0001.0001.0001.000
모델명1.0001.0001.0001.0001.000
2023-12-12T08:35:09.965171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지업체명
소재지1.0001.000
업체명1.0001.000
2023-12-12T08:35:10.079104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명소재지
연번1.0000.0000.000
업체명0.0001.0001.000
소재지0.0001.0001.000

Missing values

2023-12-12T08:35:07.943667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:35:08.019118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T08:35:08.087573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업체명소재지제품명모델명
01(주)유한시스템경상남도 김해시 생림대로 49-1<NA><NA>
12(주)ACE전자경상남도 창원시 마산합포구 진북면 진북산업로 613가족사랑냉온정수기ACE-2000
23(주)태영이앤티경상남도 양산시 웅상대로 908스퀘어정수기TY-2004
34(주)태영이앤티경상남도 양산시 웅상대로 908레틴파워트정수기TY-2009
45(주)진텍경상남도 진영읍 하계로155번길 48UNIKUL냉온정수기JCP-4011 외 5종
56(주)진텍경상남도 진영읍 하계로155번길 48현대렌탈서비스정수기MLP-300H 외 1종
67(주)진텍경상남도 진영읍 하계로155번길 48청호나이스정수기WP-35S90010N외 1종
78(주)진텍경상남도 진영읍 하계로155번길 48애터미용기형간이정수기AWFP-KR22L외 1종
89(주)진텍경상남도 진영읍 하계로155번길 48애터미 올-케어 정수기AWP-KR22
910(주)진텍경상남도 진영읍 하계로155번길 48웰스냉온정수기WM171UWA외 1종
연번업체명소재지제품명모델명
1213(주)진텍경상남도 진영읍 하계로155번길 48진텍냉온정수기JCP-8020
1314(주)진텍경상남도 진영읍 하계로155번길 48현대렌탈케어냉온정수기HP-830C외 14종
1415(주)진텍경상남도 진영읍 하계로155번길 48청호직수 정수기150WP-15C6500N
1516(주)진텍경상남도 진영읍 하계로155번길 48진텍정수기JCP-K4-2
1617(주)진텍경상남도 진영읍 하계로155번길 48유버스정수기UBUS-P20HNF
1718LG전자(주) 창원1공장경상남도 창원시 성산구 성산패총로 170냉장고에 부착된 간이 정수기ADQ736939
1819LG전자(주) 창원1공장경상남도 창원시 성산구 성산패총로 170냉장고에 부착된 정수기LT700(S) 외 6종
1920LG전자(주) 창원1공장경상남도 창원시 성산구 성산패총로 170LG정수기WS400GW 외 57종
2021티엠시㈜경상남도 함안군 칠북면 삼칠로 1826칸 정수기Khan
2122유림기업㈜경상남도 창원시 마산회원구 내서읍 광려천남로 59코스타 간이정수기KOS-300