Overview

Dataset statistics

Number of variables4
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory35.5 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description전라남도 광양시에 위치한 스마트 공장 현황입니다. 기업명, 위치(읍면동 or 산단), 생산품(업종) 목록을 제공합니다.
URLhttps://www.data.go.kr/data/15105086/fileData.do

Alerts

연번 has unique valuesUnique
기업명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:52:31.792132
Analysis finished2023-12-12 11:52:32.434682
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.5
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-12T20:52:32.537315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.55
Q113.75
median26.5
Q339.25
95-th percentile49.45
Maximum52
Range51
Interquartile range (IQR)25.5

Descriptive statistics

Standard deviation15.154757
Coefficient of variation (CV)0.57187763
Kurtosis-1.2
Mean26.5
Median Absolute Deviation (MAD)13
Skewness0
Sum1378
Variance229.66667
MonotonicityStrictly increasing
2023-12-12T20:52:32.701346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
28 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%
43 1
1.9%

기업명
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-12T20:52:33.012095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length9.5
Mean length6.5192308
Min length2

Characters and Unicode

Total characters339
Distinct characters122
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row엠알씨
2nd row㈜한창산업
3rd row조선내화(주)
4th row한성철강공업(주)
5th row지본
ValueCountFrequency (%)
광양공장 3
 
5.5%
엠알씨 1
 
1.8%
㈜에쓰큐씨 1
 
1.8%
주)영진이엔지 1
 
1.8%
주)대신이엔에스 1
 
1.8%
주)유원산업기술 1
 
1.8%
이에스바이오㈜ 1
 
1.8%
시이케어서비스㈜ 1
 
1.8%
주)라인이엔지 1
 
1.8%
해덕세라믹 1
 
1.8%
Other values (43) 43
78.2%
2023-12-12T20:52:33.538726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
7.1%
18
 
5.3%
15
 
4.4%
( 14
 
4.1%
) 14
 
4.1%
11
 
3.2%
10
 
2.9%
10
 
2.9%
10
 
2.9%
9
 
2.7%
Other values (112) 204
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 284
83.8%
Other Symbol 24
 
7.1%
Open Punctuation 14
 
4.1%
Close Punctuation 14
 
4.1%
Space Separator 3
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
6.3%
15
 
5.3%
11
 
3.9%
10
 
3.5%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
8
 
2.8%
6
 
2.1%
Other values (108) 179
63.0%
Other Symbol
ValueCountFrequency (%)
24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 308
90.9%
Common 31
 
9.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
7.8%
18
 
5.8%
15
 
4.9%
11
 
3.6%
10
 
3.2%
10
 
3.2%
10
 
3.2%
9
 
2.9%
8
 
2.6%
8
 
2.6%
Other values (109) 185
60.1%
Common
ValueCountFrequency (%)
( 14
45.2%
) 14
45.2%
3
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 284
83.8%
ASCII 31
 
9.1%
None 24
 
7.1%

Most frequent character per block

None
ValueCountFrequency (%)
24
100.0%
Hangul
ValueCountFrequency (%)
18
 
6.3%
15
 
5.3%
11
 
3.9%
10
 
3.5%
10
 
3.5%
10
 
3.5%
9
 
3.2%
8
 
2.8%
8
 
2.8%
6
 
2.1%
Other values (108) 179
63.0%
ASCII
ValueCountFrequency (%)
( 14
45.2%
) 14
45.2%
3
 
9.7%

업종
Text

Distinct47
Distinct (%)90.4%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-12T20:52:33.918276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length16
Mean length12.173077
Min length4

Characters and Unicode

Total characters633
Distinct characters117
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)82.7%

Sample

1st row금속원료 재생
2nd row탭, 밸브 및 유사 장치 제조
3rd row구조용 정형 내화 제품 제조
4th row금속 조립 구조재 제조
5th row그 외 기타 전자부품 제조
ValueCountFrequency (%)
제조 41
 
20.1%
17
 
8.3%
기타 8
 
3.9%
금속 6
 
2.9%
제품 6
 
2.9%
5
 
2.5%
5
 
2.5%
구조재 4
 
2.0%
조립 3
 
1.5%
철강 3
 
1.5%
Other values (88) 106
52.0%
2023-12-12T20:52:34.455373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
153
24.2%
64
 
10.1%
58
 
9.2%
21
 
3.3%
17
 
2.7%
17
 
2.7%
16
 
2.5%
14
 
2.2%
12
 
1.9%
9
 
1.4%
Other values (107) 252
39.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 471
74.4%
Space Separator 153
 
24.2%
Other Punctuation 5
 
0.8%
Decimal Number 2
 
0.3%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
13.6%
58
 
12.3%
21
 
4.5%
17
 
3.6%
17
 
3.6%
16
 
3.4%
14
 
3.0%
12
 
2.5%
9
 
1.9%
8
 
1.7%
Other values (101) 235
49.9%
Other Punctuation
ValueCountFrequency (%)
, 3
60.0%
· 2
40.0%
Space Separator
ValueCountFrequency (%)
153
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 471
74.4%
Common 162
 
25.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
13.6%
58
 
12.3%
21
 
4.5%
17
 
3.6%
17
 
3.6%
16
 
3.4%
14
 
3.0%
12
 
2.5%
9
 
1.9%
8
 
1.7%
Other values (101) 235
49.9%
Common
ValueCountFrequency (%)
153
94.4%
, 3
 
1.9%
1 2
 
1.2%
· 2
 
1.2%
( 1
 
0.6%
) 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 471
74.4%
ASCII 160
 
25.3%
None 2
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
153
95.6%
, 3
 
1.9%
1 2
 
1.2%
( 1
 
0.6%
) 1
 
0.6%
Hangul
ValueCountFrequency (%)
64
 
13.6%
58
 
12.3%
21
 
4.5%
17
 
3.6%
17
 
3.6%
16
 
3.4%
14
 
3.0%
12
 
2.5%
9
 
1.9%
8
 
1.7%
Other values (101) 235
49.9%
None
ValueCountFrequency (%)
· 2
100.0%

위치
Categorical

Distinct18
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Memory size548.0 B
태인동(국가산단)
초남공단
진월면
익신산단
도이동(동측배후)
Other values (13)
22 

Length

Max length9
Median length4
Mean length5.2692308
Min length2

Unique

Unique8 ?
Unique (%)15.4%

Sample

1st row금호동(국가산단)
2nd row익신산단
3rd row태인동(국가산단)
4th row초남공단
5th row중동

Common Values

ValueCountFrequency (%)
태인동(국가산단) 9
17.3%
초남공단 7
13.5%
진월면 6
11.5%
익신산단 4
7.7%
도이동(동측배후) 4
7.7%
신금산단 4
7.7%
금호동(국가산단) 3
 
5.8%
광양읍 3
 
5.8%
태인동 2
 
3.8%
옥곡면 2
 
3.8%
Other values (8) 8
15.4%

Length

2023-12-12T20:52:34.665118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
태인동(국가산단 9
17.3%
초남공단 7
13.5%
진월면 6
11.5%
익신산단 4
7.7%
도이동(동측배후 4
7.7%
신금산단 4
7.7%
금호동(국가산단 3
 
5.8%
광양읍 3
 
5.8%
옥곡면 2
 
3.8%
태인동 2
 
3.8%
Other values (8) 8
15.4%

Interactions

2023-12-12T20:52:32.137928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:52:34.772542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번기업명업종위치
연번1.0001.0000.8550.000
기업명1.0001.0001.0001.000
업종0.8551.0001.0000.989
위치0.0001.0000.9891.000
2023-12-12T20:52:34.921106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위치
연번1.0000.000
위치0.0001.000

Missing values

2023-12-12T20:52:32.289778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:52:32.393836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번기업명업종위치
01엠알씨금속원료 재생금호동(국가산단)
12㈜한창산업탭, 밸브 및 유사 장치 제조익신산단
23조선내화(주)구조용 정형 내화 제품 제조태인동(국가산단)
34한성철강공업(주)금속 조립 구조재 제조초남공단
45지본그 외 기타 전자부품 제조중동
56대동강업 광양공장그 외 기타 1차 철강 제조신금산단
67㈜교보스틸1차 금속제품 도매진월면
78삼우에코금속 조립 구조재 제조초남공단
89삼표시멘트 광양공장비금속 광물 제품 제조태인동(국가산단)
910부국산업(주)입철(환원에 의해 제조된 금속)금호동(국가산단)
연번기업명업종위치
4243㈜경덕유기질 비료 및 상토 제조도이동(동측배후)
4344㈜금홍기업그 외 기타 분류 안된 비금속 광물 제품 제조초남공단
4445라엔텍육상 금속 골조 구조재 제조옥곡면
4546㈜사카팬코리아산업용 그 외 비경화 고무제품 제조진월면
4647㈜원희기타 비철금속 제련, 정련 및 합금 제조태인동
4748㈜파워엔지니어링배전반 및 전기 자동제어반 제조익신산단
4849광양청매실농원영농조합법인과실·채소 가공 및 저장 처리다압면
4950(주)에스이엔티금속탱크 및 저장용기 제조초남공단
5051(주)창조간판 및 광고물 제조태인동
5152이앤티스소재(주)광양지점금속류 원료 재생진월면