Overview

Dataset statistics

Number of variables: 5
Number of observations: 48
Missing cells: 1
Missing cells (%): 0.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 43.8 B
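
The missing-cell percentage follows directly from the counts above: one missing cell out of 5 × 48 cells. A minimal sketch of that arithmetic, using only the figures reported in this section (the variable names are illustrative):

```python
# Figures taken from the dataset statistics above.
n_variables = 5
n_observations = 48
missing_cells = 1

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Rounded to one decimal place, the result agrees with the reported 0.4%.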

Variable types

Numeric: 1
Text: 4

Dataset

Description: Lists the companies located in the Sapjin Industrial Complex (삽진산업단지) in Mokpo-si, giving each company's name, address, telephone number, and main products.
URL: https://www.data.go.kr/data/3040508/fileData.do

Alerts

전화번호 (telephone number) has 1 (2.1%) missing value [Missing]
연번 (serial number) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 05:41:05.290709
Analysis finished: 2023-12-12 05:41:06.039164
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
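
The reported duration can be checked against the two timestamps. The sketch below parses them with the standard library and takes the difference. The format string is an assumption based on how the timestamps are printed here.

```python
from datetime import datetime

# Timestamp layout as printed in the report (assumed format string).
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 05:41:05.290709", FMT)
finished = datetime.strptime("2023-12-12 05:41:06.039164", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```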

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 48
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.5
Minimum: 1
Maximum: 48
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 564.0 B

Quantile statistics

Minimum: 1
5th percentile: 3.35
Q1: 12.75
Median: 24.5
Q3: 36.25
95th percentile: 45.65
Maximum: 48
Range: 47
Interquartile range (IQR): 23.5
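
The quantiles above are consistent with linear interpolation between neighbouring order statistics. The sketch below assumes the 연번 column holds the integers 1 through 48 (which fits the reported minimum, maximum, distinct count and strict monotonicity), and the `quantile` helper is illustrative, not taken from the profiling library.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    position = (len(sorted_values) - 1) * q
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

# Assumed column content: the serial numbers 1..48, already sorted.
values = list(range(1, 49))

q1 = quantile(values, 0.25)
q3 = quantile(values, 0.75)
iqr = q3 - q1

for label, q in [("5th percentile", 0.05), ("Q1", 0.25), ("Median", 0.5),
                 ("Q3", 0.75), ("95th percentile", 0.95)]:
    print(f"{label}: {quantile(values, q):.2f}")
print(f"IQR: {iqr:.2f}")
```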

Descriptive statistics

Standard deviation: 14
Coefficient of variation (CV): 0.57142857
Kurtosis: -1.2
Mean: 24.5
Median Absolute Deviation (MAD): 12
Skewness: 0
Sum: 1176
Variance: 196
Monotonicity: Strictly increasing
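
Several of these figures can be reproduced with Python's `statistics` module, again assuming the column holds the integers 1 through 48. The reported variance of 196 matches the sample variance (n - 1 denominator); the population variance of the same values would be slightly smaller.

```python
import statistics

# Assumed column content: the serial numbers 1..48.
values = list(range(1, 49))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, std_dev, round(cv, 8), mad)
```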
Histogram with fixed size bins (bins=48)
Value | Count | Frequency (%)
1 | 1 | 2.1%
26 | 1 | 2.1%
28 | 1 | 2.1%
29 | 1 | 2.1%
30 | 1 | 2.1%
31 | 1 | 2.1%
32 | 1 | 2.1%
33 | 1 | 2.1%
34 | 1 | 2.1%
35 | 1 | 2.1%
Other values (38) | 38 | 79.2%

Value | Count | Frequency (%)
1 | 1 | 2.1%
2 | 1 | 2.1%
3 | 1 | 2.1%
4 | 1 | 2.1%
5 | 1 | 2.1%
6 | 1 | 2.1%
7 | 1 | 2.1%
8 | 1 | 2.1%
9 | 1 | 2.1%
10 | 1 | 2.1%

Value | Count | Frequency (%)
48 | 1 | 2.1%
47 | 1 | 2.1%
46 | 1 | 2.1%
45 | 1 | 2.1%
44 | 1 | 2.1%
43 | 1 | 2.1%
42 | 1 | 2.1%
41 | 1 | 2.1%
40 | 1 | 2.1%
39 | 1 | 2.1%

회사명 (company name)
Text

Distinct: 45
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 12
Median length: 9.5
Mean length: 6.8125
Min length: 2

Characters and Unicode

Total characters: 327
Distinct characters: 100
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 42
Unique (%): 87.5%

Sample

1st row: (유)남양조선소
2nd row: (유)대양조선
3rd row: (유)동양조선소
4th row: (유)미래로조선
5th row: (유)부산프로펠러공사
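
The category counts reported for this column come from the Unicode general category of each character. As a rough illustration, the sketch below tallies categories for the five sample rows above using the standard `unicodedata` module. It is not the profiling library's own code, and the `category_counts` helper is hypothetical.

```python
import unicodedata
from collections import Counter

# The five sample company names shown above.
names = [
    "(유)남양조선소",
    "(유)대양조선",
    "(유)동양조선소",
    "(유)미래로조선",
    "(유)부산프로펠러공사",
]

def category_counts(texts):
    """Count characters by Unicode general category abbreviation."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)

counts = category_counts(names)
# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation
for category, count in counts.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter (Lo), which is why that category dominates the report for this column.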

Value | Count | Frequency (%)
주식회사 | 4 | 7.4%
서울엔지니어링 | 2 | 3.7%
신원산업 | 2 | 3.7%
씨앤중공업 | 2 | 3.7%
푸로테크 | 2 | 3.7%
형제중공업 | 1 | 1.9%
유한회사 | 1 | 1.9%
현대마린 | 1 | 1.9%
전남디젤 | 1 | 1.9%
서해안농기계 | 1 | 1.9%
Other values (37) | 37 | 68.5%

Most occurring characters

ValueCountFrequency (%)
) 16
 
4.9%
( 16
 
4.9%
14
 
4.3%
13
 
4.0%
11
 
3.4%
10
 
3.1%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.4%
Other values (90) 211
64.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 288 | 88.1%
Close Punctuation | 16 | 4.9%
Open Punctuation | 16 | 4.9%
Space Separator | 6 | 1.8%
Decimal Number | 1 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
4.9%
13
 
4.5%
11
 
3.8%
10
 
3.5%
10
 
3.5%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
Other values (86) 190
66.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 288 | 88.1%
Common | 39 | 11.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
4.9%
13
 
4.5%
11
 
3.8%
10
 
3.5%
10
 
3.5%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
Other values (86) 190
66.0%
Common
ValueCountFrequency (%)
) 16
41.0%
( 16
41.0%
6
 
15.4%
2 1
 
2.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 288 | 88.1%
ASCII | 39 | 11.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 16
41.0%
( 16
41.0%
6
 
15.4%
2 1
 
2.6%
Hangul
ValueCountFrequency (%)
14
 
4.9%
13
 
4.5%
11
 
3.8%
10
 
3.5%
10
 
3.5%
9
 
3.1%
9
 
3.1%
8
 
2.8%
7
 
2.4%
7
 
2.4%
Other values (86) 190
66.0%

주소 (address)
Text

Distinct: 31
Distinct (%): 64.6%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 37
Median length: 35
Mean length: 25.729167
Min length: 19

Characters and Unicode

Total characters: 1235
Distinct characters: 52
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 22
Unique (%): 45.8%

Sample

1st row: 전라남도 목포시 삽진산단로 106(연산동)
2nd row: 전라남도 목포시 삽진산단로 112 (연산동) 외 1필지
3rd row: 전라남도 목포시 삽진산단로 106(연산동, 신안조선)
4th row: 전라남도 목포시 삽진산단로 89 (연산동)
5th row: 전라남도 목포시 삽진산단로 103 (연산동)
ValueCountFrequency (%)
전라남도 48
18.5%
목포시 48
18.5%
삽진산단로 46
17.7%
연산동 46
17.7%
89 6
 
2.3%
6
 
2.3%
1필지 5
 
1.9%
78 4
 
1.5%
60 4
 
1.5%
89-1 3
 
1.2%
Other values (33) 44
16.9%

Most occurring characters

ValueCountFrequency (%)
213
 
17.2%
94
 
7.6%
48
 
3.9%
48
 
3.9%
) 48
 
3.9%
48
 
3.9%
( 48
 
3.9%
48
 
3.9%
48
 
3.9%
48
 
3.9%
Other values (42) 544
44.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 779 | 63.1%
Space Separator | 213 | 17.2%
Decimal Number | 127 | 10.3%
Close Punctuation | 48 | 3.9%
Open Punctuation | 48 | 3.9%
Dash Punctuation | 12 | 1.0%
Other Punctuation | 8 | 0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
12.1%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
Other values (27) 253
32.5%
Decimal Number
ValueCountFrequency (%)
8 24
18.9%
1 23
18.1%
9 20
15.7%
6 14
11.0%
0 12
9.4%
2 10
7.9%
3 9
 
7.1%
7 8
 
6.3%
4 6
 
4.7%
5 1
 
0.8%
Space Separator
ValueCountFrequency (%)
213
100.0%
Close Punctuation
ValueCountFrequency (%)
) 48
100.0%
Open Punctuation
ValueCountFrequency (%)
( 48
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 779 | 63.1%
Common | 456 | 36.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
12.1%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
Other values (27) 253
32.5%
Common
ValueCountFrequency (%)
213
46.7%
) 48
 
10.5%
( 48
 
10.5%
8 24
 
5.3%
1 23
 
5.0%
9 20
 
4.4%
6 14
 
3.1%
- 12
 
2.6%
0 12
 
2.6%
2 10
 
2.2%
Other values (5) 32
 
7.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 779 | 63.1%
ASCII | 456 | 36.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
213
46.7%
) 48
 
10.5%
( 48
 
10.5%
8 24
 
5.3%
1 23
 
5.0%
9 20
 
4.4%
6 14
 
3.1%
- 12
 
2.6%
0 12
 
2.6%
2 10
 
2.2%
Other values (5) 32
 
7.0%
Hangul
ValueCountFrequency (%)
94
 
12.1%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
48
 
6.2%
Other values (27) 253
32.5%

전화번호 (telephone number)
Text

MISSING 

Distinct: 40
Distinct (%): 85.1%
Missing: 1
Missing (%): 2.1%
Memory size: 516.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 564
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 72.3%

Sample

1st row: 061-244-0834
2nd row: 061-276-9090
3rd row: 061-272-0834
4th row: 061-247-9911
5th row: 061-279-3712
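
Every non-missing value is 12 characters long and uses only digits and hyphens, which fits a fixed NNN-NNN-NNNN layout. The sketch below checks the five sample rows against that pattern. The 3-3-4 grouping is an assumption inferred from the samples, and `is_valid_phone` is a hypothetical helper.

```python
import re

# Assumed layout: three digits, three digits, four digits, hyphen-separated.
PHONE_PATTERN = re.compile(r"[0-9]{3}-[0-9]{3}-[0-9]{4}")

# The five sample telephone numbers shown above.
samples = [
    "061-244-0834",
    "061-276-9090",
    "061-272-0834",
    "061-247-9911",
    "061-279-3712",
]

def is_valid_phone(value):
    """Return True when the whole string matches the assumed layout."""
    return PHONE_PATTERN.fullmatch(value) is not None

all_valid = all(is_valid_phone(s) for s in samples)
print(all_valid)
```

A check like this could flag rows that deviate from the layout before the column is used for matching or deduplication.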

Value | Count | Frequency (%)
061-279-0947 | 3 | 6.4%
061-284-1433 | 2 | 4.3%
061-274-6558 | 2 | 4.3%
061-278-4411 | 2 | 4.3%
061-276-9090 | 2 | 4.3%
061-274-2665 | 2 | 4.3%
061-270-7000 | 1 | 2.1%
061-277-0482 | 1 | 2.1%
061-244-0834 | 1 | 2.1%
061-278-0811 | 1 | 2.1%
Other values (30) | 30 | 63.8%
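
The Distinct and Unique figures for this column measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. Both percentages appear to be taken over the 47 non-missing rows. The sketch below rebuilds the counts from the frequency table above (one number occurring 3 times, five occurring twice, and 34 occurring once).

```python
# Occurrence counts read from the frequency table:
# one number appears 3 times, five appear twice, and 34 appear once.
occurrences = [3] + [2] * 5 + [1] * 34

non_missing = sum(occurrences)                   # rows with a telephone number
distinct = len(occurrences)                      # different values present
unique = sum(1 for n in occurrences if n == 1)   # values seen exactly once

distinct_pct = 100 * distinct / non_missing
unique_pct = 100 * unique / non_missing

print(non_missing, distinct, unique)
print(f"{distinct_pct:.1f}% distinct, {unique_pct:.1f}% unique")
```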

Most occurring characters

ValueCountFrequency (%)
- 94
16.7%
0 87
15.4%
1 71
12.6%
6 70
12.4%
2 63
11.2%
7 58
10.3%
4 43
7.6%
8 26
 
4.6%
9 22
 
3.9%
3 17
 
3.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 470 | 83.3%
Dash Punctuation | 94 | 16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 87
18.5%
1 71
15.1%
6 70
14.9%
2 63
13.4%
7 58
12.3%
4 43
9.1%
8 26
 
5.5%
9 22
 
4.7%
3 17
 
3.6%
5 13
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 564 | 100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 94
16.7%
0 87
15.4%
1 71
12.6%
6 70
12.4%
2 63
11.2%
7 58
10.3%
4 43
7.6%
8 26
 
4.6%
9 22
 
3.9%
3 17
 
3.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 564 | 100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 94
16.7%
0 87
15.4%
1 71
12.6%
6 70
12.4%
2 63
11.2%
7 58
10.3%
4 43
7.6%
8 26
 
4.6%
9 22
 
3.9%
3 17
 
3.0%

주요 생산품 (main products)
Text

Distinct: 35
Distinct (%): 72.9%
Missing: 0
Missing (%): 0.0%
Memory size: 516.0 B

Length

Max length: 49
Median length: 25
Mean length: 10.145833
Min length: 2

Characters and Unicode

Total characters: 487
Distinct characters: 103
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 64.6%

Sample

1st row: 선박 건조 및 수리
2nd row: FRP 선박건조
3rd row: 선박 건조 및 수리
4th row: 강선건조및선박
5th row: 프로펠라 및 축계
ValueCountFrequency (%)
선박부분품 12
 
10.3%
12
 
10.3%
선박 5
 
4.3%
선박건조 5
 
4.3%
수리 4
 
3.4%
건조 3
 
2.6%
선박용 3
 
2.6%
선박구성부분품 2
 
1.7%
샤후트 2
 
1.7%
펌프 2
 
1.7%
Other values (58) 67
57.3%

Most occurring characters

ValueCountFrequency (%)
70
 
14.4%
42
 
8.6%
37
 
7.6%
, 22
 
4.5%
21
 
4.3%
19
 
3.9%
15
 
3.1%
14
 
2.9%
13
 
2.7%
13
 
2.7%
Other values (93) 221
45.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 376 | 77.2%
Space Separator | 70 | 14.4%
Other Punctuation | 26 | 5.3%
Uppercase Letter | 13 | 2.7%
Open Punctuation | 1 | 0.2%
Close Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
11.2%
37
 
9.8%
21
 
5.6%
19
 
5.1%
15
 
4.0%
14
 
3.7%
13
 
3.5%
13
 
3.5%
13
 
3.5%
9
 
2.4%
Other values (83) 180
47.9%
Uppercase Letter
ValueCountFrequency (%)
P 4
30.8%
F 3
23.1%
R 2
15.4%
T 2
15.4%
O 2
15.4%
Other Punctuation
ValueCountFrequency (%)
, 22
84.6%
. 4
 
15.4%
Space Separator
ValueCountFrequency (%)
70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 376 | 77.2%
Common | 98 | 20.1%
Latin | 13 | 2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
11.2%
37
 
9.8%
21
 
5.6%
19
 
5.1%
15
 
4.0%
14
 
3.7%
13
 
3.5%
13
 
3.5%
13
 
3.5%
9
 
2.4%
Other values (83) 180
47.9%
Common
ValueCountFrequency (%)
70
71.4%
, 22
 
22.4%
. 4
 
4.1%
( 1
 
1.0%
) 1
 
1.0%
Latin
ValueCountFrequency (%)
P 4
30.8%
F 3
23.1%
R 2
15.4%
T 2
15.4%
O 2
15.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 376 | 77.2%
ASCII | 111 | 22.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70
63.1%
, 22
 
19.8%
P 4
 
3.6%
. 4
 
3.6%
F 3
 
2.7%
R 2
 
1.8%
T 2
 
1.8%
O 2
 
1.8%
( 1
 
0.9%
) 1
 
0.9%
Hangul
ValueCountFrequency (%)
42
 
11.2%
37
 
9.8%
21
 
5.6%
19
 
5.1%
15
 
4.0%
14
 
3.7%
13
 
3.5%
13
 
3.5%
13
 
3.5%
9
 
2.4%
Other values (83) 180
47.9%

Interactions


Correlations

Variable | 연번 | 회사명 | 주소 | 전화번호 | 주요 생산품
연번 | 1.000 | 1.000 | 0.541 | 0.942 | 0.780
회사명 | 1.000 | 1.000 | 0.917 | 0.995 | 0.983
주소 | 0.541 | 0.917 | 1.000 | 0.961 | 0.946
전화번호 | 0.942 | 0.995 | 0.961 | 1.000 | 0.000
주요 생산품 | 0.780 | 0.983 | 0.946 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 회사명 | 주소 | 전화번호 | 주요 생산품
0 | 1 | (유)남양조선소 | 전라남도 목포시 삽진산단로 106(연산동) | 061-244-0834 | 선박 건조 및 수리
1 | 2 | (유)대양조선 | 전라남도 목포시 삽진산단로 112 (연산동) 외 1필지 | 061-276-9090 | FRP 선박건조
2 | 3 | (유)동양조선소 | 전라남도 목포시 삽진산단로 106(연산동, 신안조선) | 061-272-0834 | 선박 건조 및 수리
3 | 4 | (유)미래로조선 | 전라남도 목포시 삽진산단로 89 (연산동) | 061-247-9911 | 강선건조및선박
4 | 5 | (유)부산프로펠러공사 | 전라남도 목포시 삽진산단로 103 (연산동) | 061-279-3712 | 프로펠라 및 축계
5 | 6 | (유)제이에이치해양 | 전라남도 목포시 삽진산단로 78 (연산동) | 061-274-3001 | 선박건조, 수리 외
6 | 7 | (유)풍양조선 | 전라남도 목포시 삽진산단로 107 (연산동) | 061-276-9090 | 합성수지(FRP) 건조 및 수리
7 | 8 | (유)한국엔지니어링 | 전라남도 목포시 삽진산단로 60 (연산동, 한국엔지니어링) | 061-278-6777 | 선박용 에어탱크, 소음기, 냉기, 펌프 샤후트, 배기메니폴드
8 | 9 | (주)대불조선 제2공장 | 전라남도 목포시 삽진산단로 60 (연산동, 한국엔지니어링) | 061-461-0888 | 선박건조.보트.요트
9 | 10 | (주)신호조선 | 전라남도 목포시 삽진산단로 106 (연산동, 신안조선) | 061-285-0685 | 선박

(index) | 연번 | 회사명 | 주소 | 전화번호 | 주요 생산품
38 | 39 | 주식회사 에스에스테크 | 전라남도 목포시 삽진산단로 60 (연산동) | 061-278-6778 | 펌프 샤후트 등
39 | 40 | 주식회사 우성기공 | 전라남도 목포시 삽진산단로 89-2 (연산동) | 061-276-0834 | 벨트, 스크류, 체인 컨베이어
40 | 41 | 평화기공사 | 전라남도 목포시 삽진산단로 89-4 (연산동) | 061-277-7565 | 선박부분품
41 | 42 | 푸로테크 | 전라남도 목포시 삽진산단로 99 (연산동) | 061-274-6558 | 선박용 프로펠러 추진축계 장치 및 러더 제작
42 | 43 | 하나디젤기공사 | 전라남도 목포시 삽진산단로 89-1 (연산동) | 061-276-1969 | 선박부분품
43 | 44 | 한성디젤기공 | 전라남도 목포시 삽진산단로 89 (연산동) | 061-274-2665 | 선박부분품, 레저 및 선박엔진, 기계부품
44 | 45 | 한진기공사 | 전라남도 목포시 삽진산단로 28 (연산동) | 061-244-8009 | 해수펌프, 배기 메니폴더, P.T.O
45 | 46 | 현대마린 | 전라남도 목포시 삽진산단로 89 (연산동) | 061-244-6591 | 선박부분품
46 | 47 | 형제중공업 | 전라남도 목포시 삽진산단로 46-2 (연산동) | 061-276-2242 | 선박 엔진 및 부품
47 | 48 | 화성공업사 | 전라남도 목포시 삽진산단로 46 (연산동) | 061-277-2010 | 선박구성부분품