Overview

Dataset statistics

Number of variables: 12
Number of observations: 574
Missing cells: 176
Missing cells (%): 2.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 56.2 KiB
Average record size in memory: 100.2 B
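
The percentage above follows from the counts. A minimal Python sketch of the arithmetic, assuming "Missing cells (%)" is the number of missing cells divided by variables times observations; the function name is illustrative and not part of ydata-profiling:

```python
def missing_cell_percentage(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of empty cells in the whole table, as a percentage."""
    total_cells = n_variables * n_observations
    return 100.0 * missing_cells / total_cells


# Counts taken from the Dataset statistics block.
pct = missing_cell_percentage(missing_cells=176, n_variables=12, n_observations=574)
print(f"{pct:.1f}%")  # prints 2.6%
```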

Variable types

Numeric: 4
Text: 6
Categorical: 2

Dataset

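Description: Provides the company name (business name), products, main factory address, industry name, and related details for manufacturers located in Gimcheon-si, Gyeongsangbuk-do.
Author: Gimcheon-si, Gyeongsangbuk-do
URL: https://www.data.go.kr/data/15048870/fileData.do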

Alerts

데이터기준일 has constant value "2024-02-07" (Constant)
순번 is highly overall correlated with 산업단지명 (High correlation)
위도 is highly overall correlated with 산업단지명 (High correlation)
경도 is highly overall correlated with 산업단지명 (High correlation)
산업단지명 is highly overall correlated with 순번 and 2 other fields (High correlation)
전화번호 has 49 (8.5%) missing values (Missing)
팩스번호 has 123 (21.4%) missing values (Missing)
순번 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 11:25:40.007433
Analysis finished: 2024-03-14 11:25:45.356941
Duration: 5.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 574
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 287.5
Minimum: 1
Maximum: 574
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 29.65
Q1: 144.25
median: 287.5
Q3: 430.75
95-th percentile: 545.35
Maximum: 574
Range: 573
Interquartile range (IQR): 286.5

Descriptive statistics

Standard deviation: 165.8438
Coefficient of variation (CV): 0.57684801
Kurtosis: -1.2
Mean: 287.5
Median Absolute Deviation (MAD): 143.5
Skewness: 0
Sum: 165025
Variance: 27504.167
Monotonicity: Strictly increasing
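
Because 순번 is simply the sequence 1 to 574 (minimum 1, maximum 574, 574 distinct values, strictly increasing), the figures above can be checked by hand. The following is a sketch using only the Python standard library. It assumes the sample (n - 1) variance and linear interpolation between order statistics for quantiles, which are the conventions that reproduce the reported values here; `quantile` is an illustrative helper, not a profiler function.

```python
import math
import statistics

values = list(range(1, 575))  # 순번 runs 1..574 with no gaps


def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
q1, q3 = quantile(values, 0.25), quantile(values, 0.75)

print(sum(values), mean, round(variance, 3), round(std, 4))
print(q1, q3, q3 - q1)
```

With the population (n) denominator the variance would come out lower than the reported 27504.167, so the n - 1 convention appears to be the one in use.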
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
387 1
 
0.2%
381 1
 
0.2%
382 1
 
0.2%
383 1
 
0.2%
384 1
 
0.2%
385 1
 
0.2%
386 1
 
0.2%
388 1
 
0.2%
379 1
 
0.2%
Other values (564) 564
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
574 1
0.2%
573 1
0.2%
572 1
0.2%
571 1
0.2%
570 1
0.2%
569 1
0.2%
568 1
0.2%
567 1
0.2%
566 1
0.2%
565 1
0.2%

회사명
Text

Distinct: 562
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 20
Median length: 16
Mean length: 7.7857143
Min length: 2

Characters and Unicode

Total characters: 4469
Distinct characters: 362
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
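
As an illustration of the character properties referred to here, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. This is a stand-in for whatever the profiler uses internally, and `category_counts` is a hypothetical helper name. `unicodedata.category` returns two-letter codes such as `Lo` (Other Letter), `Ps` (Open Punctuation) and `Pe` (Close Punctuation).

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First sample row listed for this variable.
counts = category_counts("(주)건일산업")
print(counts)
```

For this value the five Hangul syllables fall under `Lo` and the two parentheses under `Ps` and `Pe`, which matches the dominance of Other Letter and the equal Open/Close Punctuation counts in the category table.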

Unique

Unique: 551
Unique (%): 96.0%

Sample

1st row(주)건일산업
2nd row(주)골드테크
3rd row(주)나노코
4th row(주)대우테크
5th row(주)동희산업 김천공장
ValueCountFrequency (%)
주식회사 118
 
15.5%
김천공장 15
 
2.0%
농업회사법인 10
 
1.3%
2공장 9
 
1.2%
제2공장 4
 
0.5%
김천2공장 3
 
0.4%
주)다원넥스트 3
 
0.4%
주)우삼 3
 
0.4%
주)한독 2
 
0.3%
주)유에이썬 2
 
0.3%
Other values (564) 591
77.8%

Most occurring characters

ValueCountFrequency (%)
402
 
9.0%
( 277
 
6.2%
) 277
 
6.2%
188
 
4.2%
161
 
3.6%
144
 
3.2%
134
 
3.0%
130
 
2.9%
115
 
2.6%
80
 
1.8%
Other values (352) 2561
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3640
81.4%
Open Punctuation 277
 
6.2%
Close Punctuation 277
 
6.2%
Space Separator 188
 
4.2%
Uppercase Letter 34
 
0.8%
Decimal Number 29
 
0.6%
Other Symbol 14
 
0.3%
Lowercase Letter 6
 
0.1%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
402
 
11.0%
161
 
4.4%
144
 
4.0%
134
 
3.7%
130
 
3.6%
115
 
3.2%
80
 
2.2%
79
 
2.2%
56
 
1.5%
53
 
1.5%
Other values (322) 2286
62.8%
Uppercase Letter
ValueCountFrequency (%)
S 4
11.8%
H 4
11.8%
E 4
11.8%
G 3
8.8%
P 3
8.8%
T 3
8.8%
R 2
 
5.9%
O 2
 
5.9%
L 2
 
5.9%
C 2
 
5.9%
Other values (5) 5
14.7%
Lowercase Letter
ValueCountFrequency (%)
o 2
33.3%
h 1
16.7%
c 1
16.7%
e 1
16.7%
d 1
16.7%
Decimal Number
ValueCountFrequency (%)
2 21
72.4%
1 4
 
13.8%
3 2
 
6.9%
0 2
 
6.9%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
& 1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 277
100.0%
Close Punctuation
ValueCountFrequency (%)
) 277
100.0%
Space Separator
ValueCountFrequency (%)
188
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3654
81.8%
Common 775
 
17.3%
Latin 40
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
402
 
11.0%
161
 
4.4%
144
 
3.9%
134
 
3.7%
130
 
3.6%
115
 
3.1%
80
 
2.2%
79
 
2.2%
56
 
1.5%
53
 
1.5%
Other values (323) 2300
62.9%
Latin
ValueCountFrequency (%)
S 4
 
10.0%
H 4
 
10.0%
E 4
 
10.0%
G 3
 
7.5%
P 3
 
7.5%
T 3
 
7.5%
R 2
 
5.0%
O 2
 
5.0%
L 2
 
5.0%
o 2
 
5.0%
Other values (10) 11
27.5%
Common
ValueCountFrequency (%)
( 277
35.7%
) 277
35.7%
188
24.3%
2 21
 
2.7%
1 4
 
0.5%
. 3
 
0.4%
3 2
 
0.3%
0 2
 
0.3%
& 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3640
81.4%
ASCII 815
 
18.2%
None 14
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
402
 
11.0%
161
 
4.4%
144
 
4.0%
134
 
3.7%
130
 
3.6%
115
 
3.2%
80
 
2.2%
79
 
2.2%
56
 
1.5%
53
 
1.5%
Other values (322) 2286
62.8%
ASCII
ValueCountFrequency (%)
( 277
34.0%
) 277
34.0%
188
23.1%
2 21
 
2.6%
S 4
 
0.5%
H 4
 
0.5%
E 4
 
0.5%
1 4
 
0.5%
G 3
 
0.4%
. 3
 
0.4%
Other values (19) 30
 
3.7%
None
ValueCountFrequency (%)
14
100.0%

사업자등록번호
Real number (ℝ)

Distinct: 536
Distinct (%): 94.0%
Missing: 4
Missing (%): 0.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.8416105 × 10^9
Minimum: 1.0181164 × 10^9
Maximum: 8.9385019 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 1.0181164 × 10^9
5-th percentile: 1.3081679 × 10^9
Q1: 4.6438756 × 10^9
median: 5.1081111 × 10^9
Q3: 5.1381246 × 10^9
95-th percentile: 7.8741661 × 10^9
Maximum: 8.9385019 × 10^9
Range: 7.9203855 × 10^9
Interquartile range (IQR): 4.94249 × 10^8

Descriptive statistics

Standard deviation: 1.6964012 × 10^9
Coefficient of variation (CV): 0.35037952
Kurtosis: 0.52258179
Mean: 4.8416105 × 10^9
Median Absolute Deviation (MAD): 59992262
Skewness: -0.35574601
Sum: 2.759718 × 10^12
Variance: 2.877777 × 10^18
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5108123076 3
 
0.5%
5108122946 3
 
0.5%
5038607694 3
 
0.5%
2798600823 2
 
0.3%
6201417961 2
 
0.3%
5108111003 2
 
0.3%
3378800050 2
 
0.3%
5088115200 2
 
0.3%
5108126073 2
 
0.3%
2098701780 2
 
0.3%
Other values (526) 547
95.3%
(Missing) 4
 
0.7%
ValueCountFrequency (%)
1018116406 1
0.2%
1058610049 1
0.2%
1058693297 1
0.2%
1078198139 1
0.2%
1088147195 1
0.2%
1103791935 1
0.2%
1117100300 1
0.2%
1138540154 1
0.2%
1138610545 1
0.2%
1170155246 1
0.2%
ValueCountFrequency (%)
8938501937 1
0.2%
8918800520 1
0.2%
8871401403 1
0.2%
8805700448 1
0.2%
8758101867 1
0.2%
8758100553 1
0.2%
8742500398 1
0.2%
8728501075 1
0.2%
8718600162 1
0.2%
8698700962 1
0.2%

공장대표주소(도로명)
Text

Distinct: 529
Distinct (%): 92.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 68
Median length: 47
Mean length: 25.142857
Min length: 16

Characters and Unicode

Total characters: 14432
Distinct characters: 216
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 492
Unique (%): 85.7%

Sample

1st row경상북도 김천시 어모면 산업단지5로 166
2nd row경상북도 김천시 어모면 산업단지3로 57
3rd row경상북도 김천시 어모면 산업단지2로 55
4th row경상북도 김천시 어모면 산업단지3로 63
5th row경상북도 김천시 어모면 산업단지1로 75
ValueCountFrequency (%)
경상북도 575
 
17.7%
김천시 575
 
17.7%
어모면 120
 
3.7%
106
 
3.3%
아포읍 74
 
2.3%
남면 65
 
2.0%
대광동 49
 
1.5%
1필지 38
 
1.2%
봉산면 34
 
1.0%
아포공단길 32
 
1.0%
Other values (691) 1581
48.7%

Most occurring characters

ValueCountFrequency (%)
2675
 
18.5%
606
 
4.2%
585
 
4.1%
583
 
4.0%
582
 
4.0%
577
 
4.0%
575
 
4.0%
575
 
4.0%
1 531
 
3.7%
2 360
 
2.5%
Other values (206) 6783
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8618
59.7%
Space Separator 2675
 
18.5%
Decimal Number 2411
 
16.7%
Open Punctuation 219
 
1.5%
Close Punctuation 219
 
1.5%
Dash Punctuation 191
 
1.3%
Other Punctuation 91
 
0.6%
Uppercase Letter 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
606
 
7.0%
585
 
6.8%
583
 
6.8%
582
 
6.8%
577
 
6.7%
575
 
6.7%
575
 
6.7%
327
 
3.8%
323
 
3.7%
313
 
3.6%
Other values (186) 3572
41.4%
Decimal Number
ValueCountFrequency (%)
1 531
22.0%
2 360
14.9%
3 283
11.7%
5 222
9.2%
0 218
9.0%
4 200
 
8.3%
8 162
 
6.7%
9 154
 
6.4%
6 145
 
6.0%
7 136
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
E 3
37.5%
A 2
25.0%
G 1
 
12.5%
D 1
 
12.5%
L 1
 
12.5%
Space Separator
ValueCountFrequency (%)
2675
100.0%
Open Punctuation
ValueCountFrequency (%)
( 219
100.0%
Close Punctuation
ValueCountFrequency (%)
) 219
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 191
100.0%
Other Punctuation
ValueCountFrequency (%)
, 91
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8618
59.7%
Common 5806
40.2%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
606
 
7.0%
585
 
6.8%
583
 
6.8%
582
 
6.8%
577
 
6.7%
575
 
6.7%
575
 
6.7%
327
 
3.8%
323
 
3.7%
313
 
3.6%
Other values (186) 3572
41.4%
Common
ValueCountFrequency (%)
2675
46.1%
1 531
 
9.1%
2 360
 
6.2%
3 283
 
4.9%
5 222
 
3.8%
( 219
 
3.8%
) 219
 
3.8%
0 218
 
3.8%
4 200
 
3.4%
- 191
 
3.3%
Other values (5) 688
 
11.8%
Latin
ValueCountFrequency (%)
E 3
37.5%
A 2
25.0%
G 1
 
12.5%
D 1
 
12.5%
L 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8618
59.7%
ASCII 5814
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2675
46.0%
1 531
 
9.1%
2 360
 
6.2%
3 283
 
4.9%
5 222
 
3.8%
( 219
 
3.8%
) 219
 
3.8%
0 218
 
3.7%
4 200
 
3.4%
- 191
 
3.3%
Other values (10) 696
 
12.0%
Hangul
ValueCountFrequency (%)
606
 
7.0%
585
 
6.8%
583
 
6.8%
582
 
6.8%
577
 
6.7%
575
 
6.7%
575
 
6.7%
327
 
3.8%
323
 
3.7%
313
 
3.6%
Other values (186) 3572
41.4%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 466
Distinct (%): 81.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36.13832
Minimum: 35.906807
Maximum: 36.236833
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 35.906807
5-th percentile: 36.058407
Q1: 36.125921
median: 36.149255
Q3: 36.166593
95-th percentile: 36.195483
Maximum: 36.236833
Range: 0.33002583
Interquartile range (IQR): 0.040672277

Descriptive statistics

Standard deviation: 0.044922594
Coefficient of variation (CV): 0.0012430737
Kurtosis: 3.5376664
Mean: 36.13832
Median Absolute Deviation (MAD): 0.019614905
Skewness: -1.4167411
Sum: 20743.396
Variance: 0.0020180395
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
36.12774884 18
 
3.1%
36.12743501 9
 
1.6%
36.16609367 4
 
0.7%
36.16659306 4
 
0.7%
36.14031366 4
 
0.7%
36.14079989 4
 
0.7%
36.15407124 4
 
0.7%
36.1597217 4
 
0.7%
36.16665182 3
 
0.5%
36.14316634 3
 
0.5%
Other values (456) 517
90.1%
ValueCountFrequency (%)
35.90680736 1
0.2%
35.91776848 1
0.2%
35.94218172 1
0.2%
35.97554617 1
0.2%
35.97559093 1
0.2%
35.97630339 1
0.2%
35.97702429 1
0.2%
35.97873584 1
0.2%
35.9886856 1
0.2%
36.00030564 1
0.2%
ValueCountFrequency (%)
36.23683319 1
 
0.2%
36.22202881 1
 
0.2%
36.22178195 1
 
0.2%
36.22030296 1
 
0.2%
36.21953898 1
 
0.2%
36.2169104 1
 
0.2%
36.21511739 1
 
0.2%
36.21469533 1
 
0.2%
36.2146576 1
 
0.2%
36.21447816 3
0.5%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 466
Distinct (%): 81.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 128.1542
Minimum: 127.91285
Maximum: 128.29754
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.2 KiB

Quantile statistics

Minimum: 127.91285
5-th percentile: 128.02602
Q1: 128.12588
median: 128.13967
Q3: 128.18736
95-th percentile: 128.25715
Maximum: 128.29754
Range: 0.3846918
Interquartile range (IQR): 0.06148455

Descriptive statistics

Standard deviation: 0.066465296
Coefficient of variation (CV): 0.00051863532
Kurtosis: 0.17667586
Mean: 128.1542
Median Absolute Deviation (MAD): 0.0371119
Skewness: -0.19814153
Sum: 73560.513
Variance: 0.0044176356
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.1816808 18
 
3.1%
128.1806135 9
 
1.6%
128.2535415 4
 
0.7%
128.2560329 4
 
0.7%
128.1413857 4
 
0.7%
128.1417755 4
 
0.7%
128.2166276 4
 
0.7%
128.1307733 4
 
0.7%
128.1393993 3
 
0.5%
128.1374794 3
 
0.5%
Other values (456) 517
90.1%
ValueCountFrequency (%)
127.9128477 1
0.2%
127.9389763 1
0.2%
127.9584392 1
0.2%
127.9874057 1
0.2%
128.0059512 1
0.2%
128.0079309 1
0.2%
128.0088053 1
0.2%
128.0096412 1
0.2%
128.009672 2
0.3%
128.0110167 1
0.2%
ValueCountFrequency (%)
128.2975395 1
 
0.2%
128.2929856 1
 
0.2%
128.2915427 1
 
0.2%
128.2910309 1
 
0.2%
128.2905453 1
 
0.2%
128.289326 1
 
0.2%
128.2892271 1
 
0.2%
128.2888251 3
0.5%
128.2704417 1
 
0.2%
128.2702578 2
0.3%

업종명
Text

Distinct: 323
Distinct (%): 56.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 35
Median length: 28
Mean length: 19.304878
Min length: 5

Characters and Unicode

Total characters: 11081
Distinct characters: 267
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 213
Unique (%): 37.1%

Sample

1st row그 외 기타 분류 안된 비금속 광물제품 제조업
2nd row그 외 기타 전자부품 제조업 외 3 종
3rd row합성수지 및 기타 플라스틱 물질 제조업
4th row텔레비전 제조업
5th row자동차 차체용 신품 부품 제조업 외 4 종
ValueCountFrequency (%)
제조업 524
 
14.3%
389
 
10.6%
283
 
7.7%
254
 
7.0%
기타 188
 
5.1%
1 143
 
3.9%
106
 
2.9%
플라스틱 64
 
1.8%
2 48
 
1.3%
부품 44
 
1.2%
Other values (372) 1610
44.1%

Most occurring characters

ValueCountFrequency (%)
3079
27.8%
712
 
6.4%
617
 
5.6%
596
 
5.4%
400
 
3.6%
334
 
3.0%
294
 
2.7%
257
 
2.3%
254
 
2.3%
195
 
1.8%
Other values (257) 4343
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7614
68.7%
Space Separator 3079
27.8%
Decimal Number 298
 
2.7%
Other Punctuation 68
 
0.6%
Open Punctuation 11
 
0.1%
Close Punctuation 11
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
712
 
9.4%
617
 
8.1%
596
 
7.8%
400
 
5.3%
334
 
4.4%
294
 
3.9%
257
 
3.4%
254
 
3.3%
195
 
2.6%
170
 
2.2%
Other values (243) 3785
49.7%
Decimal Number
ValueCountFrequency (%)
1 160
53.7%
2 49
 
16.4%
3 33
 
11.1%
4 23
 
7.7%
5 13
 
4.4%
8 7
 
2.3%
6 7
 
2.3%
9 3
 
1.0%
7 2
 
0.7%
0 1
 
0.3%
Space Separator
ValueCountFrequency (%)
3079
100.0%
Other Punctuation
ValueCountFrequency (%)
, 68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7614
68.7%
Common 3467
31.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
712
 
9.4%
617
 
8.1%
596
 
7.8%
400
 
5.3%
334
 
4.4%
294
 
3.9%
257
 
3.4%
254
 
3.3%
195
 
2.6%
170
 
2.2%
Other values (243) 3785
49.7%
Common
ValueCountFrequency (%)
3079
88.8%
1 160
 
4.6%
, 68
 
2.0%
2 49
 
1.4%
3 33
 
1.0%
4 23
 
0.7%
5 13
 
0.4%
( 11
 
0.3%
) 11
 
0.3%
8 7
 
0.2%
Other values (4) 13
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7610
68.7%
ASCII 3467
31.3%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3079
88.8%
1 160
 
4.6%
, 68
 
2.0%
2 49
 
1.4%
3 33
 
1.0%
4 23
 
0.7%
5 13
 
0.4%
( 11
 
0.3%
) 11
 
0.3%
8 7
 
0.2%
Other values (4) 13
 
0.4%
Hangul
ValueCountFrequency (%)
712
 
9.4%
617
 
8.1%
596
 
7.8%
400
 
5.3%
334
 
4.4%
294
 
3.9%
257
 
3.4%
254
 
3.3%
195
 
2.6%
170
 
2.2%
Other values (242) 3781
49.7%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

생산품
Text

Distinct: 534
Distinct (%): 93.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 110
Median length: 47
Mean length: 11.686411
Min length: 1

Characters and Unicode

Total characters: 6708
Distinct characters: 527
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 511
Unique (%): 89.0%

Sample

1st row흑연제조품, 내화단열재
2nd row전자부품 성형용기
3rd rowPhenol 수지, 주물용 주제
4th row텔레비전
5th row자동차 부품
ValueCountFrequency (%)
45
 
3.2%
25
 
1.8%
부품 22
 
1.6%
플라스틱 17
 
1.2%
15
 
1.1%
자동차 13
 
0.9%
자동차부품 12
 
0.9%
콘크리트 10
 
0.7%
제조 8
 
0.6%
산업용 7
 
0.5%
Other values (946) 1223
87.5%

Most occurring characters

ValueCountFrequency (%)
865
 
12.9%
, 349
 
5.2%
166
 
2.5%
110
 
1.6%
107
 
1.6%
105
 
1.6%
98
 
1.5%
95
 
1.4%
83
 
1.2%
80
 
1.2%
Other values (517) 4650
69.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4645
69.2%
Space Separator 865
 
12.9%
Uppercase Letter 425
 
6.3%
Other Punctuation 364
 
5.4%
Lowercase Letter 287
 
4.3%
Close Punctuation 49
 
0.7%
Open Punctuation 49
 
0.7%
Decimal Number 17
 
0.3%
Dash Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
166
 
3.6%
110
 
2.4%
107
 
2.3%
105
 
2.3%
98
 
2.1%
95
 
2.0%
83
 
1.8%
80
 
1.7%
76
 
1.6%
67
 
1.4%
Other values (456) 3658
78.8%
Uppercase Letter
ValueCountFrequency (%)
P 52
12.2%
C 41
 
9.6%
E 39
 
9.2%
T 32
 
7.5%
A 30
 
7.1%
S 28
 
6.6%
D 25
 
5.9%
L 25
 
5.9%
R 21
 
4.9%
M 18
 
4.2%
Other values (14) 114
26.8%
Lowercase Letter
ValueCountFrequency (%)
r 37
12.9%
e 34
11.8%
s 30
10.5%
i 27
9.4%
a 23
8.0%
l 20
 
7.0%
n 17
 
5.9%
o 17
 
5.9%
t 16
 
5.6%
c 10
 
3.5%
Other values (13) 56
19.5%
Other Punctuation
ValueCountFrequency (%)
, 349
95.9%
/ 9
 
2.5%
. 3
 
0.8%
' 2
 
0.5%
& 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
9 8
47.1%
6 3
 
17.6%
3 2
 
11.8%
4 2
 
11.8%
2 2
 
11.8%
Space Separator
ValueCountFrequency (%)
865
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4644
69.2%
Common 1351
 
20.1%
Latin 712
 
10.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
166
 
3.6%
110
 
2.4%
107
 
2.3%
105
 
2.3%
98
 
2.1%
95
 
2.0%
83
 
1.8%
80
 
1.7%
76
 
1.6%
67
 
1.4%
Other values (455) 3657
78.7%
Latin
ValueCountFrequency (%)
P 52
 
7.3%
C 41
 
5.8%
E 39
 
5.5%
r 37
 
5.2%
e 34
 
4.8%
T 32
 
4.5%
s 30
 
4.2%
A 30
 
4.2%
S 28
 
3.9%
i 27
 
3.8%
Other values (37) 362
50.8%
Common
ValueCountFrequency (%)
865
64.0%
, 349
25.8%
) 49
 
3.6%
( 49
 
3.6%
/ 9
 
0.7%
9 8
 
0.6%
- 7
 
0.5%
6 3
 
0.2%
. 3
 
0.2%
3 2
 
0.1%
Other values (4) 7
 
0.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4644
69.2%
ASCII 2063
30.8%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
865
41.9%
, 349
16.9%
P 52
 
2.5%
) 49
 
2.4%
( 49
 
2.4%
C 41
 
2.0%
E 39
 
1.9%
r 37
 
1.8%
e 34
 
1.6%
T 32
 
1.6%
Other values (51) 516
25.0%
Hangul
ValueCountFrequency (%)
166
 
3.6%
110
 
2.4%
107
 
2.3%
105
 
2.3%
98
 
2.1%
95
 
2.0%
83
 
1.8%
80
 
1.7%
76
 
1.6%
67
 
1.4%
Other values (455) 3657
78.7%
CJK
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct: 476
Distinct (%): 90.7%
Missing: 49
Missing (%): 8.5%
Memory size: 4.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.022857
Min length: 9

Characters and Unicode

Total characters: 6312
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 435
Unique (%): 82.9%

Sample

1st row054-716-1191
2nd row054-910-7337
3rd row054-437-6331
4th row070-7615-7004
5th row054-431-1900
ValueCountFrequency (%)
054-436-2297 4
 
0.8%
054-439-9339 3
 
0.6%
054-439-2241 3
 
0.6%
054-716-2124 3
 
0.6%
054-434-3333 3
 
0.6%
054-435-5550 3
 
0.6%
054-420-8356 3
 
0.6%
054-437-1480 2
 
0.4%
054-436-2121 2
 
0.4%
054-434-9990 2
 
0.4%
Other values (466) 497
94.7%

Most occurring characters

ValueCountFrequency (%)
4 1168
18.5%
- 1049
16.6%
0 947
15.0%
5 750
11.9%
3 706
11.2%
1 371
 
5.9%
7 306
 
4.8%
2 275
 
4.4%
8 252
 
4.0%
9 249
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5263
83.4%
Dash Punctuation 1049
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 1168
22.2%
0 947
18.0%
5 750
14.3%
3 706
13.4%
1 371
 
7.0%
7 306
 
5.8%
2 275
 
5.2%
8 252
 
4.8%
9 249
 
4.7%
6 239
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 1049
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6312
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 1168
18.5%
- 1049
16.6%
0 947
15.0%
5 750
11.9%
3 706
11.2%
1 371
 
5.9%
7 306
 
4.8%
2 275
 
4.4%
8 252
 
4.0%
9 249
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6312
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 1168
18.5%
- 1049
16.6%
0 947
15.0%
5 750
11.9%
3 706
11.2%
1 371
 
5.9%
7 306
 
4.8%
2 275
 
4.4%
8 252
 
4.0%
9 249
 
3.9%

팩스번호
Text

MISSING 

Distinct: 409
Distinct (%): 90.7%
Missing: 123
Missing (%): 21.4%
Memory size: 4.6 KiB

Length

Max length: 14
Median length: 12
Mean length: 12.044346
Min length: 11

Characters and Unicode

Total characters: 5432
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 372
Unique (%): 82.5%

Sample

1st row0504-849-1752
2nd row054-910-7338
3rd row054-437-6320
4th row070-7618-8020
5th row054-434-1940
ValueCountFrequency (%)
054-430-2298 4
 
0.9%
054-434-7111 3
 
0.7%
054-436-9248 3
 
0.7%
054-716-2125 3
 
0.7%
054-436-2122 2
 
0.4%
054-434-3618 2
 
0.4%
054-439-3417 2
 
0.4%
054-439-1187 2
 
0.4%
054-432-4238 2
 
0.4%
054-433-5188 2
 
0.4%
Other values (399) 426
94.5%

Most occurring characters

ValueCountFrequency (%)
4 1004
18.5%
- 902
16.6%
0 754
13.9%
5 657
12.1%
3 606
11.2%
1 283
 
5.2%
7 280
 
5.2%
2 254
 
4.7%
9 236
 
4.3%
6 230
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4530
83.4%
Dash Punctuation 902
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 1004
22.2%
0 754
16.6%
5 657
14.5%
3 606
13.4%
1 283
 
6.2%
7 280
 
6.2%
2 254
 
5.6%
9 236
 
5.2%
6 230
 
5.1%
8 226
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 902
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5432
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 1004
18.5%
- 902
16.6%
0 754
13.9%
5 657
12.1%
3 606
11.2%
1 283
 
5.2%
7 280
 
5.2%
2 254
 
4.7%
9 236
 
4.3%
6 230
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5432
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 1004
18.5%
- 902
16.6%
0 754
13.9%
5 657
12.1%
3 606
11.2%
1 283
 
5.2%
7 280
 
5.2%
2 254
 
4.7%
9 236
 
4.3%
6 230
 
4.2%

산업단지명
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 10
Median length: 4
Mean length: 5.6655052
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row김천제1일반산업단지
2nd row김천제1일반산업단지
3rd row김천제1일반산업단지
4th row김천제1일반산업단지
5th row김천제1일반산업단지

Common Values

ValueCountFrequency (%)
<NA> 388
67.6%
김천제1일반산업단지 106
 
18.5%
김천대광농공단지 33
 
5.7%
김천아포농공단지 29
 
5.1%
김천감문농공단지 14
 
2.4%
김천지례농공단지 4
 
0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 388
67.6%
김천제1일반산업단지 106
 
18.5%
김천대광농공단지 33
 
5.7%
김천아포농공단지 29
 
5.1%
김천감문농공단지 14
 
2.4%
김천지례농공단지 4
 
0.7%

데이터기준일
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row2024-02-07
2nd row2024-02-07
3rd row2024-02-07
4th row2024-02-07
5th row2024-02-07

Common Values

ValueCountFrequency (%)
2024-02-07 574
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2024-02-07 574
100.0%

Interactions

[Figures: 16 interaction plots; images not preserved]

Correlations

Variable | 순번 | 사업자등록번호 | 위도 | 경도 | 산업단지명
순번 | 1.000 | 0.000 | 0.623 | 0.678 | 0.789
사업자등록번호 | 0.000 | 1.000 | 0.196 | 0.259 | 0.132
위도 | 0.623 | 0.196 | 1.000 | 0.890 | 0.846
경도 | 0.678 | 0.259 | 0.890 | 1.000 | 1.000
산업단지명 | 0.789 | 0.132 | 0.846 | 1.000 | 1.000

Variable | 순번 | 사업자등록번호 | 위도 | 경도 | 산업단지명
순번 | 1.000 | -0.064 | -0.416 | 0.154 | 0.744
사업자등록번호 | -0.064 | 1.000 | 0.047 | 0.080 | 0.052
위도 | -0.416 | 0.047 | 1.000 | -0.052 | 0.820
경도 | 0.154 | 0.080 | -0.052 | 1.000 | 0.997
산업단지명 | 0.744 | 0.052 | 0.820 | 0.997 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
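
Nullity correlation can be read as the Pearson correlation between per-column missingness indicators. The sketch below computes it for 전화번호 and 팩스번호 using only the 20 rows displayed in the Sample section, so the result describes those rows and not the full 574-row dataset. `pearson` is an illustrative helper written for this example.

```python
import math

# 1 = value missing, 0 = value present, for the 20 rows displayed under "Sample"
# (순번 1-10 followed by 순번 565-574).
phone_missing = [0, 0, 0, 0, 0, 0, 0, 0, 0, 0] + [0, 0, 0, 0, 1, 1, 0, 0, 1, 0]
fax_missing = [0, 0, 0, 0, 0, 0, 0, 1, 0, 0] + [0, 0, 0, 1, 1, 1, 0, 0, 1, 0]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


r = pearson(phone_missing, fax_missing)
print(round(r, 3))
```

A value near 1 means the two columns tend to be missing on the same rows. In these 20 rows every record without a phone number also lacks a fax number, which is why the coefficient is strongly positive.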

Sample

(index) | 순번 | 회사명 | 사업자등록번호 | 공장대표주소(도로명) | 위도 | 경도 | 업종명 | 생산품 | 전화번호 | 팩스번호 | 산업단지명 | 데이터기준일
0 | 1 | (주)건일산업 | 1868600900 | 경상북도 김천시 어모면 산업단지5로 166 | 36.17674 | 128.136805 | 그 외 기타 분류 안된 비금속 광물제품 제조업 | 흑연제조품, 내화단열재 | 054-716-1191 | 0504-849-1752 | 김천제1일반산업단지 | 2024-02-07
1 | 2 | (주)골드테크 | 5678700993 | 경상북도 김천시 어모면 산업단지3로 57 | 36.164587 | 128.124101 | 그 외 기타 전자부품 제조업 외 3 종 | 전자부품 성형용기 | 054-910-7337 | 054-910-7338 | 김천제1일반산업단지 | 2024-02-07
2 | 3 | (주)나노코 | 1138540154 | 경상북도 김천시 어모면 산업단지2로 55 | 36.163579 | 128.122444 | 합성수지 및 기타 플라스틱 물질 제조업 | Phenol 수지, 주물용 주제 | 054-437-6331 | 054-437-6320 | 김천제1일반산업단지 | 2024-02-07
3 | 4 | (주)대우테크 | 1238649765 | 경상북도 김천시 어모면 산업단지3로 63 | 36.165848 | 128.124959 | 텔레비전 제조업 | 텔레비전 | 070-7615-7004 | 070-7618-8020 | 김천제1일반산업단지 | 2024-02-07
4 | 5 | (주)동희산업 김천공장 | 5108509179 | 경상북도 김천시 어모면 산업단지1로 75 | 36.156992 | 128.121533 | 자동차 차체용 신품 부품 제조업 외 4 종 | 자동차 부품 | 054-431-1900 | 054-434-1940 | 김천제1일반산업단지 | 2024-02-07
5 | 6 | (주)모베이스오토 | 5798801255 | 경상북도 김천시 어모면 산업단지1로 45 | 36.157066 | 128.119055 | 운송장비용 조명장치 제조업 | 자동차 헤드램프 하우징, 베젤 | 054-436-7644 | 054-436-8644 | 김천제1일반산업단지 | 2024-02-07
6 | 7 | (주)엘앤에프 | 5138119101 | 경상북도 김천시 어모면 산업단지1로 83 | 36.158054 | 128.122646 | 자동차 엔진용 신품 부품 제조업 외 1 종 | 2차전지 양극용 전구체 | 053-592-7300 | 053-592-7301 | 김천제1일반산업단지 | 2024-02-07
7 | 8 | (주)영해식품 | 5148155207 | 경상북도 김천시 어모면 산업단지2로 43, 영해식품 | 36.163075 | 128.121643 | 기타 수산동물 가공 및 저장 처리업 | 오징어핫바 | 054-435-6262 | <NA> | 김천제1일반산업단지 | 2024-02-07
8 | 9 | (주)정도정밀 | 5108132051 | 경상북도 김천시 어모면 산업단지로 20 | 36.15841 | 128.118568 | 자동차용 신품 동력전달장치 제조업 외 8 종 | 자동차부품 | 054-435-1012 | 053-434-3507 | 김천제1일반산업단지 | 2024-02-07
9 | 10 | (주)케이씨씨 | 5108508302 | 경상북도 김천시 어모면 산업단지로 39 | 36.161228 | 128.120795 | 암면 및 유사제품 제조업 외 3 종 | 보온재, 천장재 | 054-420-1700 | 054-420-1799 | 김천제1일반산업단지 | 2024-02-07

(index) | 순번 | 회사명 | 사업자등록번호 | 공장대표주소(도로명) | 위도 | 경도 | 업종명 | 생산품 | 전화번호 | 팩스번호 | 산업단지명 | 데이터기준일
564 | 565 | 한우물캡 | 2730201372 | 경상북도 김천시 어모면 어모로 133-14 외 2필지 | 36.169941 | 128.114661 | 탭, 밸브 및 유사장치 제조업 외 2 종 | 지하수상부보호공, 수위조절기, 신축관이음 | 054-435-6636 | 054-435-6637 | <NA> | 2024-02-07
565 | 566 | 한일콘텍(주) | 5108125415 | 경상북도 김천시 아포읍 아포1로 140 외 4필지 | 36.180185 | 128.233954 | 콘크리트 관 및 기타 구조용 콘크리트 제품 제조업 | 수로관, 벤취플룸관 | 054-434-4006 | 054-434-4077 | <NA> | 2024-02-07
566 | 567 | 한제목재산업 | 5130393234 | 경상북도 김천시 농소면 농남로 330 | 36.097136 | 128.201444 | 목재 포장용 상자, 드럼 및 유사용기 제조업 | 목재파레트 | 054-430-4610 | 054-430-4611 | <NA> | 2024-02-07
567 | 568 | 한진플랜트 건설 | 5100243403 | 경상북도 김천시 평화장미길 104 (평화동) | 36.129132 | 128.112968 | 육상 금속 골조 구조재 제조업 외 3 종 | 도로표지판 | 054-437-2200 | <NA> | <NA> | 2024-02-07
568 | 569 | 해광씨앤에스 | 2842701166 | 경상북도 김천시 농소면 연명2길 165 | 36.075549 | 128.191075 | 근무복, 작업복 및 유사의복 제조업 | 근무복, 작업복 및 유사 의류 | <NA> | <NA> | <NA> | 2024-02-07
569 | 570 | 혁신건설기계 | 5100527345 | 경상북도 김천시 농소면 용시길 29-13 | 36.110378 | 128.171744 | 건설 및 채광용 기계장비 제조업 | 건설기계부품 | <NA> | <NA> | <NA> | 2024-02-07
570 | 571 | 현구석재사업 | 5102262280 | 경상북도 김천시 지례면 지례로 26 | 35.988686 | 128.032992 | 기타 석제품 제조업 외 1 종 | 석물가공 | 054-435-0321 | 054-434-4190 | <NA> | 2024-02-07
571 | 572 | 현대목재 김천공장 | 4183300214 | 경상북도 김천시 개령면 감문로 596 외 4필지 | 36.181751 | 128.158715 | 목재 포장용 상자, 드럼 및 유사용기 제조업 | 목재 깔판 및 목재 포장목 상자 | 054-437-5001 | 054-437-5007 | <NA> | 2024-02-07
572 | 573 | 혜인우드앤디자인 | 6030791726 | 경상북도 김천시 남면 영남대로 2905, 1층 | 36.091807 | 128.24143 | 구조용 금속 판제품 및 공작물 제조업 외 1 종 | 디자인형울타리, 안내판 | <NA> | <NA> | <NA> | 2024-02-07
573 | 574 | 황악협동조합 | 1238649203 | 경상북도 김천시 대항면 황학동길 18 | 36.114593 | 128.016851 | 기타 곡물 가공품 제조업 | 현미 누룽지 | 054-434-6631 | 054-434-6632 | <NA> | 2024-02-07