Overview

Dataset statistics

Number of variables7
Number of observations87
Missing cells2
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.0 KiB
Average record size in memory58.5 B

Variable types

Numeric1
Text3
Categorical2
DateTime1

Dataset

Description경기도 용인시 농식품가공업체 현황입니다. 업체명, 연락처, 주소 등의 데이터를 제공합니다. ※ 데이터기준일자 : 2023-05-01
URLhttps://www.data.go.kr/data/15014167/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
연번 is highly overall correlated with 비고High correlation
대표품목 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연락처 has 2 (2.3%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:49:45.690797
Analysis finished2023-12-12 22:49:46.500207
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct87
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44
Minimum1
Maximum87
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size915.0 B
2023-12-13T07:49:46.562040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.3
Q122.5
median44
Q365.5
95-th percentile82.7
Maximum87
Range86
Interquartile range (IQR)43

Descriptive statistics

Standard deviation25.258662
Coefficient of variation (CV)0.5740605
Kurtosis-1.2
Mean44
Median Absolute Deviation (MAD)22
Skewness0
Sum3828
Variance638
MonotonicityStrictly increasing
2023-12-13T07:49:46.665453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
2 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
62 1
 
1.1%
61 1
 
1.1%
60 1
 
1.1%
59 1
 
1.1%
58 1
 
1.1%
Other values (77) 77
88.5%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
87 1
1.1%
86 1
1.1%
85 1
1.1%
84 1
1.1%
83 1
1.1%
82 1
1.1%
81 1
1.1%
80 1
1.1%
79 1
1.1%
78 1
1.1%
Distinct84
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size828.0 B
2023-12-13T07:49:46.877029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length5.1034483
Min length2

Characters and Unicode

Total characters444
Distinct characters150
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)93.1%

Sample

1st row대원식품
2nd row이내식품
3rd row(유)두드림푸드시스템
4th row㈜코푸홈푸드
5th row(주)데일리쿡
ValueCountFrequency (%)
㈜동산 2
 
2.2%
소문난떡집 2
 
2.2%
종로떡집 2
 
2.2%
동희㈜ 2
 
2.2%
민속떡집 2
 
2.2%
대우식품 2
 
2.2%
떡뫼마을 1
 
1.1%
떡이랑 1
 
1.1%
상현떡방 1
 
1.1%
팔도명가 1
 
1.1%
Other values (74) 74
82.2%
2023-12-13T07:49:47.199420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
6.8%
17
 
3.8%
15
 
3.4%
14
 
3.2%
13
 
2.9%
11
 
2.5%
11
 
2.5%
10
 
2.3%
9
 
2.0%
9
 
2.0%
Other values (140) 305
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 396
89.2%
Space Separator 13
 
2.9%
Other Symbol 11
 
2.5%
Close Punctuation 8
 
1.8%
Open Punctuation 8
 
1.8%
Uppercase Letter 7
 
1.6%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
7.6%
17
 
4.3%
15
 
3.8%
14
 
3.5%
11
 
2.8%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (130) 265
66.9%
Uppercase Letter
ValueCountFrequency (%)
R 2
28.6%
C 2
28.6%
N 1
14.3%
S 1
14.3%
F 1
14.3%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 407
91.7%
Common 30
 
6.8%
Latin 7
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
7.4%
17
 
4.2%
15
 
3.7%
14
 
3.4%
11
 
2.7%
11
 
2.7%
10
 
2.5%
9
 
2.2%
9
 
2.2%
8
 
2.0%
Other values (131) 273
67.1%
Latin
ValueCountFrequency (%)
R 2
28.6%
C 2
28.6%
N 1
14.3%
S 1
14.3%
F 1
14.3%
Common
ValueCountFrequency (%)
13
43.3%
) 8
26.7%
( 8
26.7%
& 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 396
89.2%
ASCII 37
 
8.3%
None 11
 
2.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
7.6%
17
 
4.3%
15
 
3.8%
14
 
3.5%
11
 
2.8%
10
 
2.5%
9
 
2.3%
9
 
2.3%
8
 
2.0%
8
 
2.0%
Other values (130) 265
66.9%
ASCII
ValueCountFrequency (%)
13
35.1%
) 8
21.6%
( 8
21.6%
R 2
 
5.4%
C 2
 
5.4%
N 1
 
2.7%
S 1
 
2.7%
& 1
 
2.7%
F 1
 
2.7%
None
ValueCountFrequency (%)
11
100.0%

연락처
Text

MISSING 

Distinct80
Distinct (%)94.1%
Missing2
Missing (%)2.3%
Memory size828.0 B
2023-12-13T07:49:47.408377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.011765
Min length12

Characters and Unicode

Total characters1021
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)88.2%

Sample

1st row031-332-4922
2nd row031-332-3368
3rd row031-338-0457
4th row031-334-7884
5th row031-338-8660
ValueCountFrequency (%)
031-334-4100 2
 
2.4%
031-339-3367 2
 
2.4%
031-332-4289 2
 
2.4%
031-338-0348 2
 
2.4%
031-333-0335 2
 
2.4%
031-264-1251 1
 
1.2%
031-261-8123 1
 
1.2%
031-897-0909 1
 
1.2%
031-266-4088 1
 
1.2%
031-261-6787 1
 
1.2%
Other values (70) 70
82.4%
2023-12-13T07:49:47.765762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 224
21.9%
- 170
16.7%
0 124
12.1%
1 115
11.3%
2 103
10.1%
8 61
 
6.0%
6 56
 
5.5%
4 50
 
4.9%
7 47
 
4.6%
9 41
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 851
83.3%
Dash Punctuation 170
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 224
26.3%
0 124
14.6%
1 115
13.5%
2 103
12.1%
8 61
 
7.2%
6 56
 
6.6%
4 50
 
5.9%
7 47
 
5.5%
9 41
 
4.8%
5 30
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 170
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1021
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 224
21.9%
- 170
16.7%
0 124
12.1%
1 115
11.3%
2 103
10.1%
8 61
 
6.0%
6 56
 
5.5%
4 50
 
4.9%
7 47
 
4.6%
9 41
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1021
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 224
21.9%
- 170
16.7%
0 124
12.1%
1 115
11.3%
2 103
10.1%
8 61
 
6.0%
6 56
 
5.5%
4 50
 
4.9%
7 47
 
4.6%
9 41
 
4.0%

주소
Text

Distinct85
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size828.0 B
2023-12-13T07:49:48.004379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length15.908046
Min length7

Characters and Unicode

Total characters1384
Distinct characters134
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)95.4%

Sample

1st row처인구 백암면 황새울로43번길 4-6
2nd row처인구 백암면 고안로 97-36
3rd row처인구 모현읍 능원로 10번길 30
4th row처인구 모현읍 포은대로 1100-36
5th row처인구 양지면 양지로143번길 5-3
ValueCountFrequency (%)
처인구 31
 
10.5%
수지구 23
 
7.8%
기흥구 16
 
5.4%
풍덕천동 10
 
3.4%
백암면 10
 
3.4%
양지면 7
 
2.4%
모현읍 7
 
2.4%
상현동 6
 
2.0%
원삼면 5
 
1.7%
남사면 5
 
1.7%
Other values (152) 174
59.2%
2023-12-13T07:49:48.358134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
213
 
15.4%
74
 
5.3%
1 62
 
4.5%
- 52
 
3.8%
3 49
 
3.5%
49
 
3.5%
6 43
 
3.1%
2 41
 
3.0%
40
 
2.9%
0 39
 
2.8%
Other values (124) 722
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 737
53.3%
Decimal Number 376
27.2%
Space Separator 213
 
15.4%
Dash Punctuation 52
 
3.8%
Uppercase Letter 5
 
0.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
74
 
10.0%
49
 
6.6%
40
 
5.4%
31
 
4.2%
31
 
4.2%
27
 
3.7%
25
 
3.4%
23
 
3.1%
23
 
3.1%
19
 
2.6%
Other values (108) 395
53.6%
Decimal Number
ValueCountFrequency (%)
1 62
16.5%
3 49
13.0%
6 43
11.4%
2 41
10.9%
0 39
10.4%
4 36
9.6%
5 31
8.2%
8 31
8.2%
7 27
7.2%
9 17
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
A 3
60.0%
P 1
 
20.0%
B 1
 
20.0%
Space Separator
ValueCountFrequency (%)
213
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 737
53.3%
Common 642
46.4%
Latin 5
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
74
 
10.0%
49
 
6.6%
40
 
5.4%
31
 
4.2%
31
 
4.2%
27
 
3.7%
25
 
3.4%
23
 
3.1%
23
 
3.1%
19
 
2.6%
Other values (108) 395
53.6%
Common
ValueCountFrequency (%)
213
33.2%
1 62
 
9.7%
- 52
 
8.1%
3 49
 
7.6%
6 43
 
6.7%
2 41
 
6.4%
0 39
 
6.1%
4 36
 
5.6%
5 31
 
4.8%
8 31
 
4.8%
Other values (3) 45
 
7.0%
Latin
ValueCountFrequency (%)
A 3
60.0%
P 1
 
20.0%
B 1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 737
53.3%
ASCII 647
46.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
213
32.9%
1 62
 
9.6%
- 52
 
8.0%
3 49
 
7.6%
6 43
 
6.6%
2 41
 
6.3%
0 39
 
6.0%
4 36
 
5.6%
5 31
 
4.8%
8 31
 
4.8%
Other values (6) 50
 
7.7%
Hangul
ValueCountFrequency (%)
74
 
10.0%
49
 
6.6%
40
 
5.4%
31
 
4.2%
31
 
4.2%
27
 
3.7%
25
 
3.4%
23
 
3.1%
23
 
3.1%
19
 
2.6%
Other values (108) 395
53.6%

대표품목
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)29.9%
Missing0
Missing (%)0.0%
Memory size828.0 B
떡류가공업
44 
김치 제조판매
17 
쌀가루
 
2
전통장류
 
2
백암막걸리
 
1
Other values (21)
21 

Length

Max length10
Median length5
Mean length5.4597701
Min length2

Unique

Unique22 ?
Unique (%)25.3%

Sample

1st row김치 제조판매
2nd row김치 제조판매
3rd row김치 제조판매
4th row김치 제조판매
5th row김치 제조판매

Common Values

ValueCountFrequency (%)
떡류가공업 44
50.6%
김치 제조판매 17
 
19.5%
쌀가루 2
 
2.3%
전통장류 2
 
2.3%
백암막걸리 1
 
1.1%
원삼막걸리 1
 
1.1%
양지막걸리 1
 
1.1%
처인성막걸리 1
 
1.1%
석향주, 석성주 1
 
1.1%
미르40, 백설공주 1
 
1.1%
Other values (16) 16
 
18.4%

Length

2023-12-13T07:49:48.499781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
떡류가공업 44
39.3%
김치 18
16.1%
제조판매 17
 
15.2%
쌀가루 2
 
1.8%
전통장류 2
 
1.8%
단무지 2
 
1.8%
제품 1
 
0.9%
감자분말 1
 
0.9%
떡류 1
 
0.9%
오가피 1
 
0.9%
Other values (23) 23
20.5%

비고
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size828.0 B
떡류가공업
44 
김치류
17 
기타품목
17 
전통주류

Length

Max length5
Median length5
Mean length4.3103448
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row김치류
2nd row김치류
3rd row김치류
4th row김치류
5th row김치류

Common Values

ValueCountFrequency (%)
떡류가공업 44
50.6%
김치류 17
 
19.5%
기타품목 17
 
19.5%
전통주류 9
 
10.3%

Length

2023-12-13T07:49:48.627321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:49:48.730752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
떡류가공업 44
50.6%
김치류 17
 
19.5%
기타품목 17
 
19.5%
전통주류 9
 
10.3%

데이터 기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size828.0 B
Minimum2023-05-01 00:00:00
Maximum2023-05-01 00:00:00
2023-12-13T07:49:48.822955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:49:48.897414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T07:49:46.307108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:49:48.962090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업체명연락처주소대표품목비고
연번1.0000.7720.8370.9700.8110.993
업체명0.7721.0000.9950.9890.9960.977
연락처0.8370.9951.0000.9980.0000.000
주소0.9700.9890.9981.0000.9750.848
대표품목0.8110.9960.0000.9751.0001.000
비고0.9930.9770.0000.8481.0001.000
2023-12-13T07:49:49.313519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비고대표품목
비고1.0000.857
대표품목0.8571.000
2023-12-13T07:49:49.386885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번대표품목비고
연번1.0000.4030.922
대표품목0.4031.0000.857
비고0.9220.8571.000

Missing values

2023-12-13T07:49:46.383331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:49:46.466751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명연락처주소대표품목비고데이터 기준일자
01대원식품031-332-4922처인구 백암면 황새울로43번길 4-6김치 제조판매김치류2023-05-01
12이내식품031-332-3368처인구 백암면 고안로 97-36김치 제조판매김치류2023-05-01
23(유)두드림푸드시스템031-338-0457처인구 모현읍 능원로 10번길 30김치 제조판매김치류2023-05-01
34㈜코푸홈푸드031-334-7884처인구 모현읍 포은대로 1100-36김치 제조판매김치류2023-05-01
45(주)데일리쿡031-338-8660처인구 양지면 양지로143번길 5-3김치 제조판매김치류2023-05-01
56㈜씨알에프엔씨(CRF&C)031-338-1114처인구 양지면 양지로 17-18김치 제조판매김치류2023-05-01
67㈜델리후레쉬031-334-6263처인구 남사면 원암로 481김치 제조판매김치류2023-05-01
78주식회사신촌댁식품031-321-1820처인구 원삼면 죽양대로 1774김치 제조판매김치류2023-05-01
89좋구먼식품031-322-4992처인구 포곡읍 금어로 586번길 4-35김치 제조판매김치류2023-05-01
910다솜식품031-334-7223처인구 이동읍 서리로 138김치 제조판매김치류2023-05-01
연번업체명연락처주소대표품목비고데이터 기준일자
7778소문난떡집031-274-1363기흥구 구갈동 594떡류가공업떡류가공업2023-05-01
7879민속떡방앗간031-274-8183기흥구 마북동 171-2 삼가동 가동 2호떡류가공업떡류가공업2023-05-01
7980떡시루031-286-1218기흥구 마북동 524-8떡류가공업떡류가공업2023-05-01
8081떡사랑031-283-7712기흥구 보정동 906-3떡류가공업떡류가공업2023-05-01
8182자연수떡방031-283-3040기흥구 언남동 336-7떡류가공업떡류가공업2023-05-01
8283떡수레031-285-3945기흥구 언남동 336-7떡류가공업떡류가공업2023-05-01
8384황금시루031-284-8233기흥구 언남동 416-5 푸른종합상가 101호떡류가공업떡류가공업2023-05-01
8485풍년떡집031-274-0045기흥구 상갈동 466-8떡류가공업떡류가공업2023-05-01
8586궁중떡방031-285-6464기흥구 상갈동 466-1떡류가공업떡류가공업2023-05-01
8687민속촌 떡 방앗간031-286-6883기흥구 보라동 417-1떡류가공업떡류가공업2023-05-01