Overview

Dataset statistics

Number of variables: 5
Number of observations: 118
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.9 KiB
Average record size in memory: 42.1 B

Variable types

Numeric: 1
Categorical: 3
Text: 1

Dataset

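Description: Registration status of telephone solicitation (telemarketing) sales businesses in Seo-gu, Daejeon Metropolitan City. Columns: 순번 (serial number), 대표자명 (representative name), 법인또는상호 (corporation or trade name), 취급품목 (items handled), 데이터기준일자 (data reference date).
URL: https://www.data.go.kr/data/15113154/fileData.do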

Alerts

데이터기준일자 has constant value "2022-09-13" (Constant)
순번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 23:29:32.860044
Analysis finished: 2023-12-11 23:29:33.337094
Duration: 0.48 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 118
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 59.5
Minimum: 1
Maximum: 118
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.85
Q1: 30.25
Median: 59.5
Q3: 88.75
95-th percentile: 112.15
Maximum: 118
Range: 117
Interquartile range (IQR): 58.5
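
The quantiles above are consistent with linear interpolation between order statistics. The following is a minimal sketch, assuming that 순번 holds exactly the integers 1 to 118 (which the minimum, maximum and distinct count suggest) and assuming the linear-interpolation convention. The helper name `percentile` is illustrative and is not the profiler's own code.

```python
import math


def percentile(sorted_values, q):
    """Return the q-th percentile (0 <= q <= 100) using linear interpolation."""
    if not sorted_values:
        raise ValueError("percentile() requires at least one value")
    # Fractional position of the requested percentile in the sorted data.
    pos = (len(sorted_values) - 1) * q / 100.0
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * frac


# Assumption: the column is the integers 1..118 in ascending order.
values = list(range(1, 119))

q05 = percentile(values, 5)
q1 = percentile(values, 25)
median = percentile(values, 50)
q3 = percentile(values, 75)
q95 = percentile(values, 95)
iqr = q3 - q1                          # interquartile range
value_range = values[-1] - values[0]   # maximum minus minimum
```

Under these assumptions the computed values can be compared directly with the figures listed in this section.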

Descriptive statistics

Standard deviation: 34.207699
Coefficient of variation (CV): 0.57491931
Kurtosis: -1.2
Mean: 59.5
Median Absolute Deviation (MAD): 29.5
Skewness: 0
Sum: 7021
Variance: 1170.1667
Monotonicity: Strictly increasing
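
Most of the descriptive statistics above can be recomputed with the Python standard library. This is a sketch under the assumption that 순번 is exactly the integers 1 to 118, and that the reported variance is the sample variance (divisor n - 1), which is what the reported value of 1170.1667 matches for that data.

```python
import math
import statistics

# Assumption: the column is the integers 1..118.
values = list(range(1, 119))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divisor n - 1
std_dev = math.sqrt(variance)
cv = std_dev / mean                      # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
med = statistics.median(values)
mad = statistics.median(abs(v - med) for v in values)

# Strictly increasing means every value is greater than its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))
```

Skewness and kurtosis are left out of the sketch because they depend on the estimator convention, which this report does not state.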
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
1                   1      0.8%
76                  1      0.8%
88                  1      0.8%
87                  1      0.8%
86                  1      0.8%
85                  1      0.8%
84                  1      0.8%
83                  1      0.8%
82                  1      0.8%
81                  1      0.8%
Other values (108)  108    91.5%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.8%
2      1      0.8%
3      1      0.8%
4      1      0.8%
5      1      0.8%
6      1      0.8%
7      1      0.8%
8      1      0.8%
9      1      0.8%
10     1      0.8%

Maximum 10 values

Value  Count  Frequency (%)
118    1      0.8%
117    1      0.8%
116    1      0.8%
115    1      0.8%
114    1      0.8%
113    1      0.8%
112    1      0.8%
111    1      0.8%
110    1      0.8%
109    1      0.8%

대표자명
Categorical

Distinct: 38
Distinct (%): 32.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value              Count
김**               24
이**               17
최**               9
박**               8
조**               6
Other values (33)  54

Length

Max length: 7
Median length: 3
Mean length: 3.1779661
Min length: 3

Unique

Unique: 24
Unique (%): 20.3%
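
The figures in this report are consistent with Distinct counting the different values in a column and Unique counting only those values that occur exactly once (for example, the constant column 데이터기준일자 has 1 distinct value and 0 unique values). The sketch below illustrates that reading on the five sample rows listed for this variable. The function name `distinct_and_unique` is hypothetical and is not part of any library.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values occurring exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# The five sample rows of 대표자명 shown in this report.
sample = ["조**", "고**", "김**,이**", "남**", "조**"]

n_distinct, n_unique = distinct_and_unique(sample)
distinct_pct = 100.0 * n_distinct / len(sample)
unique_pct = 100.0 * n_unique / len(sample)

# The reported percentages follow from the reported counts over 118 rows.
reported_distinct_pct = round(100 * 38 / 118, 1)
reported_unique_pct = round(100 * 24 / 118, 1)
```

In the sample, 조** appears twice, so it counts as distinct but not as unique.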

Sample

1st row: 조**
2nd row: 고**
3rd row: 김**,이**
4th row: 남**
5th row: 조**

Common Values

Value              Count  Frequency (%)
김**               24     20.3%
이**               17     14.4%
최**               9      7.6%
박**               8      6.8%
조**               6      5.1%
송**               5      4.2%
고**               5      4.2%
전**               4      3.4%
정**               3      2.5%
신**               3      2.5%
Other values (28)  34     28.8%

Length

Histogram of lengths of the category
Value              Count  Frequency (%)
                   24     20.3%
                   17     14.4%
                   9      7.6%
                   8      6.8%
                   6      5.1%
                   5      4.2%
                   5      4.2%
                   4      3.4%
                   3      2.5%
                   3      2.5%
Other values (27)  34     28.8%

법인또는상호
Text

Distinct: 117
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 40
Median length: 18
Mean length: 9.0338983
Min length: 2

Characters and Unicode

Total characters: 1066
Distinct characters: 236
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
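
As an illustration of how such character properties can be read, the sketch below counts characters by Unicode general category using Python's standard `unicodedata` module. It is applied to one company name that appears in this report's sample rows. The function `category_counts` and the `CATEGORY_NAMES` mapping are illustrative helpers, not the profiler's implementation.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Nd": "Decimal Number",
    "So": "Other Symbol",
    "Po": "Other Punctuation",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


name = "주식회사 위에드(We add)"
counts = category_counts(name)

# Translate the two-letter codes into the long names used in the tables.
named = {CATEGORY_NAMES.get(code, code): n for code, n in counts.items()}
```

Hangul syllables fall under Other Letter because Hangul has no upper or lower case, which is why that category dominates this column.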

Unique

Unique: 116
Unique (%): 98.3%

Sample

1st row: 주식회사 우경아이앤씨
2nd row: 주식회사 바이오온
3rd row: 주식회사 티에이치
4th row: 에프엠통신
5th row: 스마트자산원

Value               Count  Frequency (%)
주식회사            46     24.3%
                    5      2.6%
케이엠에스모바일    2      1.1%
                    2      1.1%
주)도솔종합개발     1      0.5%
인터월드            1      0.5%
울릉산림조합유통    1      0.5%
커플국제결혼        1      0.5%
한국미래에너지      1      0.5%
설레임              1      0.5%
Other values (128)  128    67.7%

Most occurring characters

Value               Count  Frequency (%)
                    71     6.7%
                    64     6.0%
                    51     4.8%
                    47     4.4%
                    47     4.4%
                    35     3.3%
                    31     2.9%
(                   25     2.3%
)                   25     2.3%
                    17     1.6%
Other values (226)  653    61.3%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       830    77.9%
Uppercase Letter   73     6.8%
Space Separator    71     6.7%
Open Punctuation   25     2.3%
Close Punctuation  25     2.3%
Lowercase Letter   20     1.9%
Decimal Number     18     1.7%
Other Symbol       2      0.2%
Other Punctuation  2      0.2%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
                    64     7.7%
                    51     6.1%
                    47     5.7%
                    47     5.7%
                    35     4.2%
                    31     3.7%
                    17     2.0%
                    16     1.9%
                    14     1.7%
                    12     1.4%
Other values (186)  496    59.8%
Uppercase Letter
Value             Count  Frequency (%)
I                 12     16.4%
T                 9      12.3%
C                 9      12.3%
N                 7      9.6%
M                 5      6.8%
O                 4      5.5%
S                 4      5.5%
A                 3      4.1%
U                 3      4.1%
R                 3      4.1%
Other values (9)  14     19.2%
Lowercase Letter
Value             Count  Frequency (%)
d                 4      20.0%
a                 2      10.0%
o                 2      10.0%
e                 2      10.0%
n                 2      10.0%
y                 1      5.0%
p                 1      5.0%
m                 1      5.0%
c                 1      5.0%
r                 1      5.0%
Other values (3)  3      15.0%
Decimal Number
Value  Count  Frequency (%)
1      12     66.7%
4      6      33.3%
Other Punctuation
Value  Count  Frequency (%)
/      1      50.0%
.      1      50.0%
Space Separator
Value  Count  Frequency (%)
       71     100.0%
Open Punctuation
Value  Count  Frequency (%)
(      25     100.0%
Close Punctuation
Value  Count  Frequency (%)
)      25     100.0%
Other Symbol
Value  Count  Frequency (%)
       2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  832    78.0%
Common  141    13.2%
Latin   93     8.7%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
                    64     7.7%
                    51     6.1%
                    47     5.6%
                    47     5.6%
                    35     4.2%
                    31     3.7%
                    17     2.0%
                    16     1.9%
                    14     1.7%
                    12     1.4%
Other values (187)  498    59.9%
Latin
Value              Count  Frequency (%)
I                  12     12.9%
T                  9      9.7%
C                  9      9.7%
N                  7      7.5%
M                  5      5.4%
O                  4      4.3%
S                  4      4.3%
d                  4      4.3%
A                  3      3.2%
U                  3      3.2%
Other values (22)  33     35.5%
Common
Value  Count  Frequency (%)
       71     50.4%
(      25     17.7%
)      25     17.7%
1      12     8.5%
4      6      4.3%
/      1      0.7%
.      1      0.7%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  830    77.9%
ASCII   234    22.0%
None    2      0.2%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
                   71     30.3%
(                  25     10.7%
)                  25     10.7%
I                  12     5.1%
1                  12     5.1%
T                  9      3.8%
C                  9      3.8%
N                  7      3.0%
4                  6      2.6%
M                  5      2.1%
Other values (29)  53     22.6%
Hangul
Value               Count  Frequency (%)
                    64     7.7%
                    51     6.1%
                    47     5.7%
                    47     5.7%
                    35     4.2%
                    31     3.7%
                    17     2.0%
                    16     1.9%
                    14     1.7%
                    12     1.4%
Other values (186)  496    59.8%
None
Value  Count  Frequency (%)
       2      100.0%

취급품목
Categorical

Distinct: 23
Distinct (%): 19.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value                     Count
기타                      61
건강식품                  16
통신기기                  14
건강식품 화장품/미용용품  3
화장품/미용용품           3
Other values (18)         21

Length

Max length: 25
Median length: 2
Mean length: 4.3983051
Min length: 2

Unique

Unique: 15
Unique (%): 12.7%

Sample

1st row: 통신기기
2nd row: 건강식품
3rd row: 기타
4th row: 통신기기
5th row: 기타

Common Values

Value                     Count  Frequency (%)
기타                      61     51.7%
건강식품                  16     13.6%
통신기기                  14     11.9%
건강식품 화장품/미용용품  3      2.5%
화장품/미용용품           3      2.5%
컴퓨터/사무용품 통신기기  2      1.7%
컴퓨터/사무용품           2      1.7%
<NA>                      2      1.7%
가전                      1      0.8%
통신기기 건강식품 기타    1      0.8%
Other values (13)         13     11.0%

Length

Histogram of lengths of the category
Value            Count  Frequency (%)
기타             70     48.6%
건강식품         25     17.4%
통신기기         22     15.3%
화장품/미용용품  9      6.2%
컴퓨터/사무용품  7      4.9%
가전             6      4.2%
na               2      1.4%
의류/패션        1      0.7%
교육/도서        1      0.7%
회원권/상품권    1      0.7%
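
The counts in the table above sum to 144, more than the 118 rows, which is consistent with each 취급품목 value being split on whitespace and the resulting tokens counted, since one cell can list several item categories (for example 건강식품 화장품/미용용품). The sketch below shows that tokenization under this assumption, using the 취급품목 values of the first ten rows from this report's sample. The function name `token_counts_for` is hypothetical.

```python
from collections import Counter


def token_counts_for(cells):
    """Split each cell on whitespace and count every resulting token."""
    counts = Counter()
    for cell in cells:
        counts.update(cell.split())
    return counts


# 취급품목 values of the first ten rows shown in this report's sample.
cells = [
    "통신기기",
    "건강식품",
    "기타",
    "통신기기",
    "기타",
    "컴퓨터/사무용품 통신기기",
    "기타",
    "통신기기",
    "기타",
    "통신기기",
]

token_counts = token_counts_for(cells)
total_tokens = sum(token_counts.values())
```

Ten rows yield eleven tokens here because the sixth row lists two categories.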

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value       Count
2022-09-13  118

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-09-13
2nd row: 2022-09-13
3rd row: 2022-09-13
4th row: 2022-09-13
5th row: 2022-09-13

Common Values

Value       Count  Frequency (%)
2022-09-13  118    100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value       Count  Frequency (%)
2022-09-13  118    100.0%

Interactions


Correlations

          순번   대표자명  취급품목
순번      1.000  0.000     0.093
대표자명  0.000  1.000     0.000
취급품목  0.093  0.000     1.000

          취급품목  대표자명
취급품목  1.000     0.000
대표자명  0.000     1.000

          순번   대표자명  취급품목
순번      1.000  0.000     0.000
대표자명  0.000  1.000     0.000
취급품목  0.000  0.000     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index  순번  대표자명   법인또는상호              취급품목                  데이터기준일자
0      1     조**       주식회사 우경아이앤씨     통신기기                  2022-09-13
1      2     고**       주식회사 바이오온         건강식품                  2022-09-13
2      3     김**,이**  주식회사 티에이치         기타                      2022-09-13
3      4     남**       에프엠통신                통신기기                  2022-09-13
4      5     조**       스마트자산원              기타                      2022-09-13
5      6     김**       씨에스 네트워크           컴퓨터/사무용품 통신기기  2022-09-13
6      7     김**       주식회사 와이디컴퍼니     기타                      2022-09-13
7      8     정**       아이비드                  통신기기                  2022-09-13
8      9     이**       주식회사 위에드(We add)   기타                      2022-09-13
9      10    하**       찾아가는통신              통신기기                  2022-09-13

Last rows

Index  순번  대표자명   법인또는상호              취급품목                  데이터기준일자
108    109   윤**       주식회사 이지스           기타                      2022-09-13
109    110   이**       주식회사 두드림           건강식품 화장품/미용용품  2022-09-13
110    111   오**       서비스탑 주식회사         기타                      2022-09-13
111    112   김**       한국모바일인포            기타                      2022-09-13
112    113   최**,김**  (주) 네스                 컴퓨터/사무용품           2022-09-13
113    114   문**       신용협동조합중앙회        기타                      2022-09-13
114    115   이**       화신엔지니어링            기타                      2022-09-13
115    116   홍**       다우정보(주)              기타                      2022-09-13
116    117   김**       (주)한화갤러리아타임월드  기타                      2022-09-13
117    118   박**       (주)케이티씨에스          기타                      2022-09-13