Overview

Dataset statistics

Number of variables3
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows8
Duplicate rows (%)8.0%
Total size in memory2.7 KiB
Average record size in memory27.3 B

Variable types

Numeric2
Text1

Dataset

Description울산광역시 승용차요일제 RFID 시스템 자동차마스터에 대한 데이터입니다. 차명, 차종, 연식의 내용을 포함합니다.
URLhttps://www.data.go.kr/data/15122248/fileData.do

Alerts

Dataset has 8 (8.0%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 15:34:06.447642
Analysis finished2023-12-12 15:34:07.509992
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

차종
Real number (ℝ)

Distinct13
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.02
Minimum30
Maximum361
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T00:34:07.585131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile30
Q1112
median113
Q3132
95-th percentile302.1
Maximum361
Range331
Interquartile range (IQR)20

Descriptive statistics

Standard deviation66.035953
Coefficient of variation (CV)0.51582528
Kurtosis6.2047334
Mean128.02
Median Absolute Deviation (MAD)1
Skewness2.2912519
Sum12802
Variance4360.7471
MonotonicityNot monotonic
2023-12-13T00:34:07.787310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
112 34
34.0%
113 13
 
13.0%
114 12
 
12.0%
132 9
 
9.0%
30 8
 
8.0%
111 7
 
7.0%
361 4
 
4.0%
133 4
 
4.0%
152 2
 
2.0%
153 2
 
2.0%
Other values (3) 5
 
5.0%
ValueCountFrequency (%)
30 8
 
8.0%
111 7
 
7.0%
112 34
34.0%
113 13
 
13.0%
114 12
 
12.0%
132 9
 
9.0%
133 4
 
4.0%
152 2
 
2.0%
153 2
 
2.0%
212 2
 
2.0%
ValueCountFrequency (%)
361 4
 
4.0%
342 1
 
1.0%
300 2
 
2.0%
212 2
 
2.0%
153 2
 
2.0%
152 2
 
2.0%
133 4
 
4.0%
132 9
9.0%
114 12
12.0%
113 13
13.0%

차명
Text

Distinct64
Distinct (%)64.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T00:34:08.145746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length16
Mean length8.6
Min length2

Characters and Unicode

Total characters860
Distinct characters132
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)45.0%

Sample

1st row아반떼 (AVANTE)
2nd row아반떼오토메틱
3rd row카렌스LPG
4th row마티즈
5th row마티즈 0.8S AT
ValueCountFrequency (%)
마티즈 7
 
4.8%
그랜저(grandeur 6
 
4.1%
at 6
 
4.1%
쏘나타 6
 
4.1%
sonata 6
 
4.1%
아반떼(avante 5
 
3.4%
포터ⅱ(porterⅱ 4
 
2.7%
아반떼 4
 
2.7%
0.8s 4
 
2.7%
new 3
 
2.0%
Other values (68) 96
65.3%
2023-12-13T00:34:08.682274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 51
 
5.9%
47
 
5.5%
N 35
 
4.1%
( 34
 
4.0%
) 34
 
4.0%
E 33
 
3.8%
T 33
 
3.8%
S 26
 
3.0%
R 24
 
2.8%
O 22
 
2.6%
Other values (122) 521
60.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 336
39.1%
Other Letter 332
38.6%
Space Separator 47
 
5.5%
Decimal Number 47
 
5.5%
Open Punctuation 34
 
4.0%
Close Punctuation 34
 
4.0%
Other Punctuation 15
 
1.7%
Letter Number 11
 
1.3%
Lowercase Letter 2
 
0.2%
Dash Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
5.4%
15
 
4.5%
15
 
4.5%
14
 
4.2%
14
 
4.2%
14
 
4.2%
12
 
3.6%
12
 
3.6%
11
 
3.3%
9
 
2.7%
Other values (83) 198
59.6%
Uppercase Letter
ValueCountFrequency (%)
A 51
15.2%
N 35
10.4%
E 33
9.8%
T 33
9.8%
S 26
 
7.7%
R 24
 
7.1%
O 22
 
6.5%
D 17
 
5.1%
G 17
 
5.1%
U 10
 
3.0%
Other values (12) 68
20.2%
Decimal Number
ValueCountFrequency (%)
0 12
25.5%
5 9
19.1%
2 8
17.0%
1 8
17.0%
8 4
 
8.5%
3 3
 
6.4%
6 2
 
4.3%
4 1
 
2.1%
Other Punctuation
ValueCountFrequency (%)
. 14
93.3%
¡ 1
 
6.7%
Letter Number
ValueCountFrequency (%)
9
81.8%
2
 
18.2%
Space Separator
ValueCountFrequency (%)
47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Lowercase Letter
ValueCountFrequency (%)
i 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 349
40.6%
Hangul 332
38.6%
Common 179
20.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
5.4%
15
 
4.5%
15
 
4.5%
14
 
4.2%
14
 
4.2%
14
 
4.2%
12
 
3.6%
12
 
3.6%
11
 
3.3%
9
 
2.7%
Other values (83) 198
59.6%
Latin
ValueCountFrequency (%)
A 51
14.6%
N 35
10.0%
E 33
9.5%
T 33
9.5%
S 26
 
7.4%
R 24
 
6.9%
O 22
 
6.3%
D 17
 
4.9%
G 17
 
4.9%
U 10
 
2.9%
Other values (15) 81
23.2%
Common
ValueCountFrequency (%)
47
26.3%
( 34
19.0%
) 34
19.0%
. 14
 
7.8%
0 12
 
6.7%
5 9
 
5.0%
2 8
 
4.5%
1 8
 
4.5%
8 4
 
2.2%
3 3
 
1.7%
Other values (4) 6
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 516
60.0%
Hangul 332
38.6%
Number Forms 11
 
1.3%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 51
 
9.9%
47
 
9.1%
N 35
 
6.8%
( 34
 
6.6%
) 34
 
6.6%
E 33
 
6.4%
T 33
 
6.4%
S 26
 
5.0%
R 24
 
4.7%
O 22
 
4.3%
Other values (26) 177
34.3%
Hangul
ValueCountFrequency (%)
18
 
5.4%
15
 
4.5%
15
 
4.5%
14
 
4.2%
14
 
4.2%
14
 
4.2%
12
 
3.6%
12
 
3.6%
11
 
3.3%
9
 
2.7%
Other values (83) 198
59.6%
Number Forms
ValueCountFrequency (%)
9
81.8%
2
 
18.2%
None
ValueCountFrequency (%)
¡ 1
100.0%

연식
Real number (ℝ)

Distinct16
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2004.92
Minimum1995
Maximum2012
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T00:34:08.854707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1995
5-th percentile1999
Q12001
median2005
Q32008.25
95-th percentile2012
Maximum2012
Range17
Interquartile range (IQR)7.25

Descriptive statistics

Standard deviation4.3616325
Coefficient of variation (CV)0.0021754646
Kurtosis-0.97670919
Mean2004.92
Median Absolute Deviation (MAD)4
Skewness-0.021607124
Sum200492
Variance19.023838
MonotonicityNot monotonic
2023-12-13T00:34:09.039659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
2006 12
12.0%
2002 12
12.0%
2000 9
9.0%
2001 9
9.0%
2008 8
8.0%
2012 8
8.0%
2009 8
8.0%
1999 6
 
6.0%
2005 6
 
6.0%
2011 6
 
6.0%
Other values (6) 16
16.0%
ValueCountFrequency (%)
1995 1
 
1.0%
1996 2
 
2.0%
1999 6
6.0%
2000 9
9.0%
2001 9
9.0%
2002 12
12.0%
2003 4
 
4.0%
2004 2
 
2.0%
2005 6
6.0%
2006 12
12.0%
ValueCountFrequency (%)
2012 8
8.0%
2011 6
6.0%
2010 3
 
3.0%
2009 8
8.0%
2008 8
8.0%
2007 4
 
4.0%
2006 12
12.0%
2005 6
6.0%
2004 2
 
2.0%
2003 4
 
4.0%

Interactions

2023-12-13T00:34:06.966328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:34:06.673198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:34:07.114976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T00:34:06.821344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:34:09.176857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차종차명연식
차종1.0000.9480.218
차명0.9481.0000.682
연식0.2180.6821.000
2023-12-13T00:34:09.312839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차종연식
차종1.000-0.104
연식-0.1041.000

Missing values

2023-12-13T00:34:07.322972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:34:07.458323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

차종차명연식
0112아반떼 (AVANTE)2012
1111아반떼오토메틱1996
2152카렌스LPG2000
3114마티즈2001
4114마티즈 0.8S AT2009
530포터엘피지초장축1999
6132싼타페2002
7153카니발2002
8112SM52012
9114비스토2000
차종차명연식
90113BMW530¡2002
91114레이2012
92112아반떼(AVANTE)2009
93112쏘나타 (SONATA)2012
94112아반떼 (AVANTE)2012
95114마티즈 0.8S AT2006
96114비스토1999
9730봉고Ⅲ 1톤2008
98133허머 H2 SUT2005
99112SM32010

Duplicate rows

Most frequently occurring

차종차명연식# duplicates
4112아반떼(AVANTE)20093
0112라세티 1.6D AT20062
1112쏘나타 (SONATA)20082
2112쏘나타(SONATA)20062
3112아반떼 (AVANTE)20122
5113그랜저(GRANDEUR)20112
6114마티즈20012
7132스포티지20052