Overview

Dataset statistics

Number of variables: 3
Number of observations: 21
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 657.0 B
Average record size in memory: 31.3 B
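
The average record size follows from the two figures above: total size in memory divided by the number of observations. A minimal sketch of that arithmetic, with variable names chosen here for illustration only:

```python
# Figures copied from the "Dataset statistics" table of this report.
total_size_bytes = 657.0
n_observations = 21

# Average record size = total size in memory / number of observations.
avg_record_size = total_size_bytes / n_observations
print(f"{avg_record_size:.1f} B")  # prints "31.3 B"
```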

Variable types

Text: 2
Numeric: 1

Dataset

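Description: Status of corporate taxi companies in Jung-gu, Daejeon Metropolitan City. The dataset provides the name of each corporate taxi company, its telephone number, and the number of vehicles it operates.
URL: https://www.data.go.kr/data/15119617/fileData.do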

Alerts

법인명 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 10:27:03.405803
Analysis finished: 2023-12-12 10:27:03.782449
Duration: 0.38 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

법인명 (company name)
Text

UNIQUE 

Distinct: 21
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B

Length

Max length: 9
Median length: 7
Mean length: 6.5714286
Min length: 5

Characters and Unicode

Total characters: 138
Distinct characters: 43
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
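
To illustrate how such per-character properties can be looked up, the sketch below counts Unicode general categories for two company names taken from the Sample rows of this report. It uses Python's standard `unicodedata` module as a stand-in. Whether the profiling tool performs the lookup this way is an assumption, and the `LONG_NAMES` mapping is a hypothetical helper introduced only for this example.

```python
import unicodedata
from collections import Counter

# Two company names copied from the Sample rows of this report.
names = ["경전기업(합)", "대전택시㈜"]

# Hypothetical helper: maps two-letter general category codes to the
# long names displayed in the report.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "So": "Other Symbol",
    "Zs": "Space Separator",
}

# Count the general category of every character in every name.
category_counts = Counter(
    LONG_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)
print(category_counts)
```

For these two names the Hangul syllables fall under Other Letter, the parentheses under Open and Close Punctuation, and ㈜ under Other Symbol.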

Unique

Unique: 21
Unique (%): 100.0%

Sample

1st row: 경전기업(합)
2nd row: 광덕운수(자)
3rd row: 대광상운(자)
4th row: 대전택시㈜
5th row: 대종운수(자)

Value              Count  Frequency (%)
경전기업(합        1      4.8%
안전교통㈜         1      4.8%
독립택시(합        1      4.8%
동건상운(합        1      4.8%
동산운수㈜         1      4.8%
경신운수(합        1      4.8%
현대육운(합        1      4.8%
현대상운(자        1      4.8%
한일운수(자        1      4.8%
우리희망(조합      1      4.8%
Other values (11)  11     52.4%

Most occurring characters

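Value                  Count  Frequency (%)
(                      15     10.9%
)                      15     10.9%
[glyph not preserved]  11     8.0%
[glyph not preserved]  9      6.5%
[glyph not preserved]  6      4.3%
[glyph not preserved]  6      4.3%
[glyph not preserved]  6      4.3%
[glyph not preserved]  5      3.6%
[glyph not preserved]  5      3.6%
[glyph not preserved]  5      3.6%
Other values (33)      55     39.9%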

Most occurring categories

Value              Count  Frequency (%)
Other Letter       100    72.5%
Open Punctuation   15     10.9%
Close Punctuation  15     10.9%
Other Symbol       6      4.3%
Space Separator    2      1.4%

Most frequent character per category

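Other Letter
Value                  Count  Frequency (%)
[glyph not preserved]  11     11.0%
[glyph not preserved]  9      9.0%
[glyph not preserved]  6      6.0%
[glyph not preserved]  6      6.0%
[glyph not preserved]  5      5.0%
[glyph not preserved]  5      5.0%
[glyph not preserved]  5      5.0%
[glyph not preserved]  4      4.0%
[glyph not preserved]  4      4.0%
[glyph not preserved]  3      3.0%
Other values (29)      42     42.0%

Open Punctuation
Value  Count  Frequency (%)
(      15     100.0%

Close Punctuation
Value  Count  Frequency (%)
)      15     100.0%

Other Symbol
Value  Count  Frequency (%)
㈜     6      100.0%

Space Separator
Value    Count  Frequency (%)
(space)  2      100.0%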

Most occurring scripts

Value   Count  Frequency (%)
Hangul  106    76.8%
Common  32     23.2%

Most frequent character per script

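Hangul
Value                  Count  Frequency (%)
[glyph not preserved]  11     10.4%
[glyph not preserved]  9      8.5%
[glyph not preserved]  6      5.7%
[glyph not preserved]  6      5.7%
[glyph not preserved]  6      5.7%
[glyph not preserved]  5      4.7%
[glyph not preserved]  5      4.7%
[glyph not preserved]  5      4.7%
[glyph not preserved]  4      3.8%
[glyph not preserved]  4      3.8%
Other values (30)      45     42.5%

Common
Value    Count  Frequency (%)
(        15     46.9%
)        15     46.9%
(space)  2      6.2%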

Most occurring blocks

Value   Count  Frequency (%)
Hangul  100    72.5%
ASCII   32     23.2%
None    6      4.3%

Most frequent character per block

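ASCII
Value    Count  Frequency (%)
(        15     46.9%
)        15     46.9%
(space)  2      6.2%

Hangul
Value                  Count  Frequency (%)
[glyph not preserved]  11     11.0%
[glyph not preserved]  9      9.0%
[glyph not preserved]  6      6.0%
[glyph not preserved]  6      6.0%
[glyph not preserved]  5      5.0%
[glyph not preserved]  5      5.0%
[glyph not preserved]  5      5.0%
[glyph not preserved]  4      4.0%
[glyph not preserved]  4      4.0%
[glyph not preserved]  3      3.0%
Other values (29)      42     42.0%

None
Value  Count  Frequency (%)
㈜     6      100.0%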

전화번호 (telephone number)
Text

Distinct: 20
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 252
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 19
Unique (%): 90.5%
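
The Distinct and Unique counts differ because Distinct counts the different values present, while Unique counts only values that occur exactly once. With 21 rows, 20 distinct values and 19 unique values, exactly one number is shared by two rows (042-625-2222 has a count of 2 in this report). The sketch below shows the distinction on a handful of rows copied from the Sample section, not on the full column, so its counts are smaller than those reported above.

```python
from collections import Counter

# A subset of 전화번호 values from the Sample section (rows 0, 7, 17, 19, 20).
phones = [
    "042-271-0718",
    "042-625-2222",
    "042-625-2222",
    "042-272-8441",
    "042-621-5252",
]

counts = Counter(phones)

# Distinct: number of different values.
n_distinct = len(counts)
# Unique: number of values that appear exactly once.
n_unique = sum(1 for c in counts.values() if c == 1)

print(n_distinct, n_unique)  # prints "4 3"
```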

Sample

1st row: 042-271-0718
2nd row: 042-252-7752
3rd row: 042-271-1571
4th row: 042-522-4565
5th row: 042-255-9040

Value              Count  Frequency (%)
042-625-2222       2      9.5%
042-271-0718       1      4.8%
042-223-9331       1      4.8%
042-272-8441       1      4.8%
042-824-8441       1      4.8%
042-533-2108       1      4.8%
042-253-6119       1      4.8%
042-255-8888       1      4.8%
042-581-2700       1      4.8%
042-226-7434       1      4.8%
Other values (10)  10     47.6%

Most occurring characters

Value  Count  Frequency (%)
2      60     23.8%
-      42     16.7%
4      32     12.7%
0      30     11.9%
5      27     10.7%
8      16     6.3%
1      15     6.0%
6      9      3.6%
7      9      3.6%
3      8      3.2%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    210    83.3%
Dash Punctuation  42     16.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
2      60     28.6%
4      32     15.2%
0      30     14.3%
5      27     12.9%
8      16     7.6%
1      15     7.1%
6      9      4.3%
7      9      4.3%
3      8      3.8%
9      4      1.9%

Dash Punctuation
Value  Count  Frequency (%)
-      42     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  252    100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
2      60     23.8%
-      42     16.7%
4      32     12.7%
0      30     11.9%
5      27     10.7%
8      16     6.3%
1      15     6.0%
6      9      3.6%
7      9      3.6%
3      8      3.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  252    100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
2      60     23.8%
-      42     16.7%
4      32     12.7%
0      30     11.9%
5      27     10.7%
8      16     6.3%
1      15     6.0%
6      9      3.6%
7      9      3.6%
3      8      3.2%

운행대수 (number of vehicles in operation)
Real number (ℝ)

Distinct: 18
Distinct (%): 85.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 55.142857
Minimum: 33
Maximum: 94
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 321.0 B

Quantile statistics

Minimum: 33
5-th percentile: 34
Q1: 41
Median: 53
Q3: 66
95-th percentile: 83
Maximum: 94
Range: 61
Interquartile range (IQR): 25
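
These quantile figures can be re-derived from the 21 values themselves. The sketch below reassembles the sorted values from the minimum and maximum value listings of this report (the two lists overlap, and the result has 21 entries summing to 1158, matching the reported count and sum). It uses linear interpolation between order statistics via Python's `statistics.quantiles` with `method="inclusive"`. That this is the interpolation rule the report used is an assumption, although it reproduces the reported numbers here.

```python
import statistics

# The 21 운행대수 values in ascending order, reassembled from the
# minimum and maximum value listings of this report.
values = [33, 34, 34, 39, 40, 41, 41, 42, 45, 50, 53,
          56, 56, 58, 63, 66, 69, 80, 81, 83, 94]

# Quartiles with linear interpolation between order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# 5th and 95th percentiles are the first and last of the 19 cut points
# that divide the data into 20 equal-probability groups.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(values) - min(values)  # Range = maximum - minimum
iqr = q3 - q1                            # IQR = Q3 - Q1

print(p5, q1, median, q3, p95, value_range, iqr)
```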

Descriptive statistics

Standard deviation: 18.075635
Coefficient of variation (CV): 0.32779649
Kurtosis: -0.56847504
Mean: 55.142857
Median Absolute Deviation (MAD): 13
Skewness: 0.66276949
Sum: 1158
Variance: 326.72857
Monotonicity: Not monotonic
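
Several of these statistics are related by simple formulas: the variance is the square of the standard deviation, the coefficient of variation is the standard deviation divided by the mean, and the MAD is the median of absolute deviations from the median. The sketch below checks them with Python's `statistics` module, using the 21 values reassembled from the minimum and maximum value listings of this report. The use of the sample (n - 1) standard deviation is an assumption that happens to match the reported figure.

```python
import statistics

# The 21 운행대수 values, reassembled from the minimum and maximum
# value listings of this report.
values = [33, 34, 34, 39, 40, 41, 41, 42, 45, 50, 53,
          56, 56, 58, 63, 66, 69, 80, 81, 83, 94]

total = sum(values)                     # Sum
mean = statistics.mean(values)          # Sum / number of observations
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

# Median absolute deviation: median of |value - median|.
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(total, round(mean, 6), round(std, 6), round(cv, 8), mad)
```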
Histogram with fixed size bins (bins=18)

Common values
Value             Count  Frequency (%)
34                2      9.5%
41                2      9.5%
56                2      9.5%
53                1      4.8%
94                1      4.8%
58                1      4.8%
50                1      4.8%
40                1      4.8%
39                1      4.8%
33                1      4.8%
Other values (8)  8      38.1%

Minimum 10 values
Value  Count  Frequency (%)
33     1      4.8%
34     2      9.5%
39     1      4.8%
40     1      4.8%
41     2      9.5%
42     1      4.8%
45     1      4.8%
50     1      4.8%
53     1      4.8%
56     2      9.5%

Maximum 10 values
Value  Count  Frequency (%)
94     1      4.8%
83     1      4.8%
81     1      4.8%
80     1      4.8%
69     1      4.8%
66     1      4.8%
63     1      4.8%
58     1      4.8%
56     2      9.5%
53     1      4.8%

Interactions


Correlations

          법인명  전화번호  운행대수
법인명    1.000   1.000     1.000
전화번호  1.000   1.000     0.304
운행대수  1.000   0.304     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
    법인명          전화번호      운행대수
0   경전기업(합)    042-271-0718  53
1   광덕운수(자)    042-252-7752  56
2   대광상운(자)    042-271-1571  69
3   대전택시㈜      042-522-4565  41
4   대종운수(자)    042-255-9040  45
5   동양택시(자)    042-584-5656  83
6   동일운수(자)    042-522-5155  56
7   장성기업㈜      042-625-2222  66
8   삼경택시(자)    042-583-8201  80
9   신광상운㈜      042-224-2288  63

Last rows
    법인명          전화번호      운행대수
11  안전교통㈜      042-223-9331  41
12  우리희망(조합)  042-226-7434  42
13  한일운수(자)    042-581-2700  94
14  현대상운(자)    042-255-8888  33
15  현대육운(합)    042-253-6119  39
16  경신운수(합)    042-533-2108  40
17  동산운수㈜      042-625-2222  50
18  동건상운(합)    042-824-8441  58
19  독립택시(합)    042-272-8441  34
20  ㈜성연기업      042-621-5252  34