Overview

Dataset statistics

Number of variables5
Number of observations200
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.3 KiB
Average record size in memory42.7 B

Variable types

Text1
Numeric2
Categorical1
DateTime1

Reproduction

Analysis started2023-12-10 06:38:37.889765
Analysis finished2023-12-10 06:38:39.114421
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct55
Distinct (%)27.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-10T15:38:39.393726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters2000
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowT_42224958
2nd rowT_92920772
3rd rowT_42957394
4th rowT_17792596
5th rowT_43250369
ValueCountFrequency (%)
t_42224958 4
 
2.0%
t_67682730 4
 
2.0%
t_67755974 4
 
2.0%
t_18525033 4
 
2.0%
t_92920772 4
 
2.0%
t_15961504 4
 
2.0%
t_41931983 4
 
2.0%
t_92188335 4
 
2.0%
t_17865840 4
 
2.0%
t_91162924 4
 
2.0%
Other values (45) 160
80.0%
2023-12-10T15:38:40.064318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
T 200
10.0%
_ 200
10.0%
6 200
10.0%
9 196
9.8%
4 178
8.9%
1 171
8.6%
2 170
8.5%
8 159
8.0%
5 144
7.2%
7 135
6.8%
Other values (2) 247
12.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1600
80.0%
Uppercase Letter 200
 
10.0%
Connector Punctuation 200
 
10.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 200
12.5%
9 196
12.2%
4 178
11.1%
1 171
10.7%
2 170
10.6%
8 159
9.9%
5 144
9.0%
7 135
8.4%
0 124
7.8%
3 123
7.7%
Uppercase Letter
ValueCountFrequency (%)
T 200
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1800
90.0%
Latin 200
 
10.0%

Most frequent character per script

Common
ValueCountFrequency (%)
_ 200
11.1%
6 200
11.1%
9 196
10.9%
4 178
9.9%
1 171
9.5%
2 170
9.4%
8 159
8.8%
5 144
8.0%
7 135
7.5%
0 124
6.9%
Latin
ValueCountFrequency (%)
T 200
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
T 200
10.0%
_ 200
10.0%
6 200
10.0%
9 196
9.8%
4 178
8.9%
1 171
8.6%
2 170
8.5%
8 159
8.0%
5 144
7.2%
7 135
6.8%
Other values (2) 247
12.3%

위도
Real number (ℝ)

Distinct149
Distinct (%)74.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.481262
Minimum37.373634
Maximum37.674953
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:38:40.442400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.373634
5-th percentile37.41242
Q137.451388
median37.47352
Q337.50663
95-th percentile37.567085
Maximum37.674953
Range0.301319
Interquartile range (IQR)0.0552425

Descriptive statistics

Standard deviation0.050817335
Coefficient of variation (CV)0.0013558064
Kurtosis3.2988183
Mean37.481262
Median Absolute Deviation (MAD)0.026063
Skewness1.2636908
Sum7496.2523
Variance0.0025824015
MonotonicityNot monotonic
2023-12-10T15:38:40.673057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.41242 4
 
2.0%
37.5171 4
 
2.0%
37.46871 4
 
2.0%
37.422707 4
 
2.0%
37.46668 3
 
1.5%
37.52786 3
 
1.5%
37.447147 3
 
1.5%
37.47529 3
 
1.5%
37.47401 3
 
1.5%
37.56709 3
 
1.5%
Other values (139) 166
83.0%
ValueCountFrequency (%)
37.373634 1
 
0.5%
37.373695 1
 
0.5%
37.373756 1
 
0.5%
37.373825 1
 
0.5%
37.406136 1
 
0.5%
37.406273 1
 
0.5%
37.406414 1
 
0.5%
37.40655 1
 
0.5%
37.41242 4
2.0%
37.414936 1
 
0.5%
ValueCountFrequency (%)
37.674953 1
 
0.5%
37.67488 1
 
0.5%
37.67481 1
 
0.5%
37.674732 1
 
0.5%
37.619686 2
1.0%
37.619682 1
 
0.5%
37.56709 3
1.5%
37.567085 1
 
0.5%
37.554916 1
 
0.5%
37.55491 2
1.0%

경도
Real number (ℝ)

Distinct126
Distinct (%)63.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.69228
Minimum126.6014
Maximum126.85201
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-10T15:38:41.001992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.6014
5-th percentile126.62595
Q1126.67046
median126.69039
Q3126.72075
95-th percentile126.74662
Maximum126.85201
Range0.250605
Interquartile range (IQR)0.0502905

Descriptive statistics

Standard deviation0.041928234
Coefficient of variation (CV)0.00033094545
Kurtosis2.8072866
Mean126.69228
Median Absolute Deviation (MAD)0.020995
Skewness0.72025129
Sum25338.456
Variance0.0017579768
MonotonicityNot monotonic
2023-12-10T15:38:41.304607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.67239 5
 
2.5%
126.681755 4
 
2.0%
126.6312 4
 
2.0%
126.688805 4
 
2.0%
126.72075 4
 
2.0%
126.66804 4
 
2.0%
126.68265 4
 
2.0%
126.690384 4
 
2.0%
126.737564 4
 
2.0%
126.732376 4
 
2.0%
Other values (116) 159
79.5%
ValueCountFrequency (%)
126.6014 1
0.5%
126.60171 1
0.5%
126.60202 1
0.5%
126.602325 1
0.5%
126.614845 1
0.5%
126.61492 1
0.5%
126.615005 1
0.5%
126.61509 1
0.5%
126.62593 1
0.5%
126.625946 2
1.0%
ValueCountFrequency (%)
126.852005 1
 
0.5%
126.85192 1
 
0.5%
126.85184 1
 
0.5%
126.85176 1
 
0.5%
126.76268 1
 
0.5%
126.762665 3
1.5%
126.74687 1
 
0.5%
126.74681 1
 
0.5%
126.74661 2
1.0%
126.737564 4
2.0%

광고유형
Categorical

Distinct8
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
제조업
80 
식품_식음료
29 
협회 및 단체
25 
정보알림
18 
일반의약품
15 
Other values (3)
33 

Length

Max length11
Median length9
Mean length5.135
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품_식음료
2nd row관공서 및 지자체
3rd row기타 도매 및 소매업
4th row협회 및 단체
5th row제조업

Common Values

ValueCountFrequency (%)
제조업 80
40.0%
식품_식음료 29
 
14.5%
협회 및 단체 25
 
12.5%
정보알림 18
 
9.0%
일반의약품 15
 
7.5%
관공서 및 지자체 14
 
7.0%
영화_비디오물 11
 
5.5%
기타 도매 및 소매업 8
 
4.0%

Length

2023-12-10T15:38:41.541815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:38:41.737209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제조업 80
26.5%
47
15.6%
식품_식음료 29
 
9.6%
협회 25
 
8.3%
단체 25
 
8.3%
정보알림 18
 
6.0%
일반의약품 15
 
5.0%
관공서 14
 
4.6%
지자체 14
 
4.6%
영화_비디오물 11
 
3.6%
Other values (3) 24
 
7.9%
Distinct5
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
Minimum2019-11-26 11:15:05
Maximum2019-11-26 11:15:09
2023-12-10T15:38:41.931571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:38:42.109176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=5)

Interactions

2023-12-10T15:38:38.501179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:38:38.171946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:38:38.665212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:38:38.341063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:38:42.267294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
택시ID위도경도광고유형측정시간
택시ID1.0001.0001.0001.0000.000
위도1.0001.0000.7450.6410.000
경도1.0000.7451.0000.7210.000
광고유형1.0000.6410.7211.0000.000
측정시간0.0000.0000.0000.0001.000
2023-12-10T15:38:42.436688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도광고유형
위도1.0000.0710.381
경도0.0711.0000.315
광고유형0.3810.3151.000

Missing values

2023-12-10T15:38:38.860722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:38:39.047685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

택시ID위도경도광고유형측정시간
0T_4222495837.447544126.70663식품_식음료2019-11-26 11:15:05
1T_9292077237.461773126.73202관공서 및 지자체2019-11-26 11:15:05
2T_4295739437.485428126.72414기타 도매 및 소매업2019-11-26 11:15:05
3T_1779259637.46871126.6312협회 및 단체2019-11-26 11:15:05
4T_4325036937.528282126.85176제조업2019-11-26 11:15:05
5T_9072346237.41494126.675766제조업2019-11-26 11:15:06
6T_4368983137.461163126.642334제조업2019-11-26 11:15:06
7T_9379969637.452908126.66577협회 및 단체2019-11-26 11:15:06
8T_6658407537.447365126.67419영화_비디오물2019-11-26 11:15:06
9T_9211509237.50169126.72544정보알림2019-11-26 11:15:06
택시ID위도경도광고유형측정시간
190T_4112630237.477436126.63011정보알림2019-11-26 11:15:09
191T_6687705037.490513126.72368관공서 및 지자체2019-11-26 11:15:09
192T_1852503337.5171126.732376제조업2019-11-26 11:15:09
193T_9204184837.530453126.737564일반의약품2019-11-26 11:15:09
194T_9116292437.501514126.762665제조업2019-11-26 11:15:09
195T_4288415037.46668126.680565제조업2019-11-26 11:15:09
196T_1698691637.47357126.69027제조업2019-11-26 11:15:09
197T_9372645337.56709126.6014영화_비디오물2019-11-26 11:15:09
198T_4119954637.456665126.70361제조업2019-11-26 11:15:09
199T_4281090737.41242126.688805제조업2019-11-26 11:15:09