Amino acid dipeptide frequency for Kadipiro virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
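As an illustration of the counting described above, here is a minimal Python sketch. It assumes sequences are given as one-letter strings, that counts are pooled over all proteins, and that any non-standard letter is mapped to X (Xaa); the function names `dipeptide_per_mille` and `classify` are hypothetical and not part of Proteome-pI. The over/under classification simply compares against the uniform expectation of 2.5‰, which is how the text above describes the colouring.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
UNIFORM_EXPECTED = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Report all 441 combinations, including those that never occur.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(STANDARD + "X", repeat=2)}


def classify(freq_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_EXPECTED:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTED:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MALLA", "ALAX"]  # toy sequences, not Kadipiro virus data
    freqs = dipeptide_per_mille(toy)
    print(len(freqs), round(freqs["AL"], 3), classify(freqs["AL"]))
```

With real data, the list would hold the protein sequences of the proteome; values such as AlaAla (6.345‰) would then be classified as overrepresented and AlaTrp (0.159‰) as underrepresented.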

All values are given in per mille (‰); to obtain fractions, multiply them by 10^-3.
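The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and serve only to make the arithmetic explicit; the sample value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 6.345  # AlaAla frequency in per mille, taken from the table
    print(f"{per_mille_to_fraction(ala_ala):.6f}")  # fraction
    print(f"{per_mille_to_percent(ala_ala):.4f}%")  # percent
```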

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.345 ± 1.117
AlaCys: 0.793 ± 0.288
AlaAsp: 3.49 ± 0.594
AlaGlu: 4.442 ± 0.644
AlaPhe: 1.586 ± 0.428
AlaGly: 3.807 ± 0.667
AlaHis: 1.269 ± 0.388
AlaIle: 5.552 ± 1.058
AlaLys: 4.124 ± 1.07
AlaLeu: 7.614 ± 0.818
AlaMet: 0.793 ± 0.278
AlaAsn: 3.807 ± 0.868
AlaPro: 1.745 ± 0.472
AlaGln: 1.11 ± 0.288
AlaArg: 2.221 ± 0.532
AlaSer: 4.918 ± 0.613
AlaThr: 3.648 ± 0.797
AlaVal: 3.966 ± 0.716
AlaTrp: 0.159 ± 0.178
AlaTyr: 2.379 ± 0.651
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.952 ± 0.295
CysCys: 0.317 ± 0.241
CysAsp: 0.952 ± 0.333
CysGlu: 0.476 ± 0.194
CysPhe: 0.317 ± 0.212
CysGly: 0.476 ± 0.209
CysHis: 0.159 ± 0.171
CysIle: 1.269 ± 0.522
CysLys: 0.793 ± 0.297
CysLeu: 0.952 ± 0.336
CysMet: 0.476 ± 0.265
CysAsn: 0.793 ± 0.579
CysPro: 0.317 ± 0.159
CysGln: 0.476 ± 0.31
CysArg: 0.635 ± 0.214
CysSer: 1.745 ± 0.515
CysThr: 0.635 ± 0.284
CysVal: 1.428 ± 0.353
CysTrp: 0.159 ± 0.156
CysTyr: 0.952 ± 0.45
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.173 ± 0.718
AspCys: 0.317 ± 0.2
AspAsp: 5.552 ± 1.1
AspGlu: 5.235 ± 0.794
AspPhe: 2.697 ± 0.527
AspGly: 3.807 ± 0.807
AspHis: 1.745 ± 0.533
AspIle: 4.759 ± 1.21
AspLys: 3.966 ± 0.999
AspLeu: 8.09 ± 1.289
AspMet: 2.379 ± 0.685
AspAsn: 5.235 ± 0.802
AspPro: 1.904 ± 0.561
AspGln: 0.635 ± 0.517
AspArg: 2.855 ± 0.621
AspSer: 4.124 ± 0.644
AspThr: 2.855 ± 0.668
AspVal: 5.235 ± 0.477
AspTrp: 0.317 ± 0.215
AspTyr: 2.062 ± 0.314
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.124 ± 0.624
GluCys: 1.586 ± 0.379
GluAsp: 2.062 ± 0.581
GluGlu: 1.11 ± 0.407
GluPhe: 1.904 ± 0.565
GluGly: 2.855 ± 0.751
GluHis: 1.428 ± 0.542
GluIle: 3.807 ± 0.956
GluLys: 3.49 ± 0.529
GluLeu: 5.076 ± 0.678
GluMet: 2.062 ± 0.519
GluAsn: 3.49 ± 0.618
GluPro: 1.11 ± 0.349
GluGln: 1.904 ± 0.439
GluArg: 2.855 ± 0.606
GluSer: 3.014 ± 0.561
GluThr: 2.379 ± 0.728
GluVal: 4.124 ± 1.061
GluTrp: 0.476 ± 0.282
GluTyr: 2.697 ± 0.844
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.697 ± 0.488
PheCys: 1.269 ± 0.392
PheAsp: 3.648 ± 0.732
PheGlu: 1.904 ± 0.628
PhePhe: 1.428 ± 0.398
PheGly: 1.904 ± 0.431
PheHis: 0.635 ± 0.327
PheIle: 3.173 ± 0.47
PheLys: 2.697 ± 0.773
PheLeu: 3.014 ± 0.572
PheMet: 0.793 ± 0.326
PheAsn: 3.014 ± 0.498
PhePro: 0.635 ± 0.233
PheGln: 0.476 ± 0.301
PheArg: 2.855 ± 0.442
PheSer: 2.697 ± 0.583
PheThr: 1.904 ± 0.444
PheVal: 2.379 ± 0.779
PheTrp: 0.0 ± 0.0
PheTyr: 0.476 ± 0.277
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.331 ± 0.555
GlyCys: 0.159 ± 0.154
GlyAsp: 2.697 ± 0.483
GlyGlu: 2.538 ± 0.455
GlyPhe: 2.221 ± 0.462
GlyGly: 2.379 ± 0.563
GlyHis: 1.428 ± 0.527
GlyIle: 3.966 ± 0.54
GlyLys: 4.283 ± 0.674
GlyLeu: 6.821 ± 0.864
GlyMet: 1.428 ± 0.361
GlyAsn: 3.014 ± 0.736
GlyPro: 1.586 ± 0.411
GlyGln: 1.269 ± 0.447
GlyArg: 2.221 ± 0.44
GlySer: 4.6 ± 0.593
GlyThr: 3.966 ± 0.766
GlyVal: 4.6 ± 0.772
GlyTrp: 0.476 ± 0.239
GlyTyr: 2.855 ± 0.48
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.793 ± 0.334
HisCys: 0.793 ± 0.423
HisAsp: 1.428 ± 0.524
HisGlu: 0.793 ± 0.293
HisPhe: 0.952 ± 0.29
HisGly: 1.11 ± 0.434
HisHis: 0.635 ± 0.47
HisIle: 2.221 ± 0.477
HisLys: 1.586 ± 0.247
HisLeu: 1.904 ± 0.604
HisMet: 0.635 ± 0.253
HisAsn: 1.11 ± 0.388
HisPro: 0.635 ± 0.341
HisGln: 0.476 ± 0.307
HisArg: 0.635 ± 0.198
HisSer: 1.904 ± 0.496
HisThr: 0.952 ± 0.29
HisVal: 1.904 ± 0.442
HisTrp: 0.0 ± 0.0
HisTyr: 0.476 ± 0.338
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.807 ± 1.015
IleCys: 0.476 ± 0.207
IleAsp: 6.187 ± 0.609
IleGlu: 4.283 ± 0.743
IlePhe: 2.221 ± 0.668
IleGly: 3.966 ± 0.655
IleHis: 0.952 ± 0.381
IleIle: 4.442 ± 0.53
IleLys: 5.552 ± 0.625
IleLeu: 5.393 ± 0.734
IleMet: 1.904 ± 0.565
IleAsn: 7.773 ± 1.06
IlePro: 2.697 ± 0.537
IleGln: 2.221 ± 0.484
IleArg: 2.697 ± 0.914
IleSer: 6.504 ± 0.905
IleThr: 4.442 ± 0.679
IleVal: 5.393 ± 1.004
IleTrp: 0.317 ± 0.23
IleTyr: 2.855 ± 0.641
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.49 ± 0.605
LysCys: 1.586 ± 0.54
LysAsp: 2.855 ± 0.427
LysGlu: 3.014 ± 0.572
LysPhe: 3.331 ± 0.685
LysGly: 3.014 ± 0.393
LysHis: 0.952 ± 0.349
LysIle: 6.187 ± 0.8
LysLys: 1.11 ± 0.254
LysLeu: 5.869 ± 0.777
LysMet: 1.269 ± 0.383
LysAsn: 3.014 ± 0.626
LysPro: 2.697 ± 0.713
LysGln: 2.697 ± 0.66
LysArg: 4.442 ± 0.766
LysSer: 5.711 ± 1.398
LysThr: 2.379 ± 0.57
LysVal: 4.124 ± 0.681
LysTrp: 0.635 ± 0.179
LysTyr: 3.966 ± 1.125
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.869 ± 0.952
LeuCys: 1.11 ± 0.387
LeuAsp: 5.235 ± 0.791
LeuGlu: 4.918 ± 0.63
LeuPhe: 3.648 ± 0.69
LeuGly: 5.393 ± 0.772
LeuHis: 1.745 ± 0.761
LeuIle: 7.773 ± 0.863
LeuLys: 6.98 ± 0.856
LeuLeu: 8.407 ± 1.048
LeuMet: 2.379 ± 0.668
LeuAsn: 5.076 ± 0.569
LeuPro: 3.648 ± 0.648
LeuGln: 2.221 ± 0.44
LeuArg: 5.235 ± 0.974
LeuSer: 7.138 ± 0.866
LeuThr: 7.456 ± 1.249
LeuVal: 6.504 ± 0.646
LeuTrp: 0.317 ± 0.196
LeuTyr: 3.331 ± 0.733
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.11 ± 0.396
MetCys: 0.476 ± 0.236
MetAsp: 1.428 ± 0.405
MetGlu: 0.952 ± 0.19
MetPhe: 0.476 ± 0.477
MetGly: 0.952 ± 0.468
MetHis: 0.635 ± 0.258
MetIle: 1.586 ± 0.419
MetLys: 1.586 ± 0.436
MetLeu: 2.538 ± 0.539
MetMet: 0.635 ± 0.307
MetAsn: 2.221 ± 0.82
MetPro: 1.586 ± 0.508
MetGln: 0.317 ± 0.213
MetArg: 1.745 ± 0.559
MetSer: 2.855 ± 0.732
MetThr: 1.428 ± 0.391
MetVal: 2.697 ± 0.392
MetTrp: 0.159 ± 0.133
MetTyr: 0.793 ± 0.315
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.442 ± 0.904
AsnCys: 0.793 ± 0.285
AsnAsp: 5.393 ± 0.852
AsnGlu: 3.49 ± 0.765
AsnPhe: 1.11 ± 0.311
AsnGly: 5.235 ± 0.619
AsnHis: 1.586 ± 0.444
AsnIle: 4.283 ± 0.54
AsnLys: 3.173 ± 0.51
AsnLeu: 5.393 ± 0.981
AsnMet: 1.428 ± 0.441
AsnAsn: 3.807 ± 0.81
AsnPro: 1.269 ± 0.481
AsnGln: 1.745 ± 0.662
AsnArg: 2.379 ± 0.426
AsnSer: 7.456 ± 1.142
AsnThr: 3.014 ± 0.775
AsnVal: 5.235 ± 0.766
AsnTrp: 0.476 ± 0.207
AsnTyr: 3.331 ± 0.545
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.538 ± 0.691
ProCys: 0.159 ± 0.156
ProAsp: 2.062 ± 0.428
ProGlu: 2.221 ± 0.71
ProPhe: 2.538 ± 0.578
ProGly: 1.269 ± 0.241
ProHis: 0.476 ± 0.301
ProIle: 1.586 ± 0.392
ProLys: 2.379 ± 0.426
ProLeu: 2.379 ± 0.478
ProMet: 0.793 ± 0.373
ProAsn: 3.014 ± 0.638
ProPro: 1.428 ± 0.831
ProGln: 0.476 ± 0.268
ProArg: 1.269 ± 0.512
ProSer: 2.379 ± 0.454
ProThr: 2.221 ± 0.62
ProVal: 1.904 ± 0.712
ProTrp: 0.0 ± 0.0
ProTyr: 1.269 ± 0.622
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.586 ± 0.559
GlnCys: 0.793 ± 0.26
GlnAsp: 1.11 ± 0.37
GlnGlu: 0.159 ± 0.178
GlnPhe: 0.793 ± 0.19
GlnGly: 0.793 ± 0.287
GlnHis: 0.476 ± 0.215
GlnIle: 1.904 ± 0.49
GlnLys: 0.952 ± 0.321
GlnLeu: 2.538 ± 0.409
GlnMet: 0.476 ± 0.19
GlnAsn: 0.635 ± 0.284
GlnPro: 1.745 ± 0.537
GlnGln: 0.635 ± 0.292
GlnArg: 1.586 ± 0.526
GlnSer: 1.745 ± 0.457
GlnThr: 0.635 ± 0.284
GlnVal: 2.062 ± 0.6
GlnTrp: 0.159 ± 0.156
GlnTyr: 1.269 ± 0.339
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.173 ± 0.348
ArgCys: 0.952 ± 0.37
ArgAsp: 2.221 ± 0.582
ArgGlu: 2.538 ± 0.627
ArgPhe: 2.697 ± 0.424
ArgGly: 2.855 ± 0.648
ArgHis: 0.793 ± 0.516
ArgIle: 4.283 ± 0.845
ArgLys: 3.014 ± 0.506
ArgLeu: 5.711 ± 0.883
ArgMet: 1.586 ± 0.448
ArgAsn: 2.697 ± 0.704
ArgPro: 1.11 ± 0.36
ArgGln: 0.952 ± 0.43
ArgArg: 2.379 ± 0.605
ArgSer: 3.648 ± 0.678
ArgThr: 2.697 ± 0.568
ArgVal: 3.331 ± 0.574
ArgTrp: 0.476 ± 0.207
ArgTyr: 3.331 ± 0.731
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.504 ± 0.873
SerCys: 0.635 ± 0.362
SerAsp: 4.918 ± 1.158
SerGlu: 3.014 ± 0.714
SerPhe: 3.49 ± 0.732
SerGly: 5.869 ± 0.952
SerHis: 1.586 ± 0.424
SerIle: 4.6 ± 0.819
SerLys: 4.918 ± 0.692
SerLeu: 7.297 ± 1.039
SerMet: 1.745 ± 0.525
SerAsn: 5.076 ± 1.135
SerPro: 3.014 ± 0.465
SerGln: 1.269 ± 0.446
SerArg: 4.6 ± 0.552
SerSer: 8.566 ± 0.998
SerThr: 6.187 ± 0.776
SerVal: 6.662 ± 0.736
SerTrp: 0.476 ± 0.28
SerTyr: 3.173 ± 0.608
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.648 ± 0.507
ThrCys: 0.635 ± 0.345
ThrAsp: 3.807 ± 0.646
ThrGlu: 3.014 ± 0.525
ThrPhe: 1.904 ± 0.438
ThrGly: 2.221 ± 0.57
ThrHis: 1.745 ± 0.236
ThrIle: 3.648 ± 0.633
ThrLys: 2.697 ± 0.573
ThrLeu: 6.187 ± 0.887
ThrMet: 0.952 ± 0.44
ThrAsn: 3.014 ± 0.728
ThrPro: 2.221 ± 0.512
ThrGln: 1.586 ± 0.421
ThrArg: 3.648 ± 0.593
ThrSer: 5.235 ± 1.054
ThrThr: 4.6 ± 0.637
ThrVal: 3.807 ± 0.403
ThrTrp: 0.159 ± 0.115
ThrTyr: 2.855 ± 0.493
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.918 ± 0.766
ValCys: 1.11 ± 0.582
ValAsp: 6.662 ± 1.265
ValGlu: 3.966 ± 0.769
ValPhe: 2.062 ± 0.542
ValGly: 4.918 ± 0.659
ValHis: 1.586 ± 0.401
ValIle: 6.187 ± 0.864
ValLys: 5.711 ± 1.006
ValLeu: 4.759 ± 0.522
ValMet: 2.221 ± 0.606
ValAsn: 4.283 ± 0.742
ValPro: 2.221 ± 0.513
ValGln: 0.635 ± 0.241
ValArg: 3.966 ± 0.521
ValSer: 6.821 ± 1.417
ValThr: 4.283 ± 0.769
ValVal: 6.028 ± 1.204
ValTrp: 0.317 ± 0.18
ValTyr: 1.586 ± 0.502
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.159 ± 0.156
TrpCys: 0.0 ± 0.0
TrpAsp: 0.476 ± 0.189
TrpGlu: 0.793 ± 0.269
TrpPhe: 0.159 ± 0.156
TrpGly: 0.159 ± 0.133
TrpHis: 0.476 ± 0.257
TrpIle: 0.159 ± 0.159
TrpLys: 0.317 ± 0.18
TrpLeu: 0.317 ± 0.184
TrpMet: 0.159 ± 0.164
TrpAsn: 0.159 ± 0.178
TrpPro: 0.317 ± 0.176
TrpGln: 0.0 ± 0.0
TrpArg: 0.476 ± 0.177
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.476 ± 0.265
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.476 ± 0.223
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.11 ± 0.456
TyrCys: 0.317 ± 0.217
TyrAsp: 4.759 ± 0.932
TyrGlu: 2.697 ± 0.505
TyrPhe: 2.062 ± 0.444
TyrGly: 2.855 ± 0.62
TyrHis: 0.635 ± 0.235
TyrIle: 2.697 ± 0.575
TyrLys: 3.014 ± 0.523
TyrLeu: 4.124 ± 0.785
TyrMet: 1.904 ± 0.511
TyrAsn: 3.807 ± 0.872
TyrPro: 0.476 ± 0.215
TyrGln: 0.952 ± 0.235
TyrArg: 1.745 ± 0.833
TyrSer: 2.697 ± 1.638
TyrThr: 1.904 ± 0.354
TyrVal: 2.379 ± 0.574
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.586 ± 0.435
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (6305 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
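The exact bootstrap procedure is not spelled out here, so the following Python sketch is only one plausible reading: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation are assumptions, not the Proteome-pI implementation.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide, pooled over all sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MALLA", "ALAX", "GGGG"]  # toy sequences, not Kadipiro virus data
    print(dipeptide_freq(toy, "AL"), bootstrap_error(toy, "AL"))
```

Resampling at the protein level rather than at the residue level keeps each sequence intact, so the error reflects variation between proteins in a small proteome such as this one (12 proteins).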

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski