Amino acid dipeptide frequency for Tursiops truncatus papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
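As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes (the table uses three-letter codes), and the choice to count pairs within each protein only are assumptions made for this example; the page does not state how Proteome-pI handles these details.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard one-letter codes

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumptions: pairs are counted inside each protein only (never across
    protein boundaries), and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real viral proteins.
freqs = dipeptide_frequencies(["MALLA", "LLB"])
```

Here `freqs["LL"]` is the per mille share of Leu-Leu pairs among all six pairs found in the two toy sequences.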

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
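The uniform expectation and the over/under labelling can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and comparing a value directly against the uniform baseline (ignoring its ± error) is an assumption; the exact threshold used for the colouring on the page is not stated.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"       # would be shown in red
    if freq_per_mille < expected:
        return "under"      # would be shown in blue
    return "expected"

print(expected_per_mille())   # 2.5 (per mille, i.e. 0.25%)
print(classify(12.898))       # LeuLeu value from the table
print(classify(0.403))        # CysGly value from the table
```

Passing `alphabet_size=21` gives the baseline for the 441-dipeptide case that includes Xaa.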

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
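To make the unit conversion concrete, here is a minimal sketch that turns a per mille value into a fraction and into a percentage. The function names are hypothetical; the example value 4.434 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1

print(f"{per_mille_to_fraction(4.434):.6f}")  # 0.004434
print(f"{per_mille_to_percent(4.434):.4f}")   # 0.4434
```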

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.434 ± 1.169
AlaCys: 2.821 ± 1.101
AlaAsp: 4.031 ± 1.589
AlaGlu: 5.643 ± 1.314
AlaPhe: 2.015 ± 0.643
AlaGly: 1.612 ± 0.478
AlaHis: 2.418 ± 1.171
AlaIle: 3.225 ± 1.402
AlaLys: 3.225 ± 1.461
AlaLeu: 6.852 ± 1.449
AlaMet: 2.015 ± 0.86
AlaAsn: 0.806 ± 0.725
AlaPro: 4.434 ± 1.612
AlaGln: 2.015 ± 1.121
AlaArg: 4.031 ± 1.62
AlaSer: 4.434 ± 0.525
AlaThr: 4.837 ± 1.55
AlaVal: 6.852 ± 0.815
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.225 ± 0.648
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.806 ± 0.515
CysCys: 1.612 ± 0.687
CysAsp: 0.806 ± 0.467
CysGlu: 0.806 ± 0.388
CysPhe: 0.806 ± 0.515
CysGly: 0.403 ± 0.338
CysHis: 0.806 ± 1.024
CysIle: 1.209 ± 0.457
CysLys: 1.209 ± 1.013
CysLeu: 2.821 ± 1.212
CysMet: 1.612 ± 0.454
CysAsn: 1.209 ± 0.56
CysPro: 1.209 ± 1.064
CysGln: 1.209 ± 0.97
CysArg: 0.806 ± 0.675
CysSer: 1.209 ± 0.822
CysThr: 2.015 ± 0.793
CysVal: 1.612 ± 1.174
CysTrp: 1.209 ± 0.814
CysTyr: 2.015 ± 1.245
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.643 ± 1.392
AspCys: 1.209 ± 0.636
AspAsp: 3.225 ± 1.821
AspGlu: 3.225 ± 1.428
AspPhe: 2.015 ± 0.638
AspGly: 4.434 ± 1.176
AspHis: 0.0 ± 0.0
AspIle: 3.225 ± 0.625
AspLys: 1.612 ± 0.643
AspLeu: 4.837 ± 1.863
AspMet: 1.209 ± 0.702
AspAsn: 2.015 ± 0.511
AspPro: 2.821 ± 1.193
AspGln: 0.403 ± 0.338
AspArg: 2.821 ± 1.872
AspSer: 5.24 ± 1.285
AspThr: 4.031 ± 1.199
AspVal: 5.24 ± 1.841
AspTrp: 0.403 ± 0.338
AspTyr: 1.209 ± 0.699
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.225 ± 0.487
GluCys: 0.806 ± 0.675
GluAsp: 2.418 ± 1.4
GluGlu: 5.24 ± 1.516
GluPhe: 1.612 ± 0.572
GluGly: 5.24 ± 1.485
GluHis: 2.015 ± 0.731
GluIle: 2.418 ± 1.079
GluLys: 3.628 ± 1.253
GluLeu: 3.628 ± 1.41
GluMet: 0.403 ± 0.338
GluAsn: 3.225 ± 0.913
GluPro: 3.225 ± 0.704
GluGln: 2.015 ± 1.169
GluArg: 2.015 ± 1.369
GluSer: 4.837 ± 1.135
GluThr: 3.628 ± 1.707
GluVal: 2.015 ± 0.92
GluTrp: 1.612 ± 0.941
GluTyr: 3.225 ± 1.169
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.418 ± 0.86
PheCys: 2.015 ± 0.873
PheAsp: 1.209 ± 0.743
PheGlu: 1.612 ± 0.687
PhePhe: 1.612 ± 0.525
PheGly: 3.225 ± 1.421
PheHis: 0.0 ± 0.0
PheIle: 3.225 ± 1.207
PheLys: 2.015 ± 0.525
PheLeu: 6.046 ± 1.739
PheMet: 0.0 ± 0.0
PheAsn: 1.612 ± 1.015
PhePro: 2.015 ± 0.905
PheGln: 2.015 ± 0.721
PheArg: 2.418 ± 0.87
PheSer: 1.612 ± 1.264
PheThr: 0.403 ± 0.338
PheVal: 0.806 ± 0.457
PheTrp: 2.015 ± 0.708
PheTyr: 0.806 ± 0.388
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.24 ± 1.488
GlyCys: 0.403 ± 0.355
GlyAsp: 6.852 ± 1.578
GlyGlu: 3.225 ± 0.648
GlyPhe: 2.015 ± 0.963
GlyGly: 5.643 ± 1.978
GlyHis: 2.821 ± 0.638
GlyIle: 2.821 ± 0.924
GlyLys: 3.225 ± 0.706
GlyLeu: 4.434 ± 0.42
GlyMet: 1.612 ± 0.628
GlyAsn: 4.434 ± 0.637
GlyPro: 5.643 ± 2.16
GlyGln: 2.015 ± 0.803
GlyArg: 4.031 ± 0.767
GlySer: 5.643 ± 1.824
GlyThr: 6.046 ± 1.404
GlyVal: 3.225 ± 0.732
GlyTrp: 0.403 ± 0.512
GlyTyr: 2.015 ± 1.13
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.821 ± 1.031
HisCys: 0.403 ± 0.355
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 2.015 ± 0.754
HisGly: 2.015 ± 0.617
HisHis: 0.806 ± 0.372
HisIle: 1.209 ± 0.565
HisLys: 2.015 ± 1.211
HisLeu: 3.628 ± 0.887
HisMet: 1.209 ± 0.603
HisAsn: 0.806 ± 0.388
HisPro: 0.806 ± 0.709
HisGln: 1.612 ± 0.852
HisArg: 3.225 ± 1.058
HisSer: 3.628 ± 1.654
HisThr: 1.612 ± 0.572
HisVal: 1.209 ± 0.822
HisTrp: 0.403 ± 0.338
HisTyr: 0.806 ± 0.448
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.628 ± 1.566
IleCys: 0.806 ± 0.515
IleAsp: 2.418 ± 0.774
IleGlu: 1.612 ± 0.744
IlePhe: 2.015 ± 0.621
IleGly: 5.643 ± 1.277
IleHis: 2.418 ± 1.219
IleIle: 2.418 ± 0.907
IleLys: 2.015 ± 0.574
IleLeu: 4.031 ± 1.317
IleMet: 0.403 ± 0.313
IleAsn: 1.612 ± 1.548
IlePro: 3.225 ± 1.17
IleGln: 1.612 ± 0.895
IleArg: 0.806 ± 0.467
IleSer: 3.225 ± 0.445
IleThr: 3.628 ± 1.222
IleVal: 2.418 ± 1.045
IleTrp: 0.403 ± 0.408
IleTyr: 0.806 ± 1.024
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.434 ± 1.084
LysCys: 2.821 ± 0.916
LysAsp: 2.418 ± 0.752
LysGlu: 2.418 ± 1.255
LysPhe: 1.209 ± 0.574
LysGly: 3.628 ± 0.522
LysHis: 2.418 ± 1.52
LysIle: 2.015 ± 0.983
LysLys: 1.209 ± 0.416
LysLeu: 3.225 ± 0.988
LysMet: 1.209 ± 0.636
LysAsn: 1.209 ± 0.403
LysPro: 1.612 ± 1.06
LysGln: 2.015 ± 1.115
LysArg: 3.628 ± 1.232
LysSer: 3.628 ± 1.76
LysThr: 3.225 ± 0.96
LysVal: 4.031 ± 1.191
LysTrp: 1.209 ± 0.626
LysTyr: 2.821 ± 1.008
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.24 ± 1.671
LeuCys: 2.418 ± 1.048
LeuAsp: 4.837 ± 1.27
LeuGlu: 5.643 ± 0.989
LeuPhe: 3.225 ± 1.425
LeuGly: 6.046 ± 1.458
LeuHis: 2.418 ± 0.664
LeuIle: 2.418 ± 1.009
LeuLys: 4.031 ± 0.999
LeuLeu: 12.898 ± 6.063
LeuMet: 2.015 ± 0.583
LeuAsn: 2.821 ± 0.593
LeuPro: 4.837 ± 1.292
LeuGln: 4.031 ± 1.126
LeuArg: 2.821 ± 1.464
LeuSer: 8.464 ± 1.709
LeuThr: 5.643 ± 1.459
LeuVal: 6.852 ± 1.546
LeuTrp: 0.806 ± 0.58
LeuTyr: 3.225 ± 0.962
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.015 ± 0.998
MetCys: 0.0 ± 0.0
MetAsp: 2.418 ± 0.737
MetGlu: 2.418 ± 1.405
MetPhe: 0.806 ± 0.388
MetGly: 0.806 ± 0.819
MetHis: 0.403 ± 0.316
MetIle: 1.209 ± 0.775
MetLys: 0.403 ± 0.316
MetLeu: 2.015 ± 0.643
MetMet: 0.0 ± 0.0
MetAsn: 0.806 ± 0.388
MetPro: 0.403 ± 0.42
MetGln: 1.209 ± 0.403
MetArg: 1.209 ± 0.626
MetSer: 2.015 ± 0.889
MetThr: 0.806 ± 0.457
MetVal: 1.612 ± 0.687
MetTrp: 0.0 ± 0.0
MetTyr: 0.806 ± 0.467
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.031 ± 1.297
AsnCys: 1.612 ± 0.966
AsnAsp: 0.403 ± 0.338
AsnGlu: 2.821 ± 0.858
AsnPhe: 0.806 ± 0.508
AsnGly: 4.434 ± 1.162
AsnHis: 0.806 ± 0.454
AsnIle: 3.225 ± 0.761
AsnLys: 2.418 ± 1.633
AsnLeu: 1.612 ± 0.563
AsnMet: 0.806 ± 0.372
AsnAsn: 2.821 ± 0.956
AsnPro: 3.628 ± 1.21
AsnGln: 0.806 ± 0.388
AsnArg: 1.209 ± 0.663
AsnSer: 2.418 ± 0.991
AsnThr: 2.015 ± 0.469
AsnVal: 2.418 ± 0.517
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.806 ± 0.388
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.852 ± 2.672
ProCys: 0.403 ± 0.42
ProAsp: 4.434 ± 1.022
ProGlu: 1.612 ± 0.589
ProPhe: 3.225 ± 0.845
ProGly: 2.015 ± 0.821
ProHis: 2.821 ± 1.004
ProIle: 0.403 ± 0.554
ProLys: 3.225 ± 1.249
ProLeu: 6.852 ± 1.106
ProMet: 1.209 ± 0.656
ProAsn: 2.015 ± 0.991
ProPro: 7.255 ± 1.644
ProGln: 1.209 ± 0.602
ProArg: 2.418 ± 1.216
ProSer: 4.434 ± 1.349
ProThr: 4.434 ± 0.967
ProVal: 4.837 ± 2.415
ProTrp: 0.403 ± 0.316
ProTyr: 1.612 ± 0.687
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.821 ± 0.824
GlnCys: 0.0 ± 0.0
GlnAsp: 2.015 ± 0.621
GlnGlu: 2.015 ± 0.99
GlnPhe: 2.015 ± 0.986
GlnGly: 2.418 ± 1.066
GlnHis: 0.403 ± 0.316
GlnIle: 1.209 ± 0.797
GlnLys: 2.015 ± 0.913
GlnLeu: 5.24 ± 1.565
GlnMet: 1.612 ± 0.914
GlnAsn: 1.209 ± 0.403
GlnPro: 2.015 ± 1.006
GlnGln: 0.806 ± 0.725
GlnArg: 0.403 ± 0.355
GlnSer: 1.209 ± 0.913
GlnThr: 1.209 ± 0.699
GlnVal: 2.015 ± 0.624
GlnTrp: 0.403 ± 0.338
GlnTyr: 0.806 ± 0.709
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.628 ± 0.792
ArgCys: 2.015 ± 0.99
ArgAsp: 2.418 ± 1.599
ArgGlu: 2.418 ± 1.013
ArgPhe: 2.015 ± 0.827
ArgGly: 5.643 ± 1.475
ArgHis: 2.015 ± 0.735
ArgIle: 1.612 ± 0.591
ArgLys: 3.225 ± 1.093
ArgLeu: 3.225 ± 1.277
ArgMet: 1.209 ± 0.704
ArgAsn: 1.612 ± 0.598
ArgPro: 2.418 ± 1.16
ArgGln: 0.806 ± 0.388
ArgArg: 4.434 ± 1.151
ArgSer: 2.821 ± 0.987
ArgThr: 3.225 ± 1.196
ArgVal: 3.628 ± 1.321
ArgTrp: 1.209 ± 0.699
ArgTyr: 2.015 ± 0.511
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.837 ± 1.399
SerCys: 0.806 ± 0.457
SerAsp: 4.837 ± 1.22
SerGlu: 4.031 ± 1.426
SerPhe: 2.821 ± 1.139
SerGly: 7.255 ± 1.534
SerHis: 3.225 ± 0.997
SerIle: 4.434 ± 1.082
SerLys: 4.434 ± 1.041
SerLeu: 6.046 ± 1.46
SerMet: 2.015 ± 0.814
SerAsn: 4.031 ± 1.688
SerPro: 3.628 ± 0.996
SerGln: 2.015 ± 1.406
SerArg: 3.628 ± 0.771
SerSer: 9.27 ± 2.394
SerThr: 7.658 ± 2.778
SerVal: 4.434 ± 1.813
SerTrp: 0.806 ± 0.515
SerTyr: 1.612 ± 0.606
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.225 ± 0.492
ThrCys: 2.015 ± 0.926
ThrAsp: 4.031 ± 0.99
ThrGlu: 4.031 ± 1.858
ThrPhe: 2.418 ± 1.016
ThrGly: 5.643 ± 1.039
ThrHis: 0.806 ± 0.725
ThrIle: 4.031 ± 1.228
ThrLys: 1.612 ± 1.082
ThrLeu: 4.031 ± 1.309
ThrMet: 0.403 ± 0.449
ThrAsn: 3.628 ± 1.34
ThrPro: 5.24 ± 2.287
ThrGln: 2.418 ± 0.5
ThrArg: 3.628 ± 0.919
ThrSer: 6.449 ± 2.214
ThrThr: 3.628 ± 1.144
ThrVal: 6.852 ± 1.432
ThrTrp: 1.612 ± 1.679
ThrTyr: 1.612 ± 0.989
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.612 ± 0.431
ValCys: 2.418 ± 0.928
ValAsp: 3.225 ± 1.375
ValGlu: 4.837 ± 2.113
ValPhe: 2.821 ± 0.553
ValGly: 4.031 ± 1.115
ValHis: 2.418 ± 0.888
ValIle: 2.821 ± 0.842
ValLys: 3.628 ± 0.905
ValLeu: 4.837 ± 1.729
ValMet: 0.403 ± 0.338
ValAsn: 1.612 ± 0.572
ValPro: 4.837 ± 1.082
ValGln: 3.225 ± 1.116
ValArg: 4.031 ± 1.001
ValSer: 8.464 ± 1.48
ValThr: 5.24 ± 2.187
ValVal: 2.418 ± 1.368
ValTrp: 1.209 ± 0.501
ValTyr: 1.612 ± 0.572
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.806 ± 0.388
TrpCys: 0.0 ± 0.0
TrpAsp: 0.806 ± 0.388
TrpGlu: 0.806 ± 0.618
TrpPhe: 0.806 ± 0.448
TrpGly: 0.403 ± 0.338
TrpHis: 0.806 ± 0.448
TrpIle: 1.209 ± 0.704
TrpLys: 2.418 ± 1.217
TrpLeu: 1.612 ± 0.725
TrpMet: 0.0 ± 0.0
TrpAsn: 0.806 ± 0.58
TrpPro: 0.806 ± 0.515
TrpGln: 0.0 ± 0.0
TrpArg: 1.612 ± 0.961
TrpSer: 0.0 ± 0.0
TrpThr: 2.015 ± 1.604
TrpVal: 0.403 ± 0.338
TrpTrp: 0.403 ± 0.338
TrpTyr: 0.403 ± 0.338
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.209 ± 0.704
TyrCys: 0.806 ± 0.618
TyrAsp: 2.015 ± 0.511
TyrGlu: 2.015 ± 0.617
TyrPhe: 0.806 ± 0.454
TyrGly: 1.209 ± 0.403
TyrHis: 0.403 ± 0.355
TyrIle: 0.806 ± 0.454
TyrLys: 2.821 ± 0.372
TyrLeu: 2.821 ± 0.89
TyrMet: 1.612 ± 0.687
TyrAsn: 0.806 ± 0.467
TyrPro: 1.612 ± 0.619
TyrGln: 0.403 ± 0.355
TyrArg: 2.418 ± 1.186
TyrSer: 2.821 ± 0.983
TyrThr: 2.418 ± 0.45
TyrVal: 2.821 ± 0.524
TyrTrp: 1.612 ± 0.862
TyrTyr: 1.209 ± 0.462
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2482 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
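A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. The function names, the use of one-letter sequences, the fixed seed, and the choice of the population standard deviation as the reported error are all assumptions for illustration; the page does not describe these details.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Frequency (per mille) of one two-letter pair across the sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Spread of the pair frequency over resampled sets of proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

# Toy input, not real viral proteins.
proteins = ["MALLA", "LLK", "AAAL"]
error = bootstrap_error(proteins, "LL")
```

A pair that never occurs gets a frequency of 0 in every resample, so its estimated error is also 0, which matches the `0.0 ± 0.0` entries in the table.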

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski