Amino acid dipeptide frequency for Saimiri sciureus papillomavirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
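As a minimal sketch of how such a dipeptide frequency could be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter alphabet, and the two toy sequences are illustrative assumptions. The exact counting and normalization used by Proteome-pI is not described on this page.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and any non-standard residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences shorter than 2?)")
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400

# Toy input, not the real proteome.
freqs = dipeptide_per_mille(["MAAPL", "MAPS"])
print(len(freqs), round(freqs["MA"], 3), EXPECTED_UNIFORM)
```

The returned dictionary always has 441 entries (21 x 21), so dipeptides that never occur show up with a frequency of 0.0, as in the table.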

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
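The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the simple greater-than/less-than rule is an assumption, since the exact threshold used for the red and blue marking is not stated here. The three values are taken from the Ala row of the table.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the Ala row of the table.
table_values = {"AlaAla": 12.297, "AlaCys": 1.318, "AlaMet": 2.635}

for name, value in table_values.items():
    print(name, per_mille_to_fraction(value), classify(value))
```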

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.297 ± 2.051
AlaCys: 1.318 ± 0.501
AlaAsp: 4.392 ± 1.751
AlaGlu: 3.074 ± 1.167
AlaPhe: 3.953 ± 1.524
AlaGly: 7.905 ± 0.806
AlaHis: 2.196 ± 0.902
AlaIle: 2.196 ± 0.498
AlaLys: 2.196 ± 0.69
AlaLeu: 8.344 ± 2.078
AlaMet: 2.635 ± 0.633
AlaAsn: 2.196 ± 0.781
AlaPro: 13.614 ± 7.113
AlaGln: 2.635 ± 1.207
AlaArg: 3.953 ± 1.202
AlaSer: 6.588 ± 1.467
AlaThr: 6.588 ± 0.888
AlaVal: 4.831 ± 1.181
AlaTrp: 0.439 ± 0.376
AlaTyr: 1.318 ± 0.693
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.757 ± 0.858
CysCys: 0.439 ± 0.323
CysAsp: 0.878 ± 0.525
CysGlu: 0.439 ± 0.396
CysPhe: 0.878 ± 0.448
CysGly: 1.318 ± 0.693
CysHis: 1.757 ± 0.996
CysIle: 2.196 ± 1.135
CysLys: 1.757 ± 0.821
CysLeu: 1.757 ± 0.539
CysMet: 0.0 ± 0.0
CysAsn: 0.878 ± 0.654
CysPro: 2.196 ± 0.537
CysGln: 1.757 ± 0.823
CysArg: 0.439 ± 0.323
CysSer: 2.196 ± 1.077
CysThr: 2.635 ± 1.325
CysVal: 1.757 ± 0.791
CysTrp: 1.757 ± 0.649
CysTyr: 1.318 ± 0.502
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.392 ± 1.36
AspCys: 2.635 ± 0.688
AspAsp: 2.196 ± 0.692
AspGlu: 2.635 ± 0.879
AspPhe: 1.757 ± 0.203
AspGly: 3.074 ± 0.518
AspHis: 0.878 ± 0.879
AspIle: 4.831 ± 2.428
AspLys: 2.196 ± 0.683
AspLeu: 5.709 ± 1.977
AspMet: 0.439 ± 0.396
AspAsn: 4.392 ± 0.447
AspPro: 2.196 ± 0.728
AspGln: 0.878 ± 0.437
AspArg: 2.196 ± 1.213
AspSer: 5.709 ± 1.824
AspThr: 5.709 ± 0.998
AspVal: 4.831 ± 1.594
AspTrp: 1.318 ± 0.827
AspTyr: 1.318 ± 0.355
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.709 ± 1.028
GluCys: 0.878 ± 0.448
GluAsp: 8.344 ± 1.754
GluGlu: 2.635 ± 1.066
GluPhe: 1.318 ± 0.603
GluGly: 3.074 ± 0.981
GluHis: 2.196 ± 0.806
GluIle: 0.439 ± 0.44
GluLys: 1.318 ± 0.722
GluLeu: 2.635 ± 1.28
GluMet: 2.196 ± 0.704
GluAsn: 2.635 ± 1.638
GluPro: 3.074 ± 1.091
GluGln: 1.757 ± 0.767
GluArg: 2.196 ± 1.063
GluSer: 1.318 ± 0.677
GluThr: 2.635 ± 1.13
GluVal: 5.27 ± 1.588
GluTrp: 1.318 ± 0.645
GluTyr: 1.318 ± 0.501
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.757 ± 0.874
PheCys: 1.318 ± 0.693
PheAsp: 2.635 ± 0.478
PheGlu: 1.318 ± 0.677
PhePhe: 3.074 ± 0.979
PheGly: 3.074 ± 1.046
PheHis: 0.439 ± 0.415
PheIle: 1.318 ± 0.501
PheLys: 3.074 ± 1.056
PheLeu: 3.513 ± 0.903
PheMet: 0.439 ± 0.323
PheAsn: 1.318 ± 1.187
PhePro: 2.196 ± 1.047
PheGln: 0.878 ± 0.525
PheArg: 1.318 ± 0.355
PheSer: 1.757 ± 1.137
PheThr: 2.196 ± 0.902
PheVal: 1.757 ± 0.78
PheTrp: 0.878 ± 0.395
PheTyr: 1.757 ± 0.874
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.27 ± 1.329
GlyCys: 0.439 ± 0.396
GlyAsp: 4.392 ± 0.656
GlyGlu: 3.513 ± 0.567
GlyPhe: 1.757 ± 0.518
GlyGly: 3.513 ± 1.543
GlyHis: 2.635 ± 0.904
GlyIle: 2.635 ± 0.514
GlyLys: 3.074 ± 1.209
GlyLeu: 4.831 ± 2.238
GlyMet: 1.318 ± 0.645
GlyAsn: 2.635 ± 0.683
GlyPro: 0.878 ± 0.459
GlyGln: 2.635 ± 1.059
GlyArg: 4.831 ± 1.106
GlySer: 6.588 ± 1.162
GlyThr: 5.709 ± 0.8
GlyVal: 2.635 ± 0.71
GlyTrp: 0.439 ± 0.323
GlyTyr: 2.196 ± 0.624
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.439 ± 0.396
HisCys: 1.757 ± 0.649
HisAsp: 0.878 ± 0.714
HisGlu: 0.878 ± 0.744
HisPhe: 1.318 ± 0.355
HisGly: 0.878 ± 0.448
HisHis: 0.878 ± 0.532
HisIle: 1.757 ± 0.827
HisLys: 1.318 ± 0.827
HisLeu: 1.757 ± 0.878
HisMet: 0.0 ± 0.0
HisAsn: 1.318 ± 0.866
HisPro: 0.878 ± 0.437
HisGln: 0.878 ± 0.879
HisArg: 3.953 ± 1.64
HisSer: 1.757 ± 1.759
HisThr: 1.757 ± 0.788
HisVal: 0.878 ± 0.459
HisTrp: 0.878 ± 0.525
HisTyr: 0.878 ± 0.459
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.513 ± 0.938
IleCys: 0.439 ± 0.323
IleAsp: 2.196 ± 0.296
IleGlu: 3.953 ± 1.568
IlePhe: 0.878 ± 0.538
IleGly: 3.074 ± 1.001
IleHis: 1.318 ± 0.597
IleIle: 1.318 ± 0.725
IleLys: 0.878 ± 0.455
IleLeu: 2.635 ± 0.588
IleMet: 0.0 ± 0.0
IleAsn: 2.196 ± 0.692
IlePro: 1.757 ± 0.527
IleGln: 1.757 ± 0.791
IleArg: 1.757 ± 0.793
IleSer: 1.757 ± 0.874
IleThr: 1.757 ± 0.76
IleVal: 5.27 ± 1.715
IleTrp: 0.0 ± 0.0
IleTyr: 0.878 ± 0.508
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.831 ± 0.594
LysCys: 2.196 ± 0.978
LysAsp: 2.196 ± 0.704
LysGlu: 3.074 ± 1.043
LysPhe: 2.196 ± 1.48
LysGly: 1.757 ± 0.896
LysHis: 0.878 ± 0.448
LysIle: 1.757 ± 0.903
LysLys: 1.318 ± 0.884
LysLeu: 2.196 ± 0.71
LysMet: 0.439 ± 0.396
LysAsn: 0.439 ± 0.323
LysPro: 1.757 ± 1.081
LysGln: 2.196 ± 0.946
LysArg: 6.588 ± 1.253
LysSer: 4.392 ± 1.559
LysThr: 1.757 ± 0.883
LysVal: 2.196 ± 0.897
LysTrp: 0.878 ± 0.455
LysTyr: 1.757 ± 0.539
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.513 ± 1.993
LeuCys: 3.513 ± 1.232
LeuAsp: 3.953 ± 0.72
LeuGlu: 3.953 ± 1.226
LeuPhe: 3.953 ± 2.005
LeuGly: 7.027 ± 1.01
LeuHis: 2.635 ± 1.379
LeuIle: 2.635 ± 1.066
LeuLys: 6.588 ± 1.871
LeuLeu: 6.148 ± 1.326
LeuMet: 0.878 ± 0.648
LeuAsn: 0.0 ± 0.0
LeuPro: 3.513 ± 1.454
LeuGln: 7.027 ± 0.722
LeuArg: 4.831 ± 1.091
LeuSer: 5.709 ± 1.207
LeuThr: 3.074 ± 0.828
LeuVal: 3.513 ± 1.37
LeuTrp: 1.757 ± 0.203
LeuTyr: 4.831 ± 0.891
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.318 ± 0.603
MetCys: 0.0 ± 0.0
MetAsp: 0.878 ± 0.395
MetGlu: 1.318 ± 0.502
MetPhe: 0.878 ± 0.645
MetGly: 0.439 ± 0.396
MetHis: 0.439 ± 0.44
MetIle: 1.318 ± 0.725
MetLys: 0.439 ± 0.323
MetLeu: 0.439 ± 0.323
MetMet: 0.0 ± 0.0
MetAsn: 1.318 ± 0.725
MetPro: 2.196 ± 0.296
MetGln: 0.878 ± 0.508
MetArg: 1.318 ± 0.501
MetSer: 1.757 ± 0.518
MetThr: 1.318 ± 0.755
MetVal: 1.318 ± 0.603
MetTrp: 0.439 ± 0.44
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.074 ± 1.195
AsnCys: 1.318 ± 0.645
AsnAsp: 1.757 ± 0.981
AsnGlu: 1.757 ± 0.815
AsnPhe: 0.439 ± 0.396
AsnGly: 1.757 ± 0.518
AsnHis: 0.439 ± 0.44
AsnIle: 2.196 ± 0.893
AsnLys: 3.074 ± 2.232
AsnLeu: 2.196 ± 0.946
AsnMet: 1.318 ± 1.016
AsnAsn: 0.439 ± 0.396
AsnPro: 3.074 ± 0.896
AsnGln: 1.757 ± 1.095
AsnArg: 1.318 ± 0.722
AsnSer: 1.318 ± 0.423
AsnThr: 3.074 ± 0.897
AsnVal: 0.439 ± 0.323
AsnTrp: 0.878 ± 0.395
AsnTyr: 0.439 ± 0.616
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 14.932 ± 7.071
ProCys: 0.0 ± 0.0
ProAsp: 4.831 ± 1.861
ProGlu: 3.074 ± 0.69
ProPhe: 1.757 ± 0.518
ProGly: 2.635 ± 0.966
ProHis: 1.318 ± 0.827
ProIle: 2.635 ± 0.478
ProLys: 1.757 ± 0.539
ProLeu: 7.466 ± 1.719
ProMet: 0.439 ± 0.415
ProAsn: 1.318 ± 0.603
ProPro: 5.27 ± 0.77
ProGln: 1.757 ± 0.518
ProArg: 1.757 ± 0.203
ProSer: 6.148 ± 4.157
ProThr: 3.953 ± 1.405
ProVal: 5.27 ± 1.314
ProTrp: 0.439 ± 0.376
ProTyr: 2.635 ± 0.999
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.074 ± 0.913
GlnCys: 1.757 ± 1.071
GlnAsp: 1.318 ± 0.603
GlnGlu: 2.196 ± 1.256
GlnPhe: 1.318 ± 0.355
GlnGly: 3.074 ± 1.056
GlnHis: 0.0 ± 0.0
GlnIle: 1.757 ± 0.203
GlnLys: 0.878 ± 0.448
GlnLeu: 6.148 ± 1.098
GlnMet: 0.0 ± 0.0
GlnAsn: 1.318 ± 0.819
GlnPro: 3.953 ± 0.966
GlnGln: 3.074 ± 1.209
GlnArg: 1.757 ± 0.679
GlnSer: 3.074 ± 0.612
GlnThr: 3.074 ± 0.934
GlnVal: 1.757 ± 0.634
GlnTrp: 1.318 ± 0.677
GlnTyr: 2.196 ± 1.317
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.588 ± 1.033
ArgCys: 3.074 ± 1.212
ArgAsp: 0.439 ± 0.415
ArgGlu: 2.196 ± 0.959
ArgPhe: 1.757 ± 0.827
ArgGly: 1.757 ± 0.557
ArgHis: 2.635 ± 1.113
ArgIle: 0.878 ± 0.525
ArgLys: 3.953 ± 0.714
ArgLeu: 4.831 ± 0.712
ArgMet: 1.757 ± 0.668
ArgAsn: 0.878 ± 0.645
ArgPro: 3.953 ± 1.087
ArgGln: 2.635 ± 1.066
ArgArg: 5.27 ± 1.218
ArgSer: 2.635 ± 1.053
ArgThr: 3.513 ± 1.503
ArgVal: 4.392 ± 1.195
ArgTrp: 0.439 ± 0.323
ArgTyr: 3.953 ± 0.84
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.588 ± 1.351
SerCys: 0.439 ± 0.396
SerAsp: 4.392 ± 0.447
SerGlu: 4.831 ± 0.941
SerPhe: 1.318 ± 0.968
SerGly: 5.27 ± 1.468
SerHis: 1.318 ± 0.645
SerIle: 2.635 ± 0.585
SerLys: 3.074 ± 1.694
SerLeu: 4.392 ± 0.735
SerMet: 1.757 ± 0.203
SerAsn: 3.953 ± 1.384
SerPro: 3.953 ± 1.192
SerGln: 3.074 ± 0.682
SerArg: 4.831 ± 1.939
SerSer: 5.709 ± 2.204
SerThr: 7.466 ± 2.187
SerVal: 6.148 ± 2.17
SerTrp: 0.878 ± 0.645
SerTyr: 2.635 ± 1.127
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.074 ± 1.02
ThrCys: 3.953 ± 1.146
ThrAsp: 4.831 ± 0.748
ThrGlu: 4.831 ± 1.254
ThrPhe: 1.757 ± 0.738
ThrGly: 4.392 ± 0.695
ThrHis: 0.878 ± 0.879
ThrIle: 0.878 ± 0.459
ThrLys: 1.757 ± 0.203
ThrLeu: 5.27 ± 1.0
ThrMet: 0.439 ± 0.323
ThrAsn: 1.318 ± 0.603
ThrPro: 5.27 ± 0.516
ThrGln: 3.513 ± 1.414
ThrArg: 3.074 ± 1.088
ThrSer: 10.101 ± 1.378
ThrThr: 4.831 ± 1.015
ThrVal: 4.831 ± 1.669
ThrTrp: 1.318 ± 1.319
ThrTyr: 2.635 ± 0.37
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.831 ± 1.249
ValCys: 2.196 ± 1.559
ValAsp: 5.27 ± 1.238
ValGlu: 3.953 ± 1.153
ValPhe: 2.196 ± 1.047
ValGly: 4.392 ± 1.388
ValHis: 1.318 ± 0.814
ValIle: 1.757 ± 0.203
ValLys: 3.953 ± 0.663
ValLeu: 3.953 ± 0.675
ValMet: 1.318 ± 0.428
ValAsn: 1.318 ± 0.355
ValPro: 7.027 ± 2.616
ValGln: 2.196 ± 1.256
ValArg: 2.635 ± 1.353
ValSer: 4.392 ± 0.996
ValThr: 5.27 ± 2.008
ValVal: 6.148 ± 0.74
ValTrp: 0.878 ± 0.525
ValTyr: 1.757 ± 0.595
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.635 ± 1.141
TrpCys: 0.0 ± 0.0
TrpAsp: 0.878 ± 0.448
TrpGlu: 0.878 ± 0.395
TrpPhe: 1.757 ± 0.593
TrpGly: 0.878 ± 0.437
TrpHis: 0.439 ± 0.376
TrpIle: 0.878 ± 0.455
TrpLys: 0.878 ± 0.395
TrpLeu: 2.635 ± 1.207
TrpMet: 0.0 ± 0.0
TrpAsn: 0.878 ± 0.538
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.757 ± 0.91
TrpSer: 0.439 ± 0.323
TrpThr: 1.757 ± 1.759
TrpVal: 0.878 ± 0.448
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.439 ± 0.44
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.074 ± 0.657
TyrCys: 0.439 ± 0.323
TyrAsp: 2.635 ± 0.626
TyrGlu: 1.757 ± 0.896
TyrPhe: 1.757 ± 1.137
TyrGly: 2.635 ± 0.478
TyrHis: 0.439 ± 0.376
TyrIle: 1.318 ± 0.723
TyrLys: 1.318 ± 0.355
TyrLeu: 2.635 ± 0.988
TyrMet: 2.196 ± 0.626
TyrAsn: 1.318 ± 0.355
TyrPro: 2.635 ± 0.97
TyrGln: 1.757 ± 0.74
TyrArg: 1.757 ± 1.05
TyrSer: 1.318 ± 0.448
TyrThr: 0.878 ± 0.448
TyrVal: 2.635 ± 1.155
TyrTrp: 1.757 ± 0.471
TyrTyr: 0.878 ± 0.525
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2278 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
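Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, and the use of the sample standard deviation as the reported error are not taken from Proteome-pI.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(replicates)


# Toy sequences, not the real proteome.
toy_proteome = ["MAAPLAAT", "MSTPAV", "MKRAAPS", "MLLQG"]
estimate = dipeptide_freq(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {estimate:.3f} ± {error:.3f}")
```

With only a handful of proteins, as here, the resampled sets differ strongly from one another, which is one plausible reason why some errors in the table are large relative to the values.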
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski