Amino acid dipepetide frequency for Saimiri sciureus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.439AlaAla: 12.439 ± 1.727
1.333AlaCys: 1.333 ± 0.562
3.998AlaAsp: 3.998 ± 0.993
3.998AlaGlu: 3.998 ± 0.47
2.665AlaPhe: 2.665 ± 1.263
6.664AlaGly: 6.664 ± 0.444
1.777AlaHis: 1.777 ± 0.667
2.665AlaIle: 2.665 ± 0.503
3.554AlaLys: 3.554 ± 1.327
6.219AlaLeu: 6.219 ± 1.129
1.777AlaMet: 1.777 ± 0.398
1.777AlaAsn: 1.777 ± 0.545
6.664AlaPro: 6.664 ± 4.004
2.221AlaGln: 2.221 ± 0.847
4.442AlaArg: 4.442 ± 1.412
6.664AlaSer: 6.664 ± 1.004
6.219AlaThr: 6.219 ± 0.931
5.331AlaVal: 5.331 ± 0.834
0.888AlaTrp: 0.888 ± 1.076
2.665AlaTyr: 2.665 ± 0.424
0.0AlaXaa: 0.0 ± 0.0
Cys
2.221CysAla: 2.221 ± 0.824
0.444CysCys: 0.444 ± 0.344
0.888CysAsp: 0.888 ± 0.518
0.888CysGlu: 0.888 ± 0.518
0.888CysPhe: 0.888 ± 0.438
1.777CysGly: 1.777 ± 1.19
1.333CysHis: 1.333 ± 1.088
1.777CysIle: 1.777 ± 0.731
0.888CysLys: 0.888 ± 0.518
0.444CysLeu: 0.444 ± 0.424
0.0CysMet: 0.0 ± 0.0
1.333CysAsn: 1.333 ± 0.631
2.221CysPro: 2.221 ± 0.501
1.333CysGln: 1.333 ± 0.563
0.888CysArg: 0.888 ± 0.406
1.777CysSer: 1.777 ± 1.092
2.665CysThr: 2.665 ± 1.605
2.221CysVal: 2.221 ± 0.831
1.777CysTrp: 1.777 ± 0.731
2.221CysTyr: 2.221 ± 1.288
0.0CysXaa: 0.0 ± 0.0
Asp
3.554AspAla: 3.554 ± 1.104
2.665AspCys: 2.665 ± 0.759
1.333AspAsp: 1.333 ± 1.032
2.221AspGlu: 2.221 ± 0.944
3.554AspPhe: 3.554 ± 0.859
4.442AspGly: 4.442 ± 1.016
1.333AspHis: 1.333 ± 0.848
4.887AspIle: 4.887 ± 2.514
1.777AspLys: 1.777 ± 0.744
3.998AspLeu: 3.998 ± 1.344
0.444AspMet: 0.444 ± 0.371
4.442AspAsn: 4.442 ± 0.609
1.777AspPro: 1.777 ± 0.855
0.444AspGln: 0.444 ± 0.371
2.665AspArg: 2.665 ± 1.599
5.775AspSer: 5.775 ± 1.267
7.108AspThr: 7.108 ± 1.295
4.442AspVal: 4.442 ± 1.476
1.333AspTrp: 1.333 ± 0.664
1.333AspTyr: 1.333 ± 0.563
0.0AspXaa: 0.0 ± 0.0
Glu
3.554GluAla: 3.554 ± 0.636
0.888GluCys: 0.888 ± 0.438
9.329GluAsp: 9.329 ± 1.75
1.777GluGlu: 1.777 ± 0.731
0.444GluPhe: 0.444 ± 0.344
2.665GluGly: 2.665 ± 0.796
1.777GluHis: 1.777 ± 0.513
0.444GluIle: 0.444 ± 0.514
0.888GluLys: 0.888 ± 0.743
3.11GluLeu: 3.11 ± 0.859
2.665GluMet: 2.665 ± 0.554
3.554GluAsn: 3.554 ± 1.242
3.11GluPro: 3.11 ± 1.212
0.888GluGln: 0.888 ± 0.518
2.221GluArg: 2.221 ± 1.207
1.333GluSer: 1.333 ± 1.032
3.11GluThr: 3.11 ± 0.59
3.554GluVal: 3.554 ± 1.108
0.888GluTrp: 0.888 ± 0.688
1.333GluTyr: 1.333 ± 0.712
0.0GluXaa: 0.0 ± 0.0
Phe
1.777PheAla: 1.777 ± 0.719
0.888PheCys: 0.888 ± 0.595
1.777PheAsp: 1.777 ± 0.667
0.888PheGlu: 0.888 ± 0.688
2.665PhePhe: 2.665 ± 0.957
3.11PheGly: 3.11 ± 1.128
0.444PheHis: 0.444 ± 0.398
1.333PheIle: 1.333 ± 0.562
3.11PheLys: 3.11 ± 1.203
3.11PheLeu: 3.11 ± 0.963
0.888PheMet: 0.888 ± 0.595
1.333PheAsn: 1.333 ± 0.712
2.665PhePro: 2.665 ± 1.21
0.888PheGln: 0.888 ± 0.518
1.333PheArg: 1.333 ± 0.446
1.777PheSer: 1.777 ± 0.855
3.11PheThr: 3.11 ± 1.025
1.333PheVal: 1.333 ± 0.328
1.333PheTrp: 1.333 ± 0.328
1.777PheTyr: 1.777 ± 0.855
0.0PheXaa: 0.0 ± 0.0
Gly
6.664GlyAla: 6.664 ± 1.913
0.888GlyCys: 0.888 ± 0.518
6.219GlyAsp: 6.219 ± 1.387
1.333GlyGlu: 1.333 ± 0.328
2.221GlyPhe: 2.221 ± 0.329
2.665GlyGly: 2.665 ± 1.455
2.665GlyHis: 2.665 ± 0.809
3.11GlyIle: 3.11 ± 0.777
3.554GlyLys: 3.554 ± 0.957
6.219GlyLeu: 6.219 ± 1.965
1.777GlyMet: 1.777 ± 0.737
3.554GlyAsn: 3.554 ± 0.979
1.777GlyPro: 1.777 ± 0.545
3.554GlyGln: 3.554 ± 1.58
4.442GlyArg: 4.442 ± 0.527
3.998GlySer: 3.998 ± 0.625
8.441GlyThr: 8.441 ± 1.238
3.11GlyVal: 3.11 ± 1.019
0.444GlyTrp: 0.444 ± 0.344
2.221GlyTyr: 2.221 ± 0.68
0.0GlyXaa: 0.0 ± 0.0
His
0.444HisAla: 0.444 ± 0.514
1.777HisCys: 1.777 ± 0.731
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.333HisPhe: 1.333 ± 0.328
0.888HisGly: 0.888 ± 0.406
0.888HisHis: 0.888 ± 0.626
1.333HisIle: 1.333 ± 0.791
1.333HisLys: 1.333 ± 0.791
1.333HisLeu: 1.333 ± 1.088
0.444HisMet: 0.444 ± 0.344
0.888HisAsn: 0.888 ± 0.577
1.333HisPro: 1.333 ± 0.738
1.333HisGln: 1.333 ± 0.924
3.998HisArg: 3.998 ± 1.862
0.0HisSer: 0.0 ± 0.0
0.888HisThr: 0.888 ± 0.418
1.777HisVal: 1.777 ± 0.23
0.888HisTrp: 0.888 ± 0.518
0.444HisTyr: 0.444 ± 0.344
0.0HisXaa: 0.0 ± 0.0
Ile
3.11IleAla: 3.11 ± 0.879
1.333IleCys: 1.333 ± 0.669
3.11IleAsp: 3.11 ± 0.521
3.554IleGlu: 3.554 ± 1.16
1.333IlePhe: 1.333 ± 0.663
3.554IleGly: 3.554 ± 1.156
0.888IleHis: 0.888 ± 0.518
0.888IleIle: 0.888 ± 0.62
1.333IleLys: 1.333 ± 0.809
3.11IleLeu: 3.11 ± 1.434
0.444IleMet: 0.444 ± 0.388
0.888IleAsn: 0.888 ± 0.688
1.333IlePro: 1.333 ± 0.4
3.11IleGln: 3.11 ± 0.867
0.888IleArg: 0.888 ± 0.577
1.333IleSer: 1.333 ± 0.738
1.333IleThr: 1.333 ± 0.791
3.554IleVal: 3.554 ± 1.189
0.0IleTrp: 0.0 ± 0.0
0.888IleTyr: 0.888 ± 0.509
0.0IleXaa: 0.0 ± 0.0
Lys
3.554LysAla: 3.554 ± 0.67
1.777LysCys: 1.777 ± 0.937
2.221LysAsp: 2.221 ± 0.563
3.554LysGlu: 3.554 ± 1.165
1.777LysPhe: 1.777 ± 1.056
2.665LysGly: 2.665 ± 0.554
0.888LysHis: 0.888 ± 0.438
0.888LysIle: 0.888 ± 0.406
1.333LysLys: 1.333 ± 0.446
2.221LysLeu: 2.221 ± 1.087
0.888LysMet: 0.888 ± 0.418
0.888LysAsn: 0.888 ± 0.438
1.333LysPro: 1.333 ± 0.328
1.777LysGln: 1.777 ± 1.029
5.775LysArg: 5.775 ± 1.015
4.442LysSer: 4.442 ± 1.724
2.665LysThr: 2.665 ± 0.943
3.11LysVal: 3.11 ± 1.176
0.888LysTrp: 0.888 ± 0.595
3.11LysTyr: 3.11 ± 1.006
0.0LysXaa: 0.0 ± 0.0
Leu
3.554LeuAla: 3.554 ± 1.242
3.554LeuCys: 3.554 ± 1.401
4.887LeuAsp: 4.887 ± 0.639
3.998LeuGlu: 3.998 ± 1.112
3.554LeuPhe: 3.554 ± 1.989
7.108LeuGly: 7.108 ± 1.075
0.888LeuHis: 0.888 ± 0.797
1.777LeuIle: 1.777 ± 0.963
7.108LeuLys: 7.108 ± 2.494
5.331LeuLeu: 5.331 ± 1.001
1.777LeuMet: 1.777 ± 0.997
1.777LeuAsn: 1.777 ± 0.767
3.554LeuPro: 3.554 ± 1.579
8.441LeuGln: 8.441 ± 1.174
5.331LeuArg: 5.331 ± 0.83
3.998LeuSer: 3.998 ± 0.848
3.11LeuThr: 3.11 ± 1.13
3.554LeuVal: 3.554 ± 0.908
1.777LeuTrp: 1.777 ± 0.23
3.998LeuTyr: 3.998 ± 0.586
0.0LeuXaa: 0.0 ± 0.0
Met
1.777MetAla: 1.777 ± 0.835
0.444MetCys: 0.444 ± 0.424
1.333MetAsp: 1.333 ± 0.562
0.444MetGlu: 0.444 ± 0.424
0.888MetPhe: 0.888 ± 0.688
0.888MetGly: 0.888 ± 0.418
1.333MetHis: 1.333 ± 0.924
1.333MetIle: 1.333 ± 0.563
0.0MetLys: 0.0 ± 0.0
0.444MetLeu: 0.444 ± 0.344
0.0MetMet: 0.0 ± 0.0
1.333MetAsn: 1.333 ± 0.563
1.777MetPro: 1.777 ± 0.963
0.888MetGln: 0.888 ± 0.509
1.333MetArg: 1.333 ± 0.669
1.333MetSer: 1.333 ± 1.032
1.333MetThr: 1.333 ± 0.328
1.777MetVal: 1.777 ± 0.835
0.444MetTrp: 0.444 ± 0.424
0.888MetTyr: 0.888 ± 0.518
0.0MetXaa: 0.0 ± 0.0
Asn
2.665AsnAla: 2.665 ± 1.328
1.777AsnCys: 1.777 ± 0.738
0.888AsnAsp: 0.888 ± 0.688
2.665AsnGlu: 2.665 ± 1.204
0.444AsnPhe: 0.444 ± 0.371
2.665AsnGly: 2.665 ± 0.424
0.444AsnHis: 0.444 ± 0.344
1.333AsnIle: 1.333 ± 0.809
3.554AsnLys: 3.554 ± 1.543
3.554AsnLeu: 3.554 ± 0.489
0.888AsnMet: 0.888 ± 0.582
1.777AsnAsn: 1.777 ± 1.047
3.554AsnPro: 3.554 ± 1.559
1.777AsnGln: 1.777 ± 1.056
0.888AsnArg: 0.888 ± 0.418
2.665AsnSer: 2.665 ± 1.267
3.554AsnThr: 3.554 ± 1.148
1.333AsnVal: 1.333 ± 0.712
0.444AsnTrp: 0.444 ± 0.344
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
7.108ProAla: 7.108 ± 2.905
0.444ProCys: 0.444 ± 0.371
4.442ProAsp: 4.442 ± 1.724
2.665ProGlu: 2.665 ± 0.978
0.888ProPhe: 0.888 ± 0.688
2.221ProGly: 2.221 ± 0.962
0.888ProHis: 0.888 ± 0.418
3.11ProIle: 3.11 ± 1.081
2.221ProLys: 2.221 ± 0.802
7.108ProLeu: 7.108 ± 1.585
0.444ProMet: 0.444 ± 0.371
1.333ProAsn: 1.333 ± 0.669
8.441ProPro: 8.441 ± 1.489
1.777ProGln: 1.777 ± 0.877
0.444ProArg: 0.444 ± 0.344
5.775ProSer: 5.775 ± 2.634
6.219ProThr: 6.219 ± 1.457
6.664ProVal: 6.664 ± 2.486
0.0ProTrp: 0.0 ± 0.0
3.11ProTyr: 3.11 ± 0.958
0.0ProXaa: 0.0 ± 0.0
Gln
3.11GlnAla: 3.11 ± 0.796
1.777GlnCys: 1.777 ± 1.055
1.333GlnAsp: 1.333 ± 0.669
1.777GlnGlu: 1.777 ± 1.191
0.888GlnPhe: 0.888 ± 0.418
4.442GlnGly: 4.442 ± 1.558
0.0GlnHis: 0.0 ± 0.0
2.221GlnIle: 2.221 ± 0.852
1.333GlnLys: 1.333 ± 0.791
5.331GlnLeu: 5.331 ± 1.418
0.0GlnMet: 0.0 ± 0.0
1.333GlnAsn: 1.333 ± 0.795
3.998GlnPro: 3.998 ± 0.577
2.221GlnGln: 2.221 ± 0.739
2.221GlnArg: 2.221 ± 0.919
1.333GlnSer: 1.333 ± 0.483
4.887GlnThr: 4.887 ± 1.011
1.777GlnVal: 1.777 ± 0.613
0.888GlnTrp: 0.888 ± 0.406
2.665GlnTyr: 2.665 ± 1.149
0.0GlnXaa: 0.0 ± 0.0
Arg
7.552ArgAla: 7.552 ± 0.803
2.665ArgCys: 2.665 ± 1.459
0.888ArgAsp: 0.888 ± 0.659
1.777ArgGlu: 1.777 ± 0.513
1.777ArgPhe: 1.777 ± 0.84
3.11ArgGly: 3.11 ± 1.366
2.665ArgHis: 2.665 ± 1.124
1.333ArgIle: 1.333 ± 0.728
4.442ArgLys: 4.442 ± 0.525
4.442ArgLeu: 4.442 ± 1.391
0.888ArgMet: 0.888 ± 0.419
0.888ArgAsn: 0.888 ± 0.688
3.998ArgPro: 3.998 ± 0.965
2.665ArgGln: 2.665 ± 1.187
3.554ArgArg: 3.554 ± 1.217
2.221ArgSer: 2.221 ± 0.473
2.665ArgThr: 2.665 ± 0.844
4.442ArgVal: 4.442 ± 1.517
0.444ArgTrp: 0.444 ± 0.344
2.665ArgTyr: 2.665 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
3.554SerAla: 3.554 ± 2.13
0.0SerCys: 0.0 ± 0.0
3.554SerAsp: 3.554 ± 0.67
6.664SerGlu: 6.664 ± 1.27
3.11SerPhe: 3.11 ± 0.906
4.887SerGly: 4.887 ± 0.917
0.888SerHis: 0.888 ± 0.688
1.333SerIle: 1.333 ± 0.663
1.777SerLys: 1.777 ± 0.963
6.664SerLeu: 6.664 ± 0.679
1.777SerMet: 1.777 ± 0.978
3.11SerAsn: 3.11 ± 1.976
3.998SerPro: 3.998 ± 1.19
2.665SerGln: 2.665 ± 1.074
4.887SerArg: 4.887 ± 1.322
5.775SerSer: 5.775 ± 1.863
7.552SerThr: 7.552 ± 2.401
3.554SerVal: 3.554 ± 1.459
1.333SerTrp: 1.333 ± 0.669
2.221SerTyr: 2.221 ± 0.962
0.0SerXaa: 0.0 ± 0.0
Thr
4.887ThrAla: 4.887 ± 1.16
3.554ThrCys: 3.554 ± 0.995
5.331ThrAsp: 5.331 ± 1.005
3.554ThrGlu: 3.554 ± 0.67
1.777ThrPhe: 1.777 ± 0.813
7.108ThrGly: 7.108 ± 1.309
0.888ThrHis: 0.888 ± 0.848
2.665ThrIle: 2.665 ± 0.833
2.665ThrLys: 2.665 ± 0.69
5.775ThrLeu: 5.775 ± 1.119
1.333ThrMet: 1.333 ± 0.446
0.888ThrAsn: 0.888 ± 0.418
5.331ThrPro: 5.331 ± 0.91
3.998ThrGln: 3.998 ± 1.311
3.11ThrArg: 3.11 ± 1.59
10.662ThrSer: 10.662 ± 2.187
6.664ThrThr: 6.664 ± 1.366
6.664ThrVal: 6.664 ± 1.641
1.333ThrTrp: 1.333 ± 1.272
1.777ThrTyr: 1.777 ± 0.518
0.0ThrXaa: 0.0 ± 0.0
Val
8.441ValAla: 8.441 ± 0.894
1.333ValCys: 1.333 ± 0.783
4.887ValAsp: 4.887 ± 1.511
3.11ValGlu: 3.11 ± 1.157
2.221ValPhe: 2.221 ± 0.962
4.442ValGly: 4.442 ± 1.309
0.888ValHis: 0.888 ± 0.406
1.777ValIle: 1.777 ± 0.23
2.665ValLys: 2.665 ± 0.66
4.442ValLeu: 4.442 ± 1.278
0.888ValMet: 0.888 ± 0.418
2.665ValAsn: 2.665 ± 0.528
6.664ValPro: 6.664 ± 1.979
2.221ValGln: 2.221 ± 0.739
2.665ValArg: 2.665 ± 1.219
4.887ValSer: 4.887 ± 0.884
5.775ValThr: 5.775 ± 2.287
6.219ValVal: 6.219 ± 1.29
0.888ValTrp: 0.888 ± 0.518
2.221ValTyr: 2.221 ± 0.68
0.0ValXaa: 0.0 ± 0.0
Trp
1.777TrpAla: 1.777 ± 0.605
0.0TrpCys: 0.0 ± 0.0
0.444TrpAsp: 0.444 ± 0.344
0.888TrpGlu: 0.888 ± 0.418
1.777TrpPhe: 1.777 ± 0.545
0.888TrpGly: 0.888 ± 0.428
0.444TrpHis: 0.444 ± 0.538
0.888TrpIle: 0.888 ± 0.595
1.333TrpLys: 1.333 ± 0.328
2.665TrpLeu: 2.665 ± 1.338
0.444TrpMet: 0.444 ± 0.424
0.888TrpAsn: 0.888 ± 0.577
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.777TrpArg: 1.777 ± 1.19
0.888TrpSer: 0.888 ± 0.438
1.333TrpThr: 1.333 ± 1.272
0.888TrpVal: 0.888 ± 0.438
0.0TrpTrp: 0.0 ± 0.0
0.444TrpTyr: 0.444 ± 0.424
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.665TyrAla: 2.665 ± 0.983
0.444TyrCys: 0.444 ± 0.344
2.665TyrAsp: 2.665 ± 0.78
1.777TyrGlu: 1.777 ± 0.545
1.333TyrPhe: 1.333 ± 0.738
2.665TyrGly: 2.665 ± 0.41
0.0TyrHis: 0.0 ± 0.0
0.888TyrIle: 0.888 ± 0.428
0.888TyrLys: 0.888 ± 0.428
3.554TyrLeu: 3.554 ± 1.398
1.777TyrMet: 1.777 ± 0.605
2.221TyrAsn: 2.221 ± 0.329
1.777TyrPro: 1.777 ± 0.855
1.333TyrGln: 1.333 ± 0.579
2.221TyrArg: 2.221 ± 1.18
2.665TyrSer: 2.665 ± 0.844
1.333TyrThr: 1.333 ± 0.631
3.998TyrVal: 3.998 ± 1.434
1.777TyrTrp: 1.777 ± 0.513
1.333TyrTyr: 1.333 ± 0.87
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2252 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski