Amino acid dipepetide frequency for Rusa timorensis papillomavirus type 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.5AlaAla: 5.5 ± 2.237
1.833AlaCys: 1.833 ± 0.954
4.583AlaAsp: 4.583 ± 0.778
6.416AlaGlu: 6.416 ± 1.005
0.917AlaPhe: 0.917 ± 0.677
4.125AlaGly: 4.125 ± 0.689
0.0AlaHis: 0.0 ± 0.0
2.291AlaIle: 2.291 ± 1.439
4.125AlaLys: 4.125 ± 1.478
5.041AlaLeu: 5.041 ± 0.928
1.833AlaMet: 1.833 ± 0.27
1.833AlaAsn: 1.833 ± 0.195
4.125AlaPro: 4.125 ± 1.114
2.75AlaGln: 2.75 ± 1.505
3.208AlaArg: 3.208 ± 0.86
4.125AlaSer: 4.125 ± 0.554
4.125AlaThr: 4.125 ± 0.554
4.583AlaVal: 4.583 ± 1.396
0.917AlaTrp: 0.917 ± 0.675
1.833AlaTyr: 1.833 ± 0.979
0.0AlaXaa: 0.0 ± 0.0
Cys
2.291CysAla: 2.291 ± 0.857
0.917CysCys: 0.917 ± 0.698
0.458CysAsp: 0.458 ± 0.338
2.291CysGlu: 2.291 ± 0.81
1.833CysPhe: 1.833 ± 1.166
1.375CysGly: 1.375 ± 1.001
0.0CysHis: 0.0 ± 0.0
1.375CysIle: 1.375 ± 0.646
0.917CysLys: 0.917 ± 0.432
3.666CysLeu: 3.666 ± 1.498
0.0CysMet: 0.0 ± 0.0
0.917CysAsn: 0.917 ± 0.675
1.833CysPro: 1.833 ± 1.349
0.917CysGln: 0.917 ± 0.698
2.291CysArg: 2.291 ± 0.322
1.833CysSer: 1.833 ± 0.725
2.291CysThr: 2.291 ± 0.81
1.375CysVal: 1.375 ± 0.689
0.0CysTrp: 0.0 ± 0.0
0.917CysTyr: 0.917 ± 0.477
0.0CysXaa: 0.0 ± 0.0
Asp
2.75AspAla: 2.75 ± 1.12
2.291AspCys: 2.291 ± 0.779
3.666AspAsp: 3.666 ± 1.211
4.125AspGlu: 4.125 ± 1.788
3.666AspPhe: 3.666 ± 1.16
4.125AspGly: 4.125 ± 1.16
0.458AspHis: 0.458 ± 0.456
3.666AspIle: 3.666 ± 1.108
2.75AspLys: 2.75 ± 0.905
4.125AspLeu: 4.125 ± 0.832
0.458AspMet: 0.458 ± 0.352
4.125AspAsn: 4.125 ± 0.782
2.75AspPro: 2.75 ± 0.429
1.833AspGln: 1.833 ± 1.408
1.833AspArg: 1.833 ± 0.534
8.708AspSer: 8.708 ± 2.268
2.75AspThr: 2.75 ± 1.155
4.583AspVal: 4.583 ± 1.786
0.917AspTrp: 0.917 ± 0.521
1.375AspTyr: 1.375 ± 0.752
0.0AspXaa: 0.0 ± 0.0
Glu
5.041GluAla: 5.041 ± 1.578
1.375GluCys: 1.375 ± 0.694
5.041GluAsp: 5.041 ± 1.13
3.208GluGlu: 3.208 ± 0.618
0.458GluPhe: 0.458 ± 0.352
3.666GluGly: 3.666 ± 1.384
0.917GluHis: 0.917 ± 0.389
1.375GluIle: 1.375 ± 0.347
3.666GluLys: 3.666 ± 1.274
5.041GluLeu: 5.041 ± 0.816
0.917GluMet: 0.917 ± 0.648
3.208GluAsn: 3.208 ± 1.167
3.208GluPro: 3.208 ± 1.111
4.583GluGln: 4.583 ± 0.793
2.291GluArg: 2.291 ± 1.246
5.041GluSer: 5.041 ± 1.182
3.666GluThr: 3.666 ± 2.279
3.208GluVal: 3.208 ± 1.007
0.0GluTrp: 0.0 ± 0.0
1.375GluTyr: 1.375 ± 0.672
0.0GluXaa: 0.0 ± 0.0
Phe
3.208PheAla: 3.208 ± 1.137
1.375PheCys: 1.375 ± 0.67
0.458PheAsp: 0.458 ± 0.338
1.833PheGlu: 1.833 ± 0.646
0.917PhePhe: 0.917 ± 0.417
3.666PheGly: 3.666 ± 1.446
0.0PheHis: 0.0 ± 0.0
1.833PheIle: 1.833 ± 0.954
3.208PheLys: 3.208 ± 1.443
5.041PheLeu: 5.041 ± 1.493
0.458PheMet: 0.458 ± 0.352
2.75PheAsn: 2.75 ± 1.387
0.917PhePro: 0.917 ± 0.384
1.375PheGln: 1.375 ± 1.013
1.833PheArg: 1.833 ± 0.567
2.75PheSer: 2.75 ± 1.291
3.208PheThr: 3.208 ± 1.011
2.291PheVal: 2.291 ± 0.648
1.375PheTrp: 1.375 ± 0.377
3.208PheTyr: 3.208 ± 1.196
0.0PheXaa: 0.0 ± 0.0
Gly
3.666GlyAla: 3.666 ± 1.108
1.833GlyCys: 1.833 ± 0.909
4.583GlyAsp: 4.583 ± 0.992
7.333GlyGlu: 7.333 ± 1.086
1.375GlyPhe: 1.375 ± 0.347
7.791GlyGly: 7.791 ± 1.767
1.375GlyHis: 1.375 ± 0.694
3.208GlyIle: 3.208 ± 1.234
2.291GlyLys: 2.291 ± 0.322
3.208GlyLeu: 3.208 ± 1.301
1.375GlyMet: 1.375 ± 0.448
4.125GlyAsn: 4.125 ± 0.845
2.291GlyPro: 2.291 ± 0.485
2.75GlyGln: 2.75 ± 0.636
5.5GlyArg: 5.5 ± 2.077
6.416GlySer: 6.416 ± 2.44
6.874GlyThr: 6.874 ± 0.824
6.874GlyVal: 6.874 ± 1.918
0.458GlyTrp: 0.458 ± 0.456
1.375GlyTyr: 1.375 ± 0.672
0.0GlyXaa: 0.0 ± 0.0
His
0.917HisAla: 0.917 ± 0.432
0.0HisCys: 0.0 ± 0.0
0.917HisAsp: 0.917 ± 0.675
0.0HisGlu: 0.0 ± 0.0
0.458HisPhe: 0.458 ± 0.456
1.833HisGly: 1.833 ± 0.954
1.375HisHis: 1.375 ± 0.657
1.375HisIle: 1.375 ± 0.864
2.291HisLys: 2.291 ± 0.652
0.0HisLeu: 0.0 ± 0.0
0.458HisMet: 0.458 ± 0.331
1.375HisAsn: 1.375 ± 0.703
1.833HisPro: 1.833 ± 0.977
0.458HisGln: 0.458 ± 0.338
0.458HisArg: 0.458 ± 0.338
0.458HisSer: 0.458 ± 0.339
0.917HisThr: 0.917 ± 0.698
1.375HisVal: 1.375 ± 0.448
0.0HisTrp: 0.0 ± 0.0
1.375HisTyr: 1.375 ± 0.347
0.0HisXaa: 0.0 ± 0.0
Ile
2.291IleAla: 2.291 ± 1.257
1.833IleCys: 1.833 ± 0.979
2.75IleAsp: 2.75 ± 0.857
5.5IleGlu: 5.5 ± 0.608
1.833IlePhe: 1.833 ± 0.646
4.125IleGly: 4.125 ± 1.38
0.917IleHis: 0.917 ± 0.432
0.458IleIle: 0.458 ± 0.352
1.375IleLys: 1.375 ± 0.646
3.208IleLeu: 3.208 ± 0.888
1.375IleMet: 1.375 ± 0.417
2.75IleAsn: 2.75 ± 0.879
1.833IlePro: 1.833 ± 0.778
1.833IleGln: 1.833 ± 1.166
0.458IleArg: 0.458 ± 0.352
3.666IleSer: 3.666 ± 1.088
4.583IleThr: 4.583 ± 0.585
0.917IleVal: 0.917 ± 0.417
0.0IleTrp: 0.0 ± 0.0
0.917IleTyr: 0.917 ± 0.477
0.0IleXaa: 0.0 ± 0.0
Lys
4.125LysAla: 4.125 ± 1.018
3.208LysCys: 3.208 ± 0.969
1.375LysAsp: 1.375 ± 0.752
1.833LysGlu: 1.833 ± 0.91
2.291LysPhe: 2.291 ± 0.81
1.375LysGly: 1.375 ± 1.001
2.291LysHis: 2.291 ± 1.023
1.375LysIle: 1.375 ± 0.694
3.208LysLys: 3.208 ± 1.167
4.125LysLeu: 4.125 ± 1.466
0.917LysMet: 0.917 ± 0.417
3.666LysAsn: 3.666 ± 1.557
1.833LysPro: 1.833 ± 1.296
2.291LysGln: 2.291 ± 0.387
5.041LysArg: 5.041 ± 0.416
4.125LysSer: 4.125 ± 1.568
3.666LysThr: 3.666 ± 0.471
2.75LysVal: 2.75 ± 0.39
0.917LysTrp: 0.917 ± 0.757
3.208LysTyr: 3.208 ± 1.568
0.0LysXaa: 0.0 ± 0.0
Leu
4.583LeuAla: 4.583 ± 1.288
2.75LeuCys: 2.75 ± 0.884
4.125LeuAsp: 4.125 ± 1.148
4.125LeuGlu: 4.125 ± 0.729
1.833LeuPhe: 1.833 ± 0.834
6.874LeuGly: 6.874 ± 1.928
1.375LeuHis: 1.375 ± 0.657
3.666LeuIle: 3.666 ± 0.755
6.874LeuLys: 6.874 ± 2.215
8.249LeuLeu: 8.249 ± 2.2
0.917LeuMet: 0.917 ± 0.389
4.125LeuAsn: 4.125 ± 0.695
5.041LeuPro: 5.041 ± 2.02
3.208LeuGln: 3.208 ± 0.866
5.041LeuArg: 5.041 ± 1.333
5.5LeuSer: 5.5 ± 0.899
5.5LeuThr: 5.5 ± 1.34
3.666LeuVal: 3.666 ± 1.143
2.291LeuTrp: 2.291 ± 0.649
4.125LeuTyr: 4.125 ± 1.49
0.0LeuXaa: 0.0 ± 0.0
Met
2.291MetAla: 2.291 ± 0.387
0.0MetCys: 0.0 ± 0.0
1.375MetAsp: 1.375 ± 0.347
0.458MetGlu: 0.458 ± 0.456
0.917MetPhe: 0.917 ± 0.389
0.0MetGly: 0.0 ± 0.0
0.917MetHis: 0.917 ± 0.417
1.375MetIle: 1.375 ± 0.703
0.0MetLys: 0.0 ± 0.0
2.75MetLeu: 2.75 ± 1.117
0.458MetMet: 0.458 ± 0.316
0.917MetAsn: 0.917 ± 0.477
0.917MetPro: 0.917 ± 0.675
1.375MetGln: 1.375 ± 0.627
1.375MetArg: 1.375 ± 1.013
0.917MetSer: 0.917 ± 0.389
0.458MetThr: 0.458 ± 0.352
1.833MetVal: 1.833 ± 0.798
0.458MetTrp: 0.458 ± 0.456
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.833AsnAla: 1.833 ± 0.567
0.458AsnCys: 0.458 ± 0.338
3.666AsnAsp: 3.666 ± 0.929
3.208AsnGlu: 3.208 ± 0.33
2.75AsnPhe: 2.75 ± 0.665
2.291AsnGly: 2.291 ± 1.331
0.917AsnHis: 0.917 ± 0.417
2.75AsnIle: 2.75 ± 1.186
2.291AsnLys: 2.291 ± 1.182
3.666AsnLeu: 3.666 ± 0.713
1.375AsnMet: 1.375 ± 0.377
1.833AsnAsn: 1.833 ± 0.767
2.75AsnPro: 2.75 ± 0.668
3.208AsnGln: 3.208 ± 1.026
2.291AsnArg: 2.291 ± 0.404
2.291AsnSer: 2.291 ± 0.651
3.208AsnThr: 3.208 ± 1.346
4.125AsnVal: 4.125 ± 0.537
0.458AsnTrp: 0.458 ± 0.352
0.917AsnTyr: 0.917 ± 1.253
0.0AsnXaa: 0.0 ± 0.0
Pro
7.791ProAla: 7.791 ± 1.879
0.458ProCys: 0.458 ± 0.456
5.958ProAsp: 5.958 ± 1.442
1.833ProGlu: 1.833 ± 0.58
1.833ProPhe: 1.833 ± 0.74
4.583ProGly: 4.583 ± 1.609
0.0ProHis: 0.0 ± 0.0
2.291ProIle: 2.291 ± 0.894
5.041ProLys: 5.041 ± 1.363
5.041ProLeu: 5.041 ± 1.654
0.458ProMet: 0.458 ± 0.338
1.833ProAsn: 1.833 ± 1.408
4.583ProPro: 4.583 ± 1.288
1.833ProGln: 1.833 ± 1.043
0.917ProArg: 0.917 ± 0.384
4.125ProSer: 4.125 ± 0.537
1.833ProThr: 1.833 ± 1.256
3.208ProVal: 3.208 ± 0.796
0.458ProTrp: 0.458 ± 0.456
1.833ProTyr: 1.833 ± 0.613
0.0ProXaa: 0.0 ± 0.0
Gln
2.291GlnAla: 2.291 ± 0.73
0.917GlnCys: 0.917 ± 0.675
0.458GlnAsp: 0.458 ± 0.352
0.458GlnGlu: 0.458 ± 0.456
3.666GlnPhe: 3.666 ± 1.305
2.75GlnGly: 2.75 ± 1.25
0.917GlnHis: 0.917 ± 0.911
0.458GlnIle: 0.458 ± 0.338
1.833GlnLys: 1.833 ± 0.778
3.666GlnLeu: 3.666 ± 1.186
0.917GlnMet: 0.917 ± 0.477
1.375GlnAsn: 1.375 ± 0.647
2.75GlnPro: 2.75 ± 1.109
1.375GlnGln: 1.375 ± 0.377
2.291GlnArg: 2.291 ± 1.341
3.208GlnSer: 3.208 ± 0.969
4.583GlnThr: 4.583 ± 1.419
4.125GlnVal: 4.125 ± 1.289
0.458GlnTrp: 0.458 ± 0.338
1.833GlnTyr: 1.833 ± 0.767
0.0GlnXaa: 0.0 ± 0.0
Arg
3.666ArgAla: 3.666 ± 0.618
0.917ArgCys: 0.917 ± 0.477
2.291ArgAsp: 2.291 ± 0.609
0.917ArgGlu: 0.917 ± 0.417
3.208ArgPhe: 3.208 ± 0.548
7.333ArgGly: 7.333 ± 0.78
2.291ArgHis: 2.291 ± 0.688
1.833ArgIle: 1.833 ± 0.74
5.041ArgLys: 5.041 ± 0.922
5.041ArgLeu: 5.041 ± 1.004
0.0ArgMet: 0.0 ± 0.0
1.833ArgAsn: 1.833 ± 0.91
3.208ArgPro: 3.208 ± 1.16
1.375ArgGln: 1.375 ± 0.627
4.583ArgArg: 4.583 ± 1.829
2.75ArgSer: 2.75 ± 0.636
1.833ArgThr: 1.833 ± 0.544
2.291ArgVal: 2.291 ± 0.906
0.458ArgTrp: 0.458 ± 0.456
2.291ArgTyr: 2.291 ± 0.837
0.0ArgXaa: 0.0 ± 0.0
Ser
4.583SerAla: 4.583 ± 1.773
2.75SerCys: 2.75 ± 1.292
4.583SerAsp: 4.583 ± 1.288
5.5SerGlu: 5.5 ± 1.638
2.75SerPhe: 2.75 ± 1.16
5.041SerGly: 5.041 ± 1.966
0.917SerHis: 0.917 ± 0.417
2.75SerIle: 2.75 ± 0.693
0.917SerLys: 0.917 ± 0.389
9.166SerLeu: 9.166 ± 1.948
1.375SerMet: 1.375 ± 0.808
3.208SerAsn: 3.208 ± 0.839
4.583SerPro: 4.583 ± 0.702
3.666SerGln: 3.666 ± 1.084
4.125SerArg: 4.125 ± 1.375
8.249SerSer: 8.249 ± 2.089
4.125SerThr: 4.125 ± 1.402
5.958SerVal: 5.958 ± 2.677
1.375SerTrp: 1.375 ± 0.627
1.375SerTyr: 1.375 ± 0.632
0.0SerXaa: 0.0 ± 0.0
Thr
2.75ThrAla: 2.75 ± 1.43
0.917ThrCys: 0.917 ± 0.911
6.416ThrAsp: 6.416 ± 0.875
3.208ThrGlu: 3.208 ± 1.187
2.75ThrPhe: 2.75 ± 1.117
7.333ThrGly: 7.333 ± 1.888
0.917ThrHis: 0.917 ± 0.389
5.041ThrIle: 5.041 ± 1.482
1.833ThrLys: 1.833 ± 0.646
3.666ThrLeu: 3.666 ± 1.237
2.291ThrMet: 2.291 ± 1.023
0.917ThrAsn: 0.917 ± 0.699
4.125ThrPro: 4.125 ± 1.009
2.291ThrGln: 2.291 ± 0.404
3.666ThrArg: 3.666 ± 1.244
5.958ThrSer: 5.958 ± 1.436
6.874ThrThr: 6.874 ± 0.773
7.333ThrVal: 7.333 ± 1.935
1.375ThrTrp: 1.375 ± 0.627
1.375ThrTyr: 1.375 ± 0.752
0.0ThrXaa: 0.0 ± 0.0
Val
2.75ValAla: 2.75 ± 0.905
2.291ValCys: 2.291 ± 0.652
5.5ValAsp: 5.5 ± 1.234
2.75ValGlu: 2.75 ± 1.155
5.041ValPhe: 5.041 ± 1.38
4.125ValGly: 4.125 ± 0.917
1.833ValHis: 1.833 ± 0.778
3.666ValIle: 3.666 ± 1.04
4.125ValLys: 4.125 ± 1.458
4.583ValLeu: 4.583 ± 1.3
0.458ValMet: 0.458 ± 0.352
3.666ValAsn: 3.666 ± 1.384
4.125ValPro: 4.125 ± 1.024
1.833ValGln: 1.833 ± 0.544
2.75ValArg: 2.75 ± 0.785
4.125ValSer: 4.125 ± 1.932
6.874ValThr: 6.874 ± 3.866
2.291ValVal: 2.291 ± 1.693
0.917ValTrp: 0.917 ± 0.477
1.375ValTyr: 1.375 ± 0.864
0.0ValXaa: 0.0 ± 0.0
Trp
0.917TrpAla: 0.917 ± 0.675
0.0TrpCys: 0.0 ± 0.0
1.375TrpAsp: 1.375 ± 0.654
1.375TrpGlu: 1.375 ± 0.864
0.917TrpPhe: 0.917 ± 0.432
0.917TrpGly: 0.917 ± 0.477
0.0TrpHis: 0.0 ± 0.0
0.458TrpIle: 0.458 ± 0.352
0.917TrpLys: 0.917 ± 0.417
1.375TrpLeu: 1.375 ± 1.013
0.917TrpMet: 0.917 ± 0.675
0.0TrpAsn: 0.0 ± 0.0
0.458TrpPro: 0.458 ± 0.456
0.458TrpGln: 0.458 ± 0.352
1.375TrpArg: 1.375 ± 1.001
0.0TrpSer: 0.0 ± 0.0
0.917TrpThr: 0.917 ± 0.521
0.458TrpVal: 0.458 ± 0.338
0.0TrpTrp: 0.0 ± 0.0
1.375TrpTyr: 1.375 ± 0.448
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.917TyrAla: 0.917 ± 0.417
1.375TyrCys: 1.375 ± 0.752
1.375TyrAsp: 1.375 ± 0.627
1.375TyrGlu: 1.375 ± 0.347
2.291TyrPhe: 2.291 ± 0.81
1.375TyrGly: 1.375 ± 0.448
0.458TyrHis: 0.458 ± 0.338
1.375TyrIle: 1.375 ± 0.694
0.917TyrLys: 0.917 ± 0.389
3.208TyrLeu: 3.208 ± 0.877
1.375TyrMet: 1.375 ± 0.646
1.833TyrAsn: 1.833 ± 0.767
2.75TyrPro: 2.75 ± 0.429
0.917TyrGln: 0.917 ± 0.477
1.833TyrArg: 1.833 ± 0.767
2.75TyrSer: 2.75 ± 1.155
2.75TyrThr: 2.75 ± 1.689
1.833TyrVal: 1.833 ± 0.909
1.375TyrTrp: 1.375 ± 0.694
1.833TyrTyr: 1.833 ± 0.777
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2183 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski