Amino acid dipeptide frequency for Lagenorhynchus acutus papillomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
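A minimal sketch of how such dipeptide frequencies could be computed is given here. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact counting and normalisation used by Proteome-pI are assumptions: overlapping pairs are counted within each protein (never across protein boundaries), and any non-standard residue is mapped to X (Xaa).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral proteins: "B" is non-standard and becomes X.
freqs = dipeptide_per_mille(["MAAS", "AAB"])
```

In this toy input there are five dipeptides in total (MA, AA, AS, AA, AX), so AA accounts for two of the five.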

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
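The expected value and the red/blue marking can be illustrated with a short sketch. The helper names `expected_per_mille` and `classify` are hypothetical, and the simple comparison against the uniform expectation is an assumption; the page does not state the exact rule it uses for colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected=expected_per_mille()):
    """Assumed rule: above the expectation is red, below it is blue."""
    if observed_per_mille > expected:
        return "red"
    if observed_per_mille < expected:
        return "blue"
    return "none"


uniform_20 = expected_per_mille()     # 20 x 20 = 400 dipeptides
uniform_21 = expected_per_mille(21)   # 21 x 21 = 441 dipeptides, with Xaa
leu_leu = classify(9.76)              # LeuLeu value from the table
cys_cys = classify(0.407)             # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰; including Xaa lowers it to about 2.27‰.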

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
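As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The function names are hypothetical; the input values are the LeuLeu entry from the table and the 2.5‰ uniform expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


leu_leu_fraction = per_mille_to_fraction(9.76)   # LeuLeu from the table
uniform_percent = per_mille_to_percent(2.5)      # uniform expectation
```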

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.693 ± 1.114
AlaCys: 2.44 ± 0.878
AlaAsp: 4.88 ± 1.604
AlaGlu: 7.32 ± 0.92
AlaPhe: 2.44 ± 0.751
AlaGly: 4.067 ± 1.284
AlaHis: 1.22 ± 0.436
AlaIle: 3.253 ± 0.995
AlaLys: 2.847 ± 0.944
AlaLeu: 4.88 ± 1.329
AlaMet: 1.22 ± 0.622
AlaAsn: 2.033 ± 0.771
AlaPro: 2.847 ± 0.791
AlaGln: 1.22 ± 0.349
AlaArg: 2.44 ± 0.88
AlaSer: 7.32 ± 2.561
AlaThr: 4.473 ± 0.744
AlaVal: 3.66 ± 0.632
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.033 ± 0.506
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.22 ± 0.687
CysCys: 0.407 ± 0.484
CysAsp: 0.813 ± 0.439
CysGlu: 2.033 ± 0.664
CysPhe: 0.813 ± 0.683
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.627 ± 1.05
CysLys: 0.407 ± 0.341
CysLeu: 2.033 ± 1.172
CysMet: 0.407 ± 0.341
CysAsn: 1.22 ± 0.687
CysPro: 2.033 ± 0.715
CysGln: 1.22 ± 0.687
CysArg: 1.627 ± 0.639
CysSer: 2.033 ± 1.101
CysThr: 2.033 ± 1.161
CysVal: 2.033 ± 0.589
CysTrp: 1.627 ± 0.599
CysTyr: 1.22 ± 0.906
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.067 ± 0.915
AspCys: 2.033 ± 0.749
AspAsp: 4.88 ± 1.248
AspGlu: 4.88 ± 1.786
AspPhe: 3.253 ± 1.002
AspGly: 2.44 ± 0.839
AspHis: 1.22 ± 1.539
AspIle: 1.627 ± 0.992
AspLys: 2.033 ± 0.844
AspLeu: 6.1 ± 1.996
AspMet: 2.033 ± 0.594
AspAsn: 1.627 ± 0.678
AspPro: 2.44 ± 0.765
AspGln: 1.22 ± 0.349
AspArg: 0.813 ± 0.608
AspSer: 7.32 ± 1.316
AspThr: 4.473 ± 0.645
AspVal: 4.067 ± 1.342
AspTrp: 1.627 ± 0.968
AspTyr: 1.22 ± 0.673
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.067 ± 2.052
GluCys: 0.813 ± 0.439
GluAsp: 4.067 ± 1.012
GluGlu: 6.1 ± 1.125
GluPhe: 1.22 ± 0.349
GluGly: 6.913 ± 1.817
GluHis: 0.813 ± 0.671
GluIle: 2.033 ± 0.858
GluLys: 2.033 ± 1.226
GluLeu: 4.88 ± 1.722
GluMet: 1.22 ± 0.687
GluAsn: 2.847 ± 0.692
GluPro: 2.033 ± 0.67
GluGln: 2.847 ± 1.07
GluArg: 2.847 ± 1.169
GluSer: 4.88 ± 1.445
GluThr: 4.473 ± 1.565
GluVal: 2.847 ± 0.927
GluTrp: 0.407 ± 0.383
GluTyr: 0.813 ± 0.439
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.847 ± 0.744
PheCys: 0.813 ± 0.569
PheAsp: 2.033 ± 1.09
PheGlu: 1.22 ± 0.585
PhePhe: 1.627 ± 0.478
PheGly: 3.253 ± 0.982
PheHis: 0.813 ± 0.496
PheIle: 0.813 ± 0.683
PheLys: 2.847 ± 0.395
PheLeu: 5.693 ± 1.47
PheMet: 0.0 ± 0.316
PheAsn: 1.627 ± 0.902
PhePro: 1.627 ± 1.054
PheGln: 1.627 ± 0.992
PheArg: 1.627 ± 0.579
PheSer: 2.033 ± 1.271
PheThr: 1.627 ± 0.566
PheVal: 2.44 ± 1.004
PheTrp: 2.033 ± 0.631
PheTyr: 1.22 ± 0.651
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.473 ± 0.923
GlyCys: 2.033 ± 0.589
GlyAsp: 6.913 ± 0.841
GlyGlu: 4.88 ± 1.968
GlyPhe: 2.033 ± 1.09
GlyGly: 5.287 ± 1.389
GlyHis: 2.44 ± 0.971
GlyIle: 2.847 ± 1.031
GlyLys: 3.253 ± 1.054
GlyLeu: 3.66 ± 0.382
GlyMet: 0.813 ± 0.365
GlyAsn: 2.847 ± 0.799
GlyPro: 4.067 ± 1.689
GlyGln: 2.033 ± 0.592
GlyArg: 6.913 ± 1.989
GlySer: 7.727 ± 1.762
GlyThr: 4.473 ± 1.98
GlyVal: 2.033 ± 0.415
GlyTrp: 0.407 ± 0.336
GlyTyr: 0.407 ± 0.383
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.407 ± 0.335
HisCys: 0.407 ± 0.336
HisAsp: 0.813 ± 0.392
HisGlu: 0.0 ± 0.0
HisPhe: 1.22 ± 0.488
HisGly: 0.813 ± 0.365
HisHis: 2.847 ± 3.073
HisIle: 1.627 ± 1.097
HisLys: 1.627 ± 0.588
HisLeu: 4.067 ± 2.477
HisMet: 0.0 ± 0.0
HisAsn: 1.22 ± 0.655
HisPro: 4.067 ± 2.723
HisGln: 0.813 ± 0.608
HisArg: 2.847 ± 1.454
HisSer: 2.033 ± 0.44
HisThr: 1.627 ± 0.626
HisVal: 1.22 ± 0.701
HisTrp: 0.407 ± 0.336
HisTyr: 0.407 ± 0.335
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.627 ± 0.678
IleCys: 0.813 ± 0.617
IleAsp: 5.693 ± 1.66
IleGlu: 2.44 ± 0.455
IlePhe: 1.627 ± 1.012
IleGly: 2.847 ± 1.156
IleHis: 0.813 ± 0.392
IleIle: 1.22 ± 0.684
IleLys: 0.407 ± 0.341
IleLeu: 3.253 ± 0.62
IleMet: 0.813 ± 0.425
IleAsn: 0.813 ± 0.671
IlePro: 4.473 ± 1.327
IleGln: 1.627 ± 0.74
IleArg: 1.627 ± 0.478
IleSer: 2.44 ± 0.845
IleThr: 2.033 ± 0.415
IleVal: 1.22 ± 0.701
IleTrp: 0.0 ± 0.0
IleTyr: 1.22 ± 0.726
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.66 ± 0.598
LysCys: 2.033 ± 0.893
LysAsp: 2.44 ± 0.729
LysGlu: 2.847 ± 1.407
LysPhe: 1.22 ± 0.622
LysGly: 0.813 ± 0.392
LysHis: 1.627 ± 0.935
LysIle: 0.813 ± 0.392
LysLys: 2.033 ± 1.476
LysLeu: 2.033 ± 0.664
LysMet: 0.813 ± 0.365
LysAsn: 2.44 ± 0.979
LysPro: 2.033 ± 0.996
LysGln: 1.627 ± 0.731
LysArg: 5.693 ± 1.05
LysSer: 3.66 ± 1.324
LysThr: 2.033 ± 0.506
LysVal: 3.66 ± 1.211
LysTrp: 1.627 ± 0.653
LysTyr: 2.033 ± 0.873
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.473 ± 1.039
LeuCys: 1.627 ± 1.502
LeuAsp: 5.287 ± 1.215
LeuGlu: 4.473 ± 1.36
LeuPhe: 2.847 ± 0.604
LeuGly: 6.913 ± 1.007
LeuHis: 2.033 ± 0.696
LeuIle: 3.66 ± 1.29
LeuLys: 5.287 ± 0.968
LeuLeu: 9.76 ± 2.05
LeuMet: 0.407 ± 0.28
LeuAsn: 2.44 ± 0.633
LeuPro: 4.88 ± 1.565
LeuGln: 6.507 ± 1.377
LeuArg: 6.507 ± 1.089
LeuSer: 10.167 ± 0.937
LeuThr: 4.473 ± 1.109
LeuVal: 4.473 ± 1.155
LeuTrp: 1.627 ± 0.987
LeuTyr: 6.507 ± 1.262
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.813 ± 0.569
MetCys: 0.407 ± 0.336
MetAsp: 1.22 ± 0.622
MetGlu: 0.813 ± 0.766
MetPhe: 1.627 ± 0.254
MetGly: 0.407 ± 0.341
MetHis: 0.407 ± 0.383
MetIle: 1.22 ± 0.614
MetLys: 0.813 ± 0.766
MetLeu: 2.44 ± 0.472
MetMet: 0.0 ± 0.0
MetAsn: 0.813 ± 0.365
MetPro: 0.813 ± 0.683
MetGln: 0.813 ± 0.365
MetArg: 0.813 ± 0.392
MetSer: 2.44 ± 1.165
MetThr: 0.0 ± 0.0
MetVal: 1.627 ± 0.588
MetTrp: 0.0 ± 0.0
MetTyr: 0.407 ± 0.335
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.847 ± 0.997
AsnCys: 2.033 ± 0.951
AsnAsp: 0.813 ± 0.683
AsnGlu: 2.033 ± 0.548
AsnPhe: 1.627 ± 0.705
AsnGly: 2.44 ± 0.729
AsnHis: 0.407 ± 0.383
AsnIle: 1.627 ± 0.658
AsnLys: 2.033 ± 0.667
AsnLeu: 1.22 ± 0.585
AsnMet: 0.407 ± 0.341
AsnAsn: 2.44 ± 1.256
AsnPro: 4.067 ± 1.315
AsnGln: 1.22 ± 0.593
AsnArg: 2.033 ± 0.995
AsnSer: 2.847 ± 0.852
AsnThr: 1.627 ± 0.731
AsnVal: 4.88 ± 1.147
AsnTrp: 0.813 ± 0.496
AsnTyr: 0.813 ± 0.673
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.507 ± 2.159
ProCys: 1.22 ± 0.568
ProAsp: 1.627 ± 0.992
ProGlu: 1.627 ± 0.579
ProPhe: 2.847 ± 0.831
ProGly: 5.287 ± 1.582
ProHis: 3.253 ± 2.619
ProIle: 2.033 ± 0.999
ProLys: 4.067 ± 1.604
ProLeu: 8.54 ± 1.206
ProMet: 1.22 ± 0.655
ProAsn: 3.253 ± 0.959
ProPro: 8.947 ± 5.258
ProGln: 2.847 ± 1.571
ProArg: 3.253 ± 1.131
ProSer: 5.287 ± 1.929
ProThr: 2.44 ± 1.303
ProVal: 5.287 ± 2.71
ProTrp: 0.407 ± 0.513
ProTyr: 0.813 ± 0.608
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.033 ± 0.592
GlnCys: 0.407 ± 0.341
GlnAsp: 1.627 ± 1.518
GlnGlu: 1.627 ± 0.756
GlnPhe: 1.627 ± 0.518
GlnGly: 1.22 ± 0.622
GlnHis: 2.44 ± 1.65
GlnIle: 2.033 ± 0.962
GlnLys: 0.407 ± 0.484
GlnLeu: 4.067 ± 0.83
GlnMet: 1.22 ± 0.935
GlnAsn: 1.627 ± 0.561
GlnPro: 2.847 ± 0.762
GlnGln: 2.44 ± 1.261
GlnArg: 2.44 ± 0.491
GlnSer: 1.22 ± 1.006
GlnThr: 1.627 ± 0.653
GlnVal: 2.44 ± 1.099
GlnTrp: 1.22 ± 1.024
GlnTyr: 0.813 ± 0.673
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.473 ± 1.584
ArgCys: 2.033 ± 0.926
ArgAsp: 1.627 ± 1.05
ArgGlu: 1.627 ± 0.561
ArgPhe: 1.22 ± 0.651
ArgGly: 5.287 ± 1.116
ArgHis: 3.253 ± 1.174
ArgIle: 2.44 ± 1.032
ArgLys: 4.473 ± 1.734
ArgLeu: 7.32 ± 0.919
ArgMet: 0.813 ± 0.416
ArgAsn: 2.033 ± 0.873
ArgPro: 6.507 ± 1.86
ArgGln: 1.627 ± 0.678
ArgArg: 8.133 ± 3.078
ArgSer: 1.627 ± 0.675
ArgThr: 2.44 ± 0.976
ArgVal: 3.253 ± 1.076
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.627 ± 0.841
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.913 ± 1.464
SerCys: 0.407 ± 0.336
SerAsp: 5.693 ± 1.761
SerGlu: 5.693 ± 1.902
SerPhe: 3.253 ± 0.803
SerGly: 7.32 ± 1.272
SerHis: 2.44 ± 0.948
SerIle: 2.847 ± 1.218
SerLys: 3.253 ± 1.182
SerLeu: 8.54 ± 1.766
SerMet: 2.847 ± 0.925
SerAsn: 2.44 ± 0.538
SerPro: 6.507 ± 0.758
SerGln: 0.813 ± 0.605
SerArg: 4.473 ± 0.717
SerSer: 9.353 ± 1.88
SerThr: 6.913 ± 1.55
SerVal: 4.473 ± 1.217
SerTrp: 0.813 ± 0.617
SerTyr: 1.22 ± 0.403
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.033 ± 0.631
ThrCys: 1.22 ± 0.655
ThrAsp: 4.067 ± 1.129
ThrGlu: 2.847 ± 1.353
ThrPhe: 1.627 ± 0.653
ThrGly: 5.693 ± 1.098
ThrHis: 0.407 ± 0.341
ThrIle: 1.627 ± 0.992
ThrLys: 2.033 ± 0.769
ThrLeu: 4.473 ± 1.548
ThrMet: 2.033 ± 1.001
ThrAsn: 1.627 ± 0.427
ThrPro: 5.287 ± 2.097
ThrGln: 1.627 ± 0.935
ThrArg: 4.067 ± 1.102
ThrSer: 4.88 ± 1.25
ThrThr: 4.88 ± 0.798
ThrVal: 6.1 ± 1.86
ThrTrp: 0.813 ± 0.439
ThrTyr: 1.627 ± 0.951
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.473 ± 0.714
ValCys: 2.44 ± 0.971
ValAsp: 2.033 ± 0.589
ValGlu: 2.847 ± 0.944
ValPhe: 2.847 ± 1.087
ValGly: 5.693 ± 1.086
ValHis: 1.22 ± 0.816
ValIle: 2.033 ± 0.893
ValLys: 1.627 ± 0.254
ValLeu: 6.913 ± 0.937
ValMet: 0.813 ± 0.683
ValAsn: 2.847 ± 1.182
ValPro: 4.88 ± 1.475
ValGln: 2.847 ± 1.188
ValArg: 2.033 ± 1.396
ValSer: 6.1 ± 1.799
ValThr: 4.88 ± 1.96
ValVal: 4.473 ± 1.327
ValTrp: 1.22 ± 0.489
ValTyr: 2.847 ± 1.167
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.22 ± 0.655
TrpCys: 0.0 ± 0.0
TrpAsp: 1.22 ± 0.574
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.22 ± 0.436
TrpGly: 0.813 ± 0.617
TrpHis: 0.407 ± 0.335
TrpIle: 0.813 ± 0.683
TrpLys: 1.627 ± 1.366
TrpLeu: 1.22 ± 0.614
TrpMet: 0.0 ± 0.0
TrpAsn: 0.813 ± 0.673
TrpPro: 0.0 ± 0.0
TrpGln: 0.407 ± 0.484
TrpArg: 0.407 ± 0.484
TrpSer: 0.813 ± 0.416
TrpThr: 2.033 ± 0.769
TrpVal: 1.627 ± 0.518
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.407 ± 0.341
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.44 ± 0.472
TyrCys: 1.22 ± 0.75
TyrAsp: 1.627 ± 0.696
TyrGlu: 2.847 ± 1.032
TyrPhe: 1.627 ± 0.596
TyrGly: 2.033 ± 0.589
TyrHis: 0.407 ± 0.336
TyrIle: 0.813 ± 0.671
TyrLys: 2.033 ± 0.844
TyrLeu: 3.253 ± 1.037
TyrMet: 0.407 ± 0.336
TyrAsn: 1.22 ± 0.349
TyrPro: 0.407 ± 0.383
TyrGln: 0.407 ± 0.336
TyrArg: 1.22 ± 0.838
TyrSer: 2.033 ± 1.266
TyrThr: 0.407 ± 0.484
TyrVal: 3.253 ± 0.956
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.033 ± 0.74
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2460 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
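A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration only: the function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the reported error are all assumptions, since the page does not describe the exact procedure.

```python
import random
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Protein-level resampling: draw as many proteins as in the input.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return stdev(estimates)


# Toy sequences in one-letter code, not the real proteome.
toy = ["MAAS", "MKKL", "AAAA"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide that is concentrated in a single protein receives a correspondingly large error estimate.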

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski