Amino acid dipeptide frequency for Achimota virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
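As a rough illustration, the sketch below counts overlapping dipeptides in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, the sequences use one-letter codes rather than the three-letter codes of the table, and counting only within each protein (never across protein boundaries) is an assumption; the database may compute its values differently.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the Achimota virus 1 proteome.
freqs = dipeptide_frequencies(["MALLK", "MAL"])
```

Here `freqs` maps each observed pair (such as `"MA"` or `"LL"`) to its share of all counted dipeptides, so the values sum to 1000.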

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
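The baseline arithmetic can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the simple comparison against the uniform baseline is an assumption; the page does not state the exact rule used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform baseline (assumed rule, see lead-in)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

baseline_20 = expected_per_mille(20)   # 400 dipeptides
baseline_21 = expected_per_mille(21)   # 441 dipeptides, counting Xaa
label_ala_ala = classify(4.701)        # AlaAla value from the table
label_ala_cys = classify(0.588)        # AlaCys value from the table
```

With 20 amino acids the baseline is 2.5‰ (0.25%); counting Xaa as a 21st residue lowers it to about 2.27‰.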

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.701 ± 2.003
AlaCys: 0.588 ± 0.58
AlaAsp: 1.959 ± 0.529
AlaGlu: 2.938 ± 0.499
AlaPhe: 1.959 ± 0.891
AlaGly: 3.526 ± 0.561
AlaHis: 0.784 ± 0.437
AlaIle: 3.526 ± 0.627
AlaLys: 4.114 ± 0.831
AlaLeu: 5.681 ± 1.182
AlaMet: 1.567 ± 0.944
AlaAsn: 2.742 ± 0.561
AlaPro: 3.918 ± 1.0
AlaGln: 2.742 ± 1.154
AlaArg: 4.505 ± 1.967
AlaSer: 5.485 ± 1.078
AlaThr: 3.722 ± 0.374
AlaVal: 3.134 ± 0.4
AlaTrp: 0.979 ± 0.428
AlaTyr: 2.351 ± 0.791
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.979 ± 0.304
CysCys: 0.196 ± 0.117
CysAsp: 0.979 ± 0.433
CysGlu: 1.175 ± 0.413
CysPhe: 0.784 ± 0.293
CysGly: 0.979 ± 0.304
CysHis: 0.392 ± 0.497
CysIle: 0.784 ± 0.314
CysLys: 0.979 ± 0.232
CysLeu: 1.567 ± 0.739
CysMet: 0.392 ± 0.179
CysAsn: 1.763 ± 1.036
CysPro: 0.979 ± 0.564
CysGln: 0.392 ± 0.233
CysArg: 0.784 ± 0.455
CysSer: 0.979 ± 0.42
CysThr: 1.175 ± 0.421
CysVal: 1.959 ± 0.333
CysTrp: 0.0 ± 0.0
CysTyr: 1.567 ± 0.655
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.33 ± 0.407
AspCys: 0.979 ± 0.582
AspAsp: 4.31 ± 1.02
AspGlu: 5.093 ± 0.544
AspPhe: 1.567 ± 0.307
AspGly: 0.979 ± 0.457
AspHis: 2.155 ± 0.512
AspIle: 1.959 ± 0.436
AspLys: 2.155 ± 0.636
AspLeu: 6.072 ± 0.851
AspMet: 0.784 ± 0.234
AspAsn: 2.351 ± 0.485
AspPro: 3.33 ± 1.243
AspGln: 3.526 ± 0.917
AspArg: 2.547 ± 0.679
AspSer: 2.742 ± 0.783
AspThr: 2.938 ± 0.724
AspVal: 1.763 ± 0.489
AspTrp: 0.979 ± 0.261
AspTyr: 1.567 ± 0.541
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.763 ± 0.335
GluCys: 0.784 ± 0.324
GluAsp: 2.547 ± 0.671
GluGlu: 3.918 ± 1.054
GluPhe: 2.547 ± 0.647
GluGly: 2.547 ± 0.53
GluHis: 0.979 ± 0.547
GluIle: 4.701 ± 0.928
GluLys: 3.134 ± 0.844
GluLeu: 5.093 ± 1.181
GluMet: 2.155 ± 1.318
GluAsn: 3.33 ± 0.947
GluPro: 1.371 ± 0.432
GluGln: 2.742 ± 0.594
GluArg: 2.547 ± 0.815
GluSer: 6.072 ± 1.173
GluThr: 2.938 ± 0.762
GluVal: 3.722 ± 0.829
GluTrp: 0.588 ± 0.275
GluTyr: 0.979 ± 0.386
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.567 ± 0.505
PheCys: 0.392 ± 0.179
PheAsp: 0.979 ± 0.329
PheGlu: 1.567 ± 0.38
PhePhe: 1.959 ± 0.45
PheGly: 1.371 ± 0.295
PheHis: 0.392 ± 0.497
PheIle: 2.938 ± 0.747
PheLys: 2.155 ± 0.743
PheLeu: 3.722 ± 0.738
PheMet: 0.784 ± 0.258
PheAsn: 1.763 ± 0.495
PhePro: 1.959 ± 0.517
PheGln: 2.351 ± 1.002
PheArg: 1.763 ± 0.519
PheSer: 3.33 ± 0.728
PheThr: 2.547 ± 0.564
PheVal: 3.722 ± 0.401
PheTrp: 0.196 ± 0.211
PheTyr: 1.175 ± 0.412
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.938 ± 1.204
GlyCys: 0.784 ± 0.594
GlyAsp: 3.918 ± 0.842
GlyGlu: 1.959 ± 0.799
GlyPhe: 1.763 ± 0.541
GlyGly: 2.547 ± 0.777
GlyHis: 0.392 ± 0.233
GlyIle: 3.918 ± 1.267
GlyLys: 1.959 ± 0.545
GlyLeu: 6.464 ± 1.42
GlyMet: 0.979 ± 0.452
GlyAsn: 2.938 ± 0.232
GlyPro: 2.351 ± 1.036
GlyGln: 2.155 ± 0.651
GlyArg: 3.134 ± 1.064
GlySer: 3.526 ± 1.097
GlyThr: 1.175 ± 0.604
GlyVal: 5.093 ± 1.273
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.155 ± 0.615
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.784 ± 0.286
HisCys: 0.392 ± 0.233
HisAsp: 0.784 ± 0.467
HisGlu: 0.392 ± 0.233
HisPhe: 0.784 ± 0.419
HisGly: 1.175 ± 0.625
HisHis: 0.392 ± 0.233
HisIle: 0.979 ± 0.272
HisLys: 0.588 ± 0.24
HisLeu: 2.742 ± 1.243
HisMet: 0.392 ± 0.283
HisAsn: 0.784 ± 0.214
HisPro: 1.567 ± 0.409
HisGln: 1.567 ± 0.632
HisArg: 0.784 ± 0.505
HisSer: 1.567 ± 0.354
HisThr: 1.567 ± 0.434
HisVal: 0.784 ± 0.402
HisTrp: 0.196 ± 0.117
HisTyr: 0.392 ± 0.219
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.897 ± 0.977
IleCys: 1.763 ± 0.503
IleAsp: 3.918 ± 0.671
IleGlu: 6.072 ± 1.004
IlePhe: 2.155 ± 0.858
IleGly: 2.155 ± 0.464
IleHis: 1.371 ± 0.641
IleIle: 4.897 ± 0.982
IleLys: 3.722 ± 1.393
IleLeu: 7.052 ± 0.966
IleMet: 2.547 ± 0.753
IleAsn: 3.526 ± 0.996
IlePro: 2.547 ± 0.674
IleGln: 4.31 ± 1.016
IleArg: 3.722 ± 0.375
IleSer: 7.052 ± 2.395
IleThr: 6.464 ± 1.306
IleVal: 3.722 ± 0.837
IleTrp: 0.588 ± 0.35
IleTyr: 2.155 ± 0.753
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.33 ± 0.882
LysCys: 1.175 ± 0.457
LysAsp: 3.134 ± 0.633
LysGlu: 2.155 ± 0.709
LysPhe: 2.547 ± 0.386
LysGly: 2.742 ± 0.749
LysHis: 1.371 ± 0.435
LysIle: 3.526 ± 0.701
LysLys: 2.742 ± 0.694
LysLeu: 6.268 ± 1.036
LysMet: 1.175 ± 0.413
LysAsn: 2.547 ± 0.502
LysPro: 2.547 ± 0.915
LysGln: 1.175 ± 0.421
LysArg: 2.938 ± 0.555
LysSer: 5.877 ± 2.571
LysThr: 1.959 ± 0.7
LysVal: 3.33 ± 0.695
LysTrp: 0.588 ± 0.454
LysTyr: 1.763 ± 1.081
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.464 ± 0.661
LeuCys: 1.959 ± 0.394
LeuAsp: 5.681 ± 1.905
LeuGlu: 4.701 ± 0.822
LeuPhe: 3.134 ± 1.005
LeuGly: 4.701 ± 0.747
LeuHis: 1.567 ± 0.545
LeuIle: 8.815 ± 1.028
LeuLys: 4.701 ± 1.066
LeuLeu: 8.227 ± 1.465
LeuMet: 3.134 ± 0.594
LeuAsn: 7.444 ± 0.975
LeuPro: 4.505 ± 1.014
LeuGln: 4.31 ± 0.426
LeuArg: 4.897 ± 0.737
LeuSer: 9.207 ± 2.744
LeuThr: 8.031 ± 1.325
LeuVal: 4.897 ± 0.741
LeuTrp: 0.979 ± 0.411
LeuTyr: 4.114 ± 1.064
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.351 ± 0.809
MetCys: 0.588 ± 0.275
MetAsp: 1.763 ± 0.861
MetGlu: 0.979 ± 0.703
MetPhe: 0.0 ± 0.0
MetGly: 1.567 ± 0.789
MetHis: 0.588 ± 0.455
MetIle: 1.959 ± 0.447
MetLys: 1.567 ± 0.354
MetLeu: 2.351 ± 0.484
MetMet: 0.979 ± 0.753
MetAsn: 0.784 ± 0.286
MetPro: 0.196 ± 0.117
MetGln: 2.155 ± 0.632
MetArg: 2.547 ± 0.839
MetSer: 2.155 ± 0.65
MetThr: 1.763 ± 0.621
MetVal: 1.763 ± 0.532
MetTrp: 0.979 ± 0.287
MetTyr: 0.392 ± 0.174
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.959 ± 0.842
AsnCys: 0.979 ± 0.547
AsnAsp: 3.33 ± 0.363
AsnGlu: 2.155 ± 0.535
AsnPhe: 1.567 ± 0.335
AsnGly: 2.742 ± 0.826
AsnHis: 1.567 ± 0.378
AsnIle: 4.114 ± 0.764
AsnLys: 2.938 ± 0.885
AsnLeu: 5.093 ± 1.287
AsnMet: 0.784 ± 0.299
AsnAsn: 2.547 ± 0.793
AsnPro: 3.918 ± 0.591
AsnGln: 2.547 ± 0.365
AsnArg: 3.33 ± 0.6
AsnSer: 3.33 ± 0.679
AsnThr: 3.33 ± 1.027
AsnVal: 2.351 ± 0.682
AsnTrp: 1.175 ± 0.514
AsnTyr: 1.371 ± 0.267
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.33 ± 0.955
ProCys: 0.392 ± 0.218
ProAsp: 3.33 ± 0.493
ProGlu: 2.351 ± 0.537
ProPhe: 2.938 ± 0.57
ProGly: 4.505 ± 0.937
ProHis: 0.196 ± 0.195
ProIle: 3.526 ± 0.772
ProLys: 2.938 ± 0.447
ProLeu: 5.093 ± 0.42
ProMet: 1.763 ± 0.223
ProAsn: 1.959 ± 0.258
ProPro: 5.289 ± 1.778
ProGln: 1.763 ± 0.615
ProArg: 1.959 ± 0.714
ProSer: 6.464 ± 1.964
ProThr: 2.155 ± 0.362
ProVal: 2.351 ± 0.702
ProTrp: 0.392 ± 0.422
ProTyr: 2.155 ± 0.58
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.897 ± 1.494
GlnCys: 1.175 ± 0.384
GlnAsp: 1.567 ± 1.296
GlnGlu: 1.567 ± 0.375
GlnPhe: 2.938 ± 0.574
GlnGly: 2.938 ± 1.029
GlnHis: 0.784 ± 0.419
GlnIle: 4.31 ± 0.975
GlnLys: 4.701 ± 1.409
GlnLeu: 4.701 ± 0.86
GlnMet: 2.155 ± 0.516
GlnAsn: 2.742 ± 0.583
GlnPro: 3.526 ± 1.871
GlnGln: 2.547 ± 1.395
GlnArg: 1.371 ± 0.333
GlnSer: 2.742 ± 1.451
GlnThr: 1.371 ± 0.715
GlnVal: 2.547 ± 1.006
GlnTrp: 0.196 ± 0.117
GlnTyr: 0.784 ± 0.358
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.351 ± 0.533
ArgCys: 0.588 ± 0.35
ArgAsp: 0.784 ± 0.314
ArgGlu: 3.722 ± 0.729
ArgPhe: 1.763 ± 0.341
ArgGly: 3.526 ± 0.725
ArgHis: 0.979 ± 0.244
ArgIle: 3.526 ± 0.73
ArgLys: 2.742 ± 0.494
ArgLeu: 7.248 ± 1.044
ArgMet: 0.979 ± 0.378
ArgAsn: 1.959 ± 0.429
ArgPro: 3.33 ± 1.135
ArgGln: 3.134 ± 1.578
ArgArg: 3.722 ± 0.994
ArgSer: 2.742 ± 0.663
ArgThr: 1.959 ± 0.516
ArgVal: 3.526 ± 0.904
ArgTrp: 0.784 ± 0.48
ArgTyr: 1.567 ± 0.451
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.289 ± 1.098
SerCys: 1.763 ± 1.051
SerAsp: 5.093 ± 0.678
SerGlu: 6.072 ± 1.083
SerPhe: 2.351 ± 0.377
SerGly: 4.505 ± 1.198
SerHis: 2.155 ± 0.509
SerIle: 6.268 ± 0.996
SerLys: 5.093 ± 0.938
SerLeu: 8.031 ± 1.034
SerMet: 2.742 ± 0.997
SerAsn: 2.938 ± 0.696
SerPro: 3.918 ± 0.848
SerGln: 4.31 ± 0.885
SerArg: 2.742 ± 0.822
SerSer: 7.835 ± 1.596
SerThr: 5.289 ± 1.001
SerVal: 6.268 ± 0.93
SerTrp: 0.392 ± 0.233
SerTyr: 1.763 ± 0.369
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.701 ± 1.102
ThrCys: 0.979 ± 0.434
ThrAsp: 3.134 ± 0.738
ThrGlu: 3.722 ± 0.451
ThrPhe: 1.763 ± 0.453
ThrGly: 2.938 ± 0.625
ThrHis: 0.979 ± 0.423
ThrIle: 4.701 ± 0.842
ThrLys: 2.351 ± 0.67
ThrLeu: 6.464 ± 1.287
ThrMet: 0.979 ± 0.272
ThrAsn: 2.742 ± 0.83
ThrPro: 3.918 ± 0.415
ThrGln: 2.742 ± 0.782
ThrArg: 3.526 ± 0.587
ThrSer: 4.897 ± 0.938
ThrThr: 3.134 ± 0.777
ThrVal: 3.134 ± 0.964
ThrTrp: 0.588 ± 0.35
ThrTyr: 1.567 ± 0.526
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.134 ± 0.696
ValCys: 1.959 ± 0.711
ValAsp: 2.351 ± 0.936
ValGlu: 2.351 ± 1.241
ValPhe: 2.351 ± 0.561
ValGly: 2.742 ± 0.59
ValHis: 0.979 ± 0.264
ValIle: 5.681 ± 1.312
ValLys: 2.351 ± 1.172
ValLeu: 4.897 ± 0.664
ValMet: 2.351 ± 0.365
ValAsn: 3.918 ± 0.842
ValPro: 3.33 ± 0.812
ValGln: 2.155 ± 0.715
ValArg: 2.547 ± 0.568
ValSer: 3.918 ± 0.855
ValThr: 5.877 ± 1.304
ValVal: 3.722 ± 0.972
ValTrp: 0.588 ± 0.288
ValTyr: 2.938 ± 0.611
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.588 ± 0.249
TrpCys: 0.392 ± 0.271
TrpAsp: 0.392 ± 0.233
TrpGlu: 0.196 ± 0.117
TrpPhe: 0.588 ± 0.247
TrpGly: 0.588 ± 0.247
TrpHis: 0.0 ± 0.0
TrpIle: 1.763 ± 0.354
TrpLys: 0.784 ± 0.358
TrpLeu: 0.392 ± 0.233
TrpMet: 0.0 ± 0.0
TrpAsn: 0.392 ± 0.311
TrpPro: 0.784 ± 0.214
TrpGln: 0.392 ± 0.179
TrpArg: 0.392 ± 0.219
TrpSer: 1.567 ± 0.354
TrpThr: 0.588 ± 0.275
TrpVal: 0.196 ± 0.205
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.196 ± 0.117
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.567 ± 0.332
TyrCys: 0.979 ± 0.472
TyrAsp: 1.175 ± 0.348
TyrGlu: 1.567 ± 0.632
TyrPhe: 1.175 ± 0.409
TyrGly: 1.371 ± 0.316
TyrHis: 0.588 ± 0.247
TyrIle: 2.742 ± 0.591
TyrLys: 1.175 ± 0.414
TyrLeu: 4.31 ± 1.314
TyrMet: 0.588 ± 0.222
TyrAsn: 1.959 ± 0.519
TyrPro: 1.567 ± 0.536
TyrGln: 2.547 ± 0.743
TyrArg: 1.175 ± 0.344
TyrSer: 3.33 ± 0.648
TyrThr: 0.979 ± 0.419
TyrVal: 2.155 ± 0.465
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.371 ± 0.446
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
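For readers who want to work with the table programmatically, here is a minimal sketch that extracts entries of the form `AlaAla: 4.701 ± 2.003` from text lines. The names `CELL` and `parse_cells` are hypothetical, and the pattern is an assumption based only on the entries shown on this page, not on the format of the downloadable CSV file.

```python
import re

# Matches e.g. "AlaAla: 4.701 ± 2.003", even when other text precedes it on the line.
CELL = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)

def parse_cells(lines):
    """Return {(first, second): (value, error)}, both in per mille."""
    table = {}
    for line in lines:
        match = CELL.search(line)
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table

# Row labels such as "Ala" contain no entry and are skipped.
sample = ["Ala", "AlaAla: 4.701 ± 2.003", "0.588AlaCys: 0.588 ± 0.58"]
cells = parse_cells(sample)
```

Keying the result by a `(first, second)` tuple keeps the order of the two residues explicit, since AlaCys and CysAla are different dipeptides.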
Statistics based on 8 proteins (5106 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
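A protein-level bootstrap can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, the use of the standard deviation as the error measure, and counting dipeptides only within each protein are all assumptions; the note above does not specify these details.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all within-protein dipeptides."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the number of proteins.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)

toy = ["MALLKAL", "MKKLLA", "ALALAL"]  # hypothetical one-letter sequences
err = bootstrap_error(toy, "AL")
```

Resampling at the protein level, rather than at the level of individual residues, keeps each protein's internal composition intact, which matters when a few proteins dominate a small proteome such as this one.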

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski