Amino acid dipepetide frequency for Methanobacterium virus Drs3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.161AlaAla: 5.161 ± 1.558
0.287AlaCys: 0.287 ± 0.165
3.058AlaAsp: 3.058 ± 0.482
4.588AlaGlu: 4.588 ± 0.773
2.867AlaPhe: 2.867 ± 0.404
4.97AlaGly: 4.97 ± 0.737
0.956AlaHis: 0.956 ± 0.236
4.492AlaIle: 4.492 ± 0.797
3.441AlaLys: 3.441 ± 0.598
6.021AlaLeu: 6.021 ± 0.918
2.103AlaMet: 2.103 ± 0.583
2.963AlaAsn: 2.963 ± 0.462
2.485AlaPro: 2.485 ± 0.521
1.434AlaGln: 1.434 ± 0.323
3.441AlaArg: 3.441 ± 0.585
3.536AlaSer: 3.536 ± 0.802
3.441AlaThr: 3.441 ± 0.781
4.205AlaVal: 4.205 ± 1.232
0.86AlaTrp: 0.86 ± 0.463
2.581AlaTyr: 2.581 ± 0.611
0.0AlaXaa: 0.0 ± 0.0
Cys
0.287CysAla: 0.287 ± 0.214
0.191CysCys: 0.191 ± 0.136
0.86CysAsp: 0.86 ± 0.296
0.86CysGlu: 0.86 ± 0.227
0.0CysPhe: 0.0 ± 0.0
0.573CysGly: 0.573 ± 0.235
0.191CysHis: 0.191 ± 0.132
0.573CysIle: 0.573 ± 0.207
0.478CysLys: 0.478 ± 0.182
0.765CysLeu: 0.765 ± 0.356
0.287CysMet: 0.287 ± 0.148
0.191CysAsn: 0.191 ± 0.126
0.382CysPro: 0.382 ± 0.195
0.287CysGln: 0.287 ± 0.168
0.191CysArg: 0.191 ± 0.125
0.478CysSer: 0.478 ± 0.178
1.242CysThr: 1.242 ± 0.337
0.669CysVal: 0.669 ± 0.186
0.287CysTrp: 0.287 ± 0.143
0.191CysTyr: 0.191 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
3.536AspAla: 3.536 ± 0.452
0.765AspCys: 0.765 ± 0.21
3.345AspAsp: 3.345 ± 0.609
4.396AspGlu: 4.396 ± 0.611
2.485AspPhe: 2.485 ± 0.429
5.639AspGly: 5.639 ± 0.842
0.573AspHis: 0.573 ± 0.26
4.014AspIle: 4.014 ± 0.667
4.11AspLys: 4.11 ± 0.717
5.352AspLeu: 5.352 ± 0.594
1.911AspMet: 1.911 ± 0.466
2.485AspAsn: 2.485 ± 0.537
2.772AspPro: 2.772 ± 0.784
1.147AspGln: 1.147 ± 0.318
2.389AspArg: 2.389 ± 0.37
3.154AspSer: 3.154 ± 0.511
3.441AspThr: 3.441 ± 0.462
2.198AspVal: 2.198 ± 0.452
1.051AspTrp: 1.051 ± 0.238
2.772AspTyr: 2.772 ± 0.674
0.0AspXaa: 0.0 ± 0.0
Glu
5.352GluAla: 5.352 ± 0.739
1.051GluCys: 1.051 ± 0.347
4.396GluAsp: 4.396 ± 0.792
6.404GluGlu: 6.404 ± 1.288
3.727GluPhe: 3.727 ± 0.671
5.734GluGly: 5.734 ± 0.866
0.86GluHis: 0.86 ± 0.202
6.499GluIle: 6.499 ± 1.025
6.499GluLys: 6.499 ± 1.051
6.977GluLeu: 6.977 ± 0.753
2.485GluMet: 2.485 ± 0.465
3.441GluAsn: 3.441 ± 0.766
2.772GluPro: 2.772 ± 0.625
2.198GluGln: 2.198 ± 0.507
3.919GluArg: 3.919 ± 0.638
3.632GluSer: 3.632 ± 0.655
4.492GluThr: 4.492 ± 0.713
4.014GluVal: 4.014 ± 0.619
1.529GluTrp: 1.529 ± 0.386
3.727GluTyr: 3.727 ± 0.893
0.0GluXaa: 0.0 ± 0.0
Phe
2.198PheAla: 2.198 ± 0.4
0.0PheCys: 0.0 ± 0.0
3.154PheAsp: 3.154 ± 0.418
3.536PheGlu: 3.536 ± 0.685
1.051PhePhe: 1.051 ± 0.413
2.103PheGly: 2.103 ± 0.411
0.382PheHis: 0.382 ± 0.186
2.294PheIle: 2.294 ± 0.461
2.485PheLys: 2.485 ± 0.511
3.632PheLeu: 3.632 ± 0.506
1.338PheMet: 1.338 ± 0.362
2.198PheAsn: 2.198 ± 0.593
1.816PhePro: 1.816 ± 0.48
1.434PheGln: 1.434 ± 0.371
1.816PheArg: 1.816 ± 0.587
2.676PheSer: 2.676 ± 0.53
2.963PheThr: 2.963 ± 0.405
1.529PheVal: 1.529 ± 0.331
0.287PheTrp: 0.287 ± 0.164
1.338PheTyr: 1.338 ± 0.374
0.0PheXaa: 0.0 ± 0.0
Gly
4.97GlyAla: 4.97 ± 0.91
0.573GlyCys: 0.573 ± 0.245
4.396GlyAsp: 4.396 ± 0.585
5.734GlyGlu: 5.734 ± 0.754
3.727GlyPhe: 3.727 ± 0.648
5.065GlyGly: 5.065 ± 0.644
1.051GlyHis: 1.051 ± 0.275
4.97GlyIle: 4.97 ± 0.789
6.212GlyLys: 6.212 ± 0.593
6.881GlyLeu: 6.881 ± 0.933
2.389GlyMet: 2.389 ± 0.567
3.919GlyAsn: 3.919 ± 0.576
1.147GlyPro: 1.147 ± 0.36
2.103GlyGln: 2.103 ± 0.309
3.727GlyArg: 3.727 ± 0.622
4.874GlySer: 4.874 ± 0.749
4.014GlyThr: 4.014 ± 0.456
4.683GlyVal: 4.683 ± 0.87
1.242GlyTrp: 1.242 ± 0.36
3.25GlyTyr: 3.25 ± 0.555
0.0GlyXaa: 0.0 ± 0.0
His
0.86HisAla: 0.86 ± 0.326
0.287HisCys: 0.287 ± 0.159
0.956HisAsp: 0.956 ± 0.261
1.242HisGlu: 1.242 ± 0.358
0.765HisPhe: 0.765 ± 0.245
1.529HisGly: 1.529 ± 0.454
0.287HisHis: 0.287 ± 0.156
1.338HisIle: 1.338 ± 0.399
1.338HisLys: 1.338 ± 0.329
1.147HisLeu: 1.147 ± 0.252
0.287HisMet: 0.287 ± 0.16
0.382HisAsn: 0.382 ± 0.272
0.573HisPro: 0.573 ± 0.267
0.573HisGln: 0.573 ± 0.31
0.765HisArg: 0.765 ± 0.279
0.86HisSer: 0.86 ± 0.249
0.86HisThr: 0.86 ± 0.24
0.765HisVal: 0.765 ± 0.286
0.096HisTrp: 0.096 ± 0.084
0.86HisTyr: 0.86 ± 0.24
0.0HisXaa: 0.0 ± 0.0
Ile
4.588IleAla: 4.588 ± 0.868
0.573IleCys: 0.573 ± 0.227
2.963IleAsp: 2.963 ± 0.731
6.021IleGlu: 6.021 ± 1.193
1.434IlePhe: 1.434 ± 0.353
4.683IleGly: 4.683 ± 1.002
0.765IleHis: 0.765 ± 0.195
4.11IleIle: 4.11 ± 0.68
6.212IleLys: 6.212 ± 0.649
6.021IleLeu: 6.021 ± 0.726
1.051IleMet: 1.051 ± 0.309
4.874IleAsn: 4.874 ± 0.619
3.632IlePro: 3.632 ± 0.523
2.007IleGln: 2.007 ± 0.459
2.485IleArg: 2.485 ± 0.465
4.396IleSer: 4.396 ± 0.681
4.588IleThr: 4.588 ± 0.806
3.25IleVal: 3.25 ± 0.46
0.669IleTrp: 0.669 ± 0.225
2.485IleTyr: 2.485 ± 0.381
0.0IleXaa: 0.0 ± 0.0
Lys
5.161LysAla: 5.161 ± 0.819
0.191LysCys: 0.191 ± 0.146
3.345LysAsp: 3.345 ± 0.624
6.212LysGlu: 6.212 ± 1.226
1.529LysPhe: 1.529 ± 0.258
4.11LysGly: 4.11 ± 0.613
1.242LysHis: 1.242 ± 0.28
5.543LysIle: 5.543 ± 0.894
6.499LysLys: 6.499 ± 1.068
5.543LysLeu: 5.543 ± 0.652
1.338LysMet: 1.338 ± 0.308
4.301LysAsn: 4.301 ± 0.879
2.007LysPro: 2.007 ± 0.629
3.154LysGln: 3.154 ± 0.5
4.205LysArg: 4.205 ± 0.67
5.734LysSer: 5.734 ± 1.002
5.257LysThr: 5.257 ± 0.781
6.595LysVal: 6.595 ± 0.888
1.434LysTrp: 1.434 ± 0.361
2.772LysTyr: 2.772 ± 0.697
0.0LysXaa: 0.0 ± 0.0
Leu
4.11LeuAla: 4.11 ± 0.8
1.051LeuCys: 1.051 ± 0.32
5.257LeuAsp: 5.257 ± 0.717
6.499LeuGlu: 6.499 ± 0.68
3.632LeuPhe: 3.632 ± 0.612
6.117LeuGly: 6.117 ± 1.013
1.625LeuHis: 1.625 ± 0.456
6.595LeuIle: 6.595 ± 0.964
8.602LeuLys: 8.602 ± 1.241
6.786LeuLeu: 6.786 ± 0.882
2.294LeuMet: 2.294 ± 0.515
4.779LeuAsn: 4.779 ± 0.631
3.632LeuPro: 3.632 ± 0.496
2.007LeuGln: 2.007 ± 0.277
4.11LeuArg: 4.11 ± 0.66
5.926LeuSer: 5.926 ± 0.621
4.97LeuThr: 4.97 ± 0.832
4.11LeuVal: 4.11 ± 0.387
0.86LeuTrp: 0.86 ± 0.392
2.867LeuTyr: 2.867 ± 0.388
0.0LeuXaa: 0.0 ± 0.0
Met
2.581MetAla: 2.581 ± 0.46
0.191MetCys: 0.191 ± 0.128
1.72MetAsp: 1.72 ± 0.42
2.103MetGlu: 2.103 ± 0.448
0.669MetPhe: 0.669 ± 0.216
2.198MetGly: 2.198 ± 0.511
0.191MetHis: 0.191 ± 0.145
1.625MetIle: 1.625 ± 0.38
1.72MetLys: 1.72 ± 0.443
1.338MetLeu: 1.338 ± 0.399
0.669MetMet: 0.669 ± 0.301
1.147MetAsn: 1.147 ± 0.248
1.147MetPro: 1.147 ± 0.318
0.669MetGln: 0.669 ± 0.205
0.956MetArg: 0.956 ± 0.322
1.625MetSer: 1.625 ± 0.512
1.338MetThr: 1.338 ± 0.337
2.007MetVal: 2.007 ± 0.394
0.382MetTrp: 0.382 ± 0.182
0.573MetTyr: 0.573 ± 0.317
0.0MetXaa: 0.0 ± 0.0
Asn
2.963AsnAla: 2.963 ± 0.963
0.669AsnCys: 0.669 ± 0.299
2.772AsnAsp: 2.772 ± 0.468
3.919AsnGlu: 3.919 ± 0.814
2.103AsnPhe: 2.103 ± 0.459
5.065AsnGly: 5.065 ± 0.758
0.86AsnHis: 0.86 ± 0.265
3.25AsnIle: 3.25 ± 0.574
4.492AsnLys: 4.492 ± 0.613
4.014AsnLeu: 4.014 ± 0.791
1.529AsnMet: 1.529 ± 0.334
3.154AsnAsn: 3.154 ± 0.777
3.536AsnPro: 3.536 ± 0.447
1.625AsnGln: 1.625 ± 0.321
2.103AsnArg: 2.103 ± 0.441
2.294AsnSer: 2.294 ± 0.394
4.683AsnThr: 4.683 ± 0.598
2.485AsnVal: 2.485 ± 0.633
0.669AsnTrp: 0.669 ± 0.197
2.007AsnTyr: 2.007 ± 0.455
0.0AsnXaa: 0.0 ± 0.0
Pro
2.007ProAla: 2.007 ± 0.473
0.287ProCys: 0.287 ± 0.14
2.867ProAsp: 2.867 ± 0.463
4.97ProGlu: 4.97 ± 0.691
1.434ProPhe: 1.434 ± 0.452
2.963ProGly: 2.963 ± 0.59
0.573ProHis: 0.573 ± 0.241
2.103ProIle: 2.103 ± 0.521
1.242ProLys: 1.242 ± 0.35
3.919ProLeu: 3.919 ± 0.752
0.382ProMet: 0.382 ± 0.194
1.625ProAsn: 1.625 ± 0.355
1.434ProPro: 1.434 ± 0.415
0.86ProGln: 0.86 ± 0.217
2.103ProArg: 2.103 ± 0.31
2.963ProSer: 2.963 ± 0.605
2.103ProThr: 2.103 ± 0.505
3.441ProVal: 3.441 ± 0.599
0.191ProTrp: 0.191 ± 0.124
1.911ProTyr: 1.911 ± 0.332
0.0ProXaa: 0.0 ± 0.0
Gln
2.485GlnAla: 2.485 ± 0.517
0.096GlnCys: 0.096 ± 0.101
1.242GlnAsp: 1.242 ± 0.335
2.007GlnGlu: 2.007 ± 0.534
1.529GlnPhe: 1.529 ± 0.357
1.911GlnGly: 1.911 ± 0.348
0.669GlnHis: 0.669 ± 0.204
2.294GlnIle: 2.294 ± 0.518
2.485GlnLys: 2.485 ± 0.388
3.632GlnLeu: 3.632 ± 0.685
0.573GlnMet: 0.573 ± 0.264
1.72GlnAsn: 1.72 ± 0.44
0.287GlnPro: 0.287 ± 0.167
1.529GlnGln: 1.529 ± 0.382
1.625GlnArg: 1.625 ± 0.474
1.338GlnSer: 1.338 ± 0.289
1.338GlnThr: 1.338 ± 0.381
2.389GlnVal: 2.389 ± 0.397
0.382GlnTrp: 0.382 ± 0.168
1.338GlnTyr: 1.338 ± 0.457
0.0GlnXaa: 0.0 ± 0.0
Arg
2.676ArgAla: 2.676 ± 0.541
0.287ArgCys: 0.287 ± 0.158
3.058ArgAsp: 3.058 ± 0.663
4.396ArgGlu: 4.396 ± 0.711
1.816ArgPhe: 1.816 ± 0.321
3.345ArgGly: 3.345 ± 0.583
1.051ArgHis: 1.051 ± 0.262
2.867ArgIle: 2.867 ± 0.554
3.441ArgLys: 3.441 ± 0.699
3.632ArgLeu: 3.632 ± 0.491
1.147ArgMet: 1.147 ± 0.359
2.389ArgAsn: 2.389 ± 0.5
2.389ArgPro: 2.389 ± 0.465
1.338ArgGln: 1.338 ± 0.323
1.816ArgArg: 1.816 ± 0.398
2.581ArgSer: 2.581 ± 0.565
2.389ArgThr: 2.389 ± 0.406
4.11ArgVal: 4.11 ± 0.719
0.096ArgTrp: 0.096 ± 0.084
1.72ArgTyr: 1.72 ± 0.379
0.0ArgXaa: 0.0 ± 0.0
Ser
3.536SerAla: 3.536 ± 0.686
0.478SerCys: 0.478 ± 0.188
2.485SerAsp: 2.485 ± 0.53
5.352SerGlu: 5.352 ± 0.6
2.963SerPhe: 2.963 ± 0.785
5.352SerGly: 5.352 ± 0.987
1.338SerHis: 1.338 ± 0.298
4.014SerIle: 4.014 ± 0.584
4.396SerLys: 4.396 ± 0.9
4.205SerLeu: 4.205 ± 0.508
1.434SerMet: 1.434 ± 0.328
2.867SerAsn: 2.867 ± 0.485
2.198SerPro: 2.198 ± 0.503
2.485SerGln: 2.485 ± 0.557
3.345SerArg: 3.345 ± 0.521
3.441SerSer: 3.441 ± 0.506
3.441SerThr: 3.441 ± 0.646
5.065SerVal: 5.065 ± 0.72
1.147SerTrp: 1.147 ± 0.324
1.816SerTyr: 1.816 ± 0.381
0.0SerXaa: 0.0 ± 0.0
Thr
4.301ThrAla: 4.301 ± 0.849
0.956ThrCys: 0.956 ± 0.36
2.963ThrAsp: 2.963 ± 0.406
3.727ThrGlu: 3.727 ± 0.634
1.816ThrPhe: 1.816 ± 0.31
5.065ThrGly: 5.065 ± 0.736
1.051ThrHis: 1.051 ± 0.233
3.823ThrIle: 3.823 ± 0.48
4.301ThrLys: 4.301 ± 0.672
5.257ThrLeu: 5.257 ± 0.808
1.242ThrMet: 1.242 ± 0.306
2.963ThrAsn: 2.963 ± 0.529
2.963ThrPro: 2.963 ± 0.562
1.72ThrGln: 1.72 ± 0.414
2.581ThrArg: 2.581 ± 0.401
3.727ThrSer: 3.727 ± 0.755
4.492ThrThr: 4.492 ± 0.879
5.161ThrVal: 5.161 ± 0.767
0.86ThrTrp: 0.86 ± 0.248
1.434ThrTyr: 1.434 ± 0.382
0.0ThrXaa: 0.0 ± 0.0
Val
4.014ValAla: 4.014 ± 0.565
0.382ValCys: 0.382 ± 0.196
4.014ValAsp: 4.014 ± 0.663
4.683ValGlu: 4.683 ± 0.601
2.485ValPhe: 2.485 ± 0.564
4.11ValGly: 4.11 ± 0.566
0.86ValHis: 0.86 ± 0.316
3.345ValIle: 3.345 ± 0.533
5.639ValLys: 5.639 ± 0.743
6.117ValLeu: 6.117 ± 0.896
1.625ValMet: 1.625 ± 0.349
3.632ValAsn: 3.632 ± 0.645
2.581ValPro: 2.581 ± 0.558
1.911ValGln: 1.911 ± 0.498
2.485ValArg: 2.485 ± 0.414
5.257ValSer: 5.257 ± 1.038
3.058ValThr: 3.058 ± 0.716
3.823ValVal: 3.823 ± 0.587
1.147ValTrp: 1.147 ± 0.354
2.389ValTyr: 2.389 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
0.573TrpAla: 0.573 ± 0.217
0.0TrpCys: 0.0 ± 0.0
0.765TrpAsp: 0.765 ± 0.225
1.051TrpGlu: 1.051 ± 0.408
0.765TrpPhe: 0.765 ± 0.217
0.86TrpGly: 0.86 ± 0.252
0.191TrpHis: 0.191 ± 0.14
0.765TrpIle: 0.765 ± 0.225
0.669TrpLys: 0.669 ± 0.217
1.529TrpLeu: 1.529 ± 0.374
0.382TrpMet: 0.382 ± 0.177
1.816TrpAsn: 1.816 ± 0.752
0.382TrpPro: 0.382 ± 0.141
0.573TrpGln: 0.573 ± 0.164
0.956TrpArg: 0.956 ± 0.258
0.478TrpSer: 0.478 ± 0.231
0.573TrpThr: 0.573 ± 0.214
1.242TrpVal: 1.242 ± 0.375
0.096TrpTrp: 0.096 ± 0.094
0.191TrpTyr: 0.191 ± 0.207
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.529TyrAla: 1.529 ± 0.373
0.669TyrCys: 0.669 ± 0.342
3.919TyrAsp: 3.919 ± 0.522
2.007TyrGlu: 2.007 ± 0.476
1.434TyrPhe: 1.434 ± 0.385
3.632TyrGly: 3.632 ± 0.68
1.051TyrHis: 1.051 ± 0.248
2.294TyrIle: 2.294 ± 0.517
1.911TyrLys: 1.911 ± 0.399
3.058TyrLeu: 3.058 ± 0.493
0.287TyrMet: 0.287 ± 0.159
3.345TyrAsn: 3.345 ± 0.621
1.242TyrPro: 1.242 ± 0.296
1.816TyrGln: 1.816 ± 0.399
1.529TyrArg: 1.529 ± 0.519
2.389TyrSer: 2.389 ± 0.58
1.72TyrThr: 1.72 ± 0.383
1.816TyrVal: 1.816 ± 0.606
0.573TyrTrp: 0.573 ± 0.224
1.911TyrTyr: 1.911 ± 0.459
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (10464 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski