Amino acid dipepetide frequency for Skermania phage SPI1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.583AlaAla: 20.583 ± 1.856
1.105AlaCys: 1.105 ± 0.379
10.582AlaAsp: 10.582 ± 0.803
7.035AlaGlu: 7.035 ± 0.548
2.616AlaPhe: 2.616 ± 0.471
8.954AlaGly: 8.954 ± 0.798
2.733AlaHis: 2.733 ± 0.523
4.768AlaIle: 4.768 ± 0.667
2.442AlaLys: 2.442 ± 0.465
12.268AlaLeu: 12.268 ± 0.968
3.023AlaMet: 3.023 ± 0.407
2.035AlaAsn: 2.035 ± 0.378
7.093AlaPro: 7.093 ± 0.725
3.896AlaGln: 3.896 ± 0.973
10.001AlaArg: 10.001 ± 1.008
7.21AlaSer: 7.21 ± 0.609
8.605AlaThr: 8.605 ± 0.583
10.175AlaVal: 10.175 ± 0.893
2.616AlaTrp: 2.616 ± 0.472
2.965AlaTyr: 2.965 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
1.454CysAla: 1.454 ± 0.41
0.058CysCys: 0.058 ± 0.06
0.814CysAsp: 0.814 ± 0.213
0.581CysGlu: 0.581 ± 0.178
0.116CysPhe: 0.116 ± 0.086
0.872CysGly: 0.872 ± 0.267
0.233CysHis: 0.233 ± 0.115
0.058CysIle: 0.058 ± 0.07
0.058CysLys: 0.058 ± 0.051
0.64CysLeu: 0.64 ± 0.207
0.116CysMet: 0.116 ± 0.086
0.116CysAsn: 0.116 ± 0.085
0.523CysPro: 0.523 ± 0.217
0.116CysGln: 0.116 ± 0.09
0.64CysArg: 0.64 ± 0.261
0.988CysSer: 0.988 ± 0.317
0.174CysThr: 0.174 ± 0.096
1.221CysVal: 1.221 ± 0.344
0.233CysTrp: 0.233 ± 0.11
0.174CysTyr: 0.174 ± 0.109
0.0CysXaa: 0.0 ± 0.0
Asp
10.117AspAla: 10.117 ± 0.896
0.465AspCys: 0.465 ± 0.182
6.047AspAsp: 6.047 ± 1.034
3.896AspGlu: 3.896 ± 0.655
1.047AspPhe: 1.047 ± 0.211
6.338AspGly: 6.338 ± 0.548
1.221AspHis: 1.221 ± 0.259
3.896AspIle: 3.896 ± 0.452
0.93AspLys: 0.93 ± 0.277
7.442AspLeu: 7.442 ± 0.868
1.454AspMet: 1.454 ± 0.325
0.581AspAsn: 0.581 ± 0.189
7.093AspPro: 7.093 ± 0.997
1.861AspGln: 1.861 ± 0.273
5.931AspArg: 5.931 ± 0.855
3.082AspSer: 3.082 ± 0.4
5.175AspThr: 5.175 ± 0.432
5.582AspVal: 5.582 ± 0.554
1.744AspTrp: 1.744 ± 0.349
2.093AspTyr: 2.093 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
5.058GluAla: 5.058 ± 0.544
0.64GluCys: 0.64 ± 0.189
3.256GluAsp: 3.256 ± 0.409
1.512GluGlu: 1.512 ± 0.3
0.698GluPhe: 0.698 ± 0.197
4.244GluGly: 4.244 ± 0.455
1.395GluHis: 1.395 ± 0.273
3.082GluIle: 3.082 ± 0.425
1.047GluLys: 1.047 ± 0.231
5.058GluLeu: 5.058 ± 0.75
1.454GluMet: 1.454 ± 0.269
1.047GluAsn: 1.047 ± 0.249
2.791GluPro: 2.791 ± 0.479
1.57GluGln: 1.57 ± 0.296
4.012GluArg: 4.012 ± 0.492
3.372GluSer: 3.372 ± 0.53
3.605GluThr: 3.605 ± 0.45
3.663GluVal: 3.663 ± 0.46
1.279GluTrp: 1.279 ± 0.315
1.279GluTyr: 1.279 ± 0.324
0.0GluXaa: 0.0 ± 0.0
Phe
2.849PheAla: 2.849 ± 0.428
0.174PheCys: 0.174 ± 0.095
1.628PheAsp: 1.628 ± 0.38
0.93PheGlu: 0.93 ± 0.269
0.465PhePhe: 0.465 ± 0.17
1.512PheGly: 1.512 ± 0.28
0.116PheHis: 0.116 ± 0.078
0.581PheIle: 0.581 ± 0.144
0.058PheLys: 0.058 ± 0.058
1.454PheLeu: 1.454 ± 0.279
0.291PheMet: 0.291 ± 0.148
0.291PheAsn: 0.291 ± 0.101
0.872PhePro: 0.872 ± 0.208
0.523PheGln: 0.523 ± 0.156
1.744PheArg: 1.744 ± 0.358
1.395PheSer: 1.395 ± 0.308
1.686PheThr: 1.686 ± 0.332
1.337PheVal: 1.337 ± 0.338
0.291PheTrp: 0.291 ± 0.119
0.233PheTyr: 0.233 ± 0.129
0.0PheXaa: 0.0 ± 0.0
Gly
9.303GlyAla: 9.303 ± 0.638
0.93GlyCys: 0.93 ± 0.231
6.105GlyAsp: 6.105 ± 0.627
4.128GlyGlu: 4.128 ± 0.458
1.744GlyPhe: 1.744 ± 0.369
7.559GlyGly: 7.559 ± 1.123
1.628GlyHis: 1.628 ± 0.335
4.651GlyIle: 4.651 ± 0.578
2.209GlyLys: 2.209 ± 0.407
6.512GlyLeu: 6.512 ± 0.653
1.802GlyMet: 1.802 ± 0.375
1.512GlyAsn: 1.512 ± 0.261
4.244GlyPro: 4.244 ± 0.571
1.919GlyGln: 1.919 ± 0.387
5.872GlyArg: 5.872 ± 0.73
5.64GlySer: 5.64 ± 0.671
7.326GlyThr: 7.326 ± 0.68
6.57GlyVal: 6.57 ± 0.723
1.977GlyTrp: 1.977 ± 0.37
3.198GlyTyr: 3.198 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
2.733HisAla: 2.733 ± 0.491
0.0HisCys: 0.0 ± 0.0
1.047HisAsp: 1.047 ± 0.25
0.64HisGlu: 0.64 ± 0.204
0.349HisPhe: 0.349 ± 0.129
1.628HisGly: 1.628 ± 0.285
0.407HisHis: 0.407 ± 0.143
0.64HisIle: 0.64 ± 0.206
0.291HisLys: 0.291 ± 0.137
1.686HisLeu: 1.686 ± 0.327
0.349HisMet: 0.349 ± 0.134
0.116HisAsn: 0.116 ± 0.074
1.686HisPro: 1.686 ± 0.346
0.349HisGln: 0.349 ± 0.136
1.221HisArg: 1.221 ± 0.36
0.523HisSer: 0.523 ± 0.165
1.105HisThr: 1.105 ± 0.284
2.035HisVal: 2.035 ± 0.331
0.465HisTrp: 0.465 ± 0.155
0.349HisTyr: 0.349 ± 0.113
0.0HisXaa: 0.0 ± 0.0
Ile
6.803IleAla: 6.803 ± 0.648
0.116IleCys: 0.116 ± 0.081
4.128IleAsp: 4.128 ± 0.528
2.791IleGlu: 2.791 ± 0.375
0.581IlePhe: 0.581 ± 0.257
3.605IleGly: 3.605 ± 0.765
0.64IleHis: 0.64 ± 0.193
1.163IleIle: 1.163 ± 0.291
0.93IleLys: 0.93 ± 0.287
3.14IleLeu: 3.14 ± 0.376
0.523IleMet: 0.523 ± 0.207
0.407IleAsn: 0.407 ± 0.151
2.442IlePro: 2.442 ± 0.373
1.221IleGln: 1.221 ± 0.381
3.605IleArg: 3.605 ± 0.499
2.384IleSer: 2.384 ± 0.387
3.023IleThr: 3.023 ± 0.383
3.721IleVal: 3.721 ± 0.632
0.407IleTrp: 0.407 ± 0.156
0.988IleTyr: 0.988 ± 0.261
0.0IleXaa: 0.0 ± 0.0
Lys
2.268LysAla: 2.268 ± 0.614
0.174LysCys: 0.174 ± 0.088
1.047LysAsp: 1.047 ± 0.307
0.407LysGlu: 0.407 ± 0.23
0.233LysPhe: 0.233 ± 0.17
0.93LysGly: 0.93 ± 0.261
0.116LysHis: 0.116 ± 0.079
0.93LysIle: 0.93 ± 0.259
0.116LysLys: 0.116 ± 0.104
1.395LysLeu: 1.395 ± 0.386
0.058LysMet: 0.058 ± 0.052
0.349LysAsn: 0.349 ± 0.128
0.814LysPro: 0.814 ± 0.202
0.349LysGln: 0.349 ± 0.159
2.268LysArg: 2.268 ± 0.383
1.221LysSer: 1.221 ± 0.256
1.047LysThr: 1.047 ± 0.341
1.628LysVal: 1.628 ± 0.289
0.407LysTrp: 0.407 ± 0.155
0.756LysTyr: 0.756 ± 0.204
0.0LysXaa: 0.0 ± 0.0
Leu
10.524LeuAla: 10.524 ± 0.968
0.814LeuCys: 0.814 ± 0.227
6.221LeuAsp: 6.221 ± 0.647
4.651LeuGlu: 4.651 ± 0.594
1.744LeuPhe: 1.744 ± 0.301
6.977LeuGly: 6.977 ± 0.95
2.151LeuHis: 2.151 ± 0.335
4.186LeuIle: 4.186 ± 0.686
1.512LeuLys: 1.512 ± 0.283
6.512LeuLeu: 6.512 ± 0.663
1.454LeuMet: 1.454 ± 0.324
0.872LeuAsn: 0.872 ± 0.21
5.814LeuPro: 5.814 ± 0.481
1.919LeuGln: 1.919 ± 0.4
8.954LeuArg: 8.954 ± 0.676
4.942LeuSer: 4.942 ± 0.553
7.849LeuThr: 7.849 ± 0.606
6.57LeuVal: 6.57 ± 0.518
1.512LeuTrp: 1.512 ± 0.281
1.802LeuTyr: 1.802 ± 0.249
0.0LeuXaa: 0.0 ± 0.0
Met
2.616MetAla: 2.616 ± 0.336
0.174MetCys: 0.174 ± 0.097
1.221MetAsp: 1.221 ± 0.21
0.581MetGlu: 0.581 ± 0.194
0.233MetPhe: 0.233 ± 0.107
0.988MetGly: 0.988 ± 0.256
0.349MetHis: 0.349 ± 0.157
1.163MetIle: 1.163 ± 0.287
0.174MetLys: 0.174 ± 0.094
1.512MetLeu: 1.512 ± 0.279
0.349MetMet: 0.349 ± 0.153
0.465MetAsn: 0.465 ± 0.173
1.279MetPro: 1.279 ± 0.283
0.174MetGln: 0.174 ± 0.089
1.802MetArg: 1.802 ± 0.314
2.093MetSer: 2.093 ± 0.37
2.151MetThr: 2.151 ± 0.349
1.047MetVal: 1.047 ± 0.223
0.349MetTrp: 0.349 ± 0.144
0.407MetTyr: 0.407 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
2.558AsnAla: 2.558 ± 0.486
0.291AsnCys: 0.291 ± 0.143
0.93AsnAsp: 0.93 ± 0.275
0.174AsnGlu: 0.174 ± 0.093
0.233AsnPhe: 0.233 ± 0.095
1.744AsnGly: 1.744 ± 0.345
0.407AsnHis: 0.407 ± 0.135
0.465AsnIle: 0.465 ± 0.174
0.291AsnLys: 0.291 ± 0.107
1.628AsnLeu: 1.628 ± 0.368
0.174AsnMet: 0.174 ± 0.103
0.233AsnAsn: 0.233 ± 0.097
1.163AsnPro: 1.163 ± 0.293
0.407AsnGln: 0.407 ± 0.148
0.93AsnArg: 0.93 ± 0.223
0.64AsnSer: 0.64 ± 0.157
1.105AsnThr: 1.105 ± 0.214
1.047AsnVal: 1.047 ± 0.29
0.291AsnTrp: 0.291 ± 0.116
0.233AsnTyr: 0.233 ± 0.104
0.0AsnXaa: 0.0 ± 0.0
Pro
7.152ProAla: 7.152 ± 0.576
0.233ProCys: 0.233 ± 0.117
6.163ProAsp: 6.163 ± 0.798
4.826ProGlu: 4.826 ± 0.595
0.872ProPhe: 0.872 ± 0.205
5.931ProGly: 5.931 ± 0.473
0.872ProHis: 0.872 ± 0.267
2.907ProIle: 2.907 ± 0.524
0.872ProLys: 0.872 ± 0.2
4.477ProLeu: 4.477 ± 0.42
0.814ProMet: 0.814 ± 0.196
1.163ProAsn: 1.163 ± 0.26
4.477ProPro: 4.477 ± 0.582
1.163ProGln: 1.163 ± 0.232
3.837ProArg: 3.837 ± 0.514
3.837ProSer: 3.837 ± 0.488
4.477ProThr: 4.477 ± 0.581
5.64ProVal: 5.64 ± 0.65
1.57ProTrp: 1.57 ± 0.359
1.221ProTyr: 1.221 ± 0.339
0.0ProXaa: 0.0 ± 0.0
Gln
3.314GlnAla: 3.314 ± 0.804
0.116GlnCys: 0.116 ± 0.083
1.395GlnAsp: 1.395 ± 0.288
1.105GlnGlu: 1.105 ± 0.308
0.291GlnPhe: 0.291 ± 0.113
1.977GlnGly: 1.977 ± 0.301
0.233GlnHis: 0.233 ± 0.122
1.686GlnIle: 1.686 ± 0.315
0.64GlnLys: 0.64 ± 0.24
3.082GlnLeu: 3.082 ± 0.721
0.523GlnMet: 0.523 ± 0.145
0.291GlnAsn: 0.291 ± 0.142
1.221GlnPro: 1.221 ± 0.278
0.814GlnGln: 0.814 ± 0.222
2.558GlnArg: 2.558 ± 0.364
0.988GlnSer: 0.988 ± 0.264
1.686GlnThr: 1.686 ± 0.398
1.861GlnVal: 1.861 ± 0.225
0.698GlnTrp: 0.698 ± 0.168
0.756GlnTyr: 0.756 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
10.698ArgAla: 10.698 ± 1.088
0.988ArgCys: 0.988 ± 0.29
6.221ArgAsp: 6.221 ± 0.697
4.593ArgGlu: 4.593 ± 0.444
0.988ArgPhe: 0.988 ± 0.297
6.512ArgGly: 6.512 ± 0.631
1.395ArgHis: 1.395 ± 0.33
3.14ArgIle: 3.14 ± 0.446
1.977ArgLys: 1.977 ± 0.435
7.5ArgLeu: 7.5 ± 0.627
2.151ArgMet: 2.151 ± 0.463
1.047ArgAsn: 1.047 ± 0.219
4.477ArgPro: 4.477 ± 0.665
2.326ArgGln: 2.326 ± 0.357
6.686ArgArg: 6.686 ± 0.816
4.303ArgSer: 4.303 ± 0.607
5.582ArgThr: 5.582 ± 0.705
7.5ArgVal: 7.5 ± 0.705
1.802ArgTrp: 1.802 ± 0.389
2.5ArgTyr: 2.5 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
8.082SerAla: 8.082 ± 0.768
0.64SerCys: 0.64 ± 0.202
4.303SerAsp: 4.303 ± 0.523
2.733SerGlu: 2.733 ± 0.373
1.454SerPhe: 1.454 ± 0.346
7.152SerGly: 7.152 ± 1.16
0.698SerHis: 0.698 ± 0.16
2.035SerIle: 2.035 ± 0.238
0.872SerLys: 0.872 ± 0.27
4.71SerLeu: 4.71 ± 0.637
1.105SerMet: 1.105 ± 0.311
0.988SerAsn: 0.988 ± 0.247
3.547SerPro: 3.547 ± 0.42
1.163SerGln: 1.163 ± 0.313
4.07SerArg: 4.07 ± 0.554
3.023SerSer: 3.023 ± 0.53
4.593SerThr: 4.593 ± 0.624
5.175SerVal: 5.175 ± 0.585
1.861SerTrp: 1.861 ± 0.41
0.93SerTyr: 0.93 ± 0.253
0.0SerXaa: 0.0 ± 0.0
Thr
8.78ThrAla: 8.78 ± 0.743
0.756ThrCys: 0.756 ± 0.251
4.826ThrAsp: 4.826 ± 0.547
3.023ThrGlu: 3.023 ± 0.365
1.395ThrPhe: 1.395 ± 0.376
7.617ThrGly: 7.617 ± 0.575
0.698ThrHis: 0.698 ± 0.18
2.675ThrIle: 2.675 ± 0.394
0.698ThrLys: 0.698 ± 0.202
6.221ThrLeu: 6.221 ± 0.523
1.454ThrMet: 1.454 ± 0.258
1.047ThrAsn: 1.047 ± 0.251
5.814ThrPro: 5.814 ± 0.684
1.744ThrGln: 1.744 ± 0.317
6.396ThrArg: 6.396 ± 0.756
4.303ThrSer: 4.303 ± 0.491
5.291ThrThr: 5.291 ± 0.847
7.268ThrVal: 7.268 ± 0.845
1.802ThrTrp: 1.802 ± 0.758
2.151ThrTyr: 2.151 ± 0.459
0.0ThrXaa: 0.0 ± 0.0
Val
10.001ValAla: 10.001 ± 0.693
1.163ValCys: 1.163 ± 0.364
7.093ValAsp: 7.093 ± 0.639
4.244ValGlu: 4.244 ± 0.52
1.919ValPhe: 1.919 ± 0.45
7.384ValGly: 7.384 ± 0.735
1.744ValHis: 1.744 ± 0.334
3.256ValIle: 3.256 ± 0.46
0.988ValLys: 0.988 ± 0.227
7.21ValLeu: 7.21 ± 0.644
1.163ValMet: 1.163 ± 0.239
1.57ValAsn: 1.57 ± 0.359
4.244ValPro: 4.244 ± 0.465
2.151ValGln: 2.151 ± 0.37
6.628ValArg: 6.628 ± 0.709
5.931ValSer: 5.931 ± 0.544
6.047ValThr: 6.047 ± 0.587
6.861ValVal: 6.861 ± 0.633
1.744ValTrp: 1.744 ± 0.366
1.686ValTyr: 1.686 ± 0.363
0.0ValXaa: 0.0 ± 0.0
Trp
2.558TrpAla: 2.558 ± 0.652
0.291TrpCys: 0.291 ± 0.151
1.163TrpAsp: 1.163 ± 0.309
1.221TrpGlu: 1.221 ± 0.342
0.756TrpPhe: 0.756 ± 0.287
0.93TrpGly: 0.93 ± 0.235
0.349TrpHis: 0.349 ± 0.142
0.523TrpIle: 0.523 ± 0.188
0.116TrpLys: 0.116 ± 0.074
2.151TrpLeu: 2.151 ± 0.353
0.581TrpMet: 0.581 ± 0.21
0.407TrpAsn: 0.407 ± 0.167
1.221TrpPro: 1.221 ± 0.288
0.698TrpGln: 0.698 ± 0.173
2.558TrpArg: 2.558 ± 0.487
1.744TrpSer: 1.744 ± 0.435
2.268TrpThr: 2.268 ± 0.501
1.337TrpVal: 1.337 ± 0.298
0.523TrpTrp: 0.523 ± 0.202
0.523TrpTyr: 0.523 ± 0.16
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.256TyrAla: 3.256 ± 0.48
0.058TyrCys: 0.058 ± 0.059
1.802TyrAsp: 1.802 ± 0.382
0.988TyrGlu: 0.988 ± 0.276
0.698TyrPhe: 0.698 ± 0.238
1.977TyrGly: 1.977 ± 0.336
0.233TyrHis: 0.233 ± 0.147
0.814TyrIle: 0.814 ± 0.215
0.291TyrLys: 0.291 ± 0.112
2.384TyrLeu: 2.384 ± 0.425
0.233TyrMet: 0.233 ± 0.114
0.407TyrAsn: 0.407 ± 0.14
1.686TyrPro: 1.686 ± 0.315
0.872TyrGln: 0.872 ± 0.197
2.675TyrArg: 2.675 ± 0.385
1.512TyrSer: 1.512 ± 0.329
1.163TyrThr: 1.163 ± 0.267
2.791TyrVal: 2.791 ± 0.54
0.407TyrTrp: 0.407 ± 0.151
0.581TyrTyr: 0.581 ± 0.218
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (17200 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski