Amino acid dipeptide frequency for Microbacterium phage Neferthena

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
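The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names (`dipeptide_permille`, `classify`, `to_fraction`), the use of overlapping pairs within each protein, and the normalisation by the total number of pairs are all assumptions, and the database may count or normalise differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption)

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(permille):
    """Label a frequency relative to the uniform 2.5 per mille baseline."""
    if permille > EXPECTED_PERMILLE:
        return "over"
    if permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


# Toy input, not real phage proteins.
freqs = dipeptide_permille(["AAC", "ACX"])
```

With this toy input there are four pairs in total (AA, AC, AC, CX), so `freqs["AC"]` is 500‰. Applied to the table values, `classify(11.926)` labels AlaAla as overrepresented and `classify(0.593)` labels AlaCys as underrepresented relative to the uniform baseline.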

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Values are in ‰ (frequency ± error).

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.926 ± 1.175 | 0.593 ± 0.236 | 5.481 ± 0.672 | 6.0 ± 0.853 | 3.185 ± 0.493 | 8.519 ± 0.879 | 1.333 ± 0.377 | 5.111 ± 0.859 | 5.333 ± 0.734 | 9.704 ± 1.227 | 2.815 ± 0.451 | 3.556 ± 0.654 | 5.111 ± 0.702 | 4.444 ± 0.621 | 6.815 ± 0.813 | 4.741 ± 0.661 | 7.704 ± 0.799 | 7.852 ± 0.687 | 1.778 ± 0.453 | 2.222 ± 0.516 | 0.0 ± 0.0 |
| Cys | 0.296 ± 0.13 | 0.074 ± 0.065 | 0.519 ± 0.22 | 0.37 ± 0.174 | 0.148 ± 0.108 | 0.741 ± 0.198 | 0.222 ± 0.121 | 0.37 ± 0.163 | 0.222 ± 0.128 | 0.222 ± 0.12 | 0.0 ± 0.0 | 0.148 ± 0.121 | 0.815 ± 0.274 | 0.444 ± 0.282 | 0.741 ± 0.205 | 0.148 ± 0.125 | 0.444 ± 0.149 | 0.37 ± 0.181 | 0.222 ± 0.135 | 0.148 ± 0.114 | 0.0 ± 0.0 |
| Asp | 6.37 ± 0.76 | 0.444 ± 0.209 | 4.593 ± 1.154 | 5.037 ± 1.512 | 1.926 ± 0.399 | 6.148 ± 0.809 | 1.111 ± 0.293 | 3.037 ± 0.469 | 2.222 ± 0.46 | 5.852 ± 0.654 | 1.63 ± 0.328 | 1.556 ± 0.43 | 3.926 ± 0.469 | 2.37 ± 0.391 | 4.667 ± 0.594 | 2.593 ± 0.456 | 4.296 ± 0.553 | 3.185 ± 0.479 | 1.481 ± 0.348 | 2.148 ± 0.398 | 0.0 ± 0.0 |
| Glu | 7.111 ± 0.794 | 0.37 ± 0.148 | 5.333 ± 1.327 | 4.667 ± 1.376 | 1.704 ± 0.33 | 4.444 ± 0.573 | 0.815 ± 0.215 | 2.593 ± 0.464 | 2.222 ± 0.375 | 6.444 ± 0.66 | 1.556 ± 0.367 | 1.481 ± 0.368 | 2.37 ± 0.468 | 2.815 ± 0.446 | 3.111 ± 0.526 | 1.852 ± 0.3 | 2.963 ± 0.47 | 4.741 ± 0.64 | 0.963 ± 0.275 | 1.481 ± 0.3 | 0.0 ± 0.0 |
| Phe | 2.444 ± 0.43 | 0.519 ± 0.253 | 1.852 ± 0.348 | 1.481 ± 0.347 | 0.667 ± 0.216 | 3.037 ± 0.569 | 0.667 ± 0.221 | 1.556 ± 0.389 | 1.185 ± 0.262 | 2.296 ± 0.342 | 0.815 ± 0.227 | 1.333 ± 0.312 | 1.63 ± 0.345 | 1.037 ± 0.361 | 2.074 ± 0.345 | 2.444 ± 0.45 | 2.148 ± 0.437 | 1.852 ± 0.302 | 0.37 ± 0.133 | 0.37 ± 0.177 | 0.0 ± 0.0 |
| Gly | 6.815 ± 1.01 | 0.37 ± 0.185 | 4.593 ± 0.572 | 3.852 ± 0.509 | 2.889 ± 0.416 | 7.185 ± 1.196 | 1.63 ± 0.378 | 4.148 ± 1.006 | 5.111 ± 0.687 | 6.519 ± 0.892 | 2.222 ± 0.428 | 2.444 ± 0.515 | 4.0 ± 0.892 | 4.593 ± 0.639 | 4.815 ± 0.617 | 4.889 ± 0.738 | 5.852 ± 0.648 | 5.63 ± 0.705 | 0.963 ± 0.239 | 3.037 ± 0.54 | 0.0 ± 0.0 |
| His | 1.407 ± 0.391 | 0.148 ± 0.143 | 1.185 ± 0.243 | 1.185 ± 0.277 | 0.815 ± 0.258 | 1.481 ± 0.35 | 0.444 ± 0.198 | 0.889 ± 0.256 | 1.259 ± 0.269 | 1.556 ± 0.449 | 0.296 ± 0.123 | 0.444 ± 0.17 | 1.111 ± 0.285 | 0.667 ± 0.234 | 0.741 ± 0.213 | 0.519 ± 0.204 | 1.111 ± 0.343 | 1.111 ± 0.307 | 0.296 ± 0.137 | 0.963 ± 0.266 | 0.0 ± 0.0 |
| Ile | 5.111 ± 0.565 | 0.296 ± 0.141 | 2.444 ± 0.526 | 3.259 ± 0.447 | 1.111 ± 0.293 | 3.852 ± 0.764 | 0.741 ± 0.203 | 3.63 ± 0.683 | 2.296 ± 0.468 | 3.037 ± 0.559 | 0.815 ± 0.253 | 2.222 ± 0.395 | 3.333 ± 0.79 | 2.889 ± 0.842 | 3.333 ± 0.571 | 2.444 ± 0.478 | 3.778 ± 0.694 | 3.556 ± 0.762 | 0.593 ± 0.192 | 1.63 ± 0.313 | 0.0 ± 0.0 |
| Lys | 5.407 ± 0.767 | 0.296 ± 0.149 | 2.963 ± 0.474 | 2.963 ± 0.574 | 1.111 ± 0.259 | 3.63 ± 0.439 | 0.815 ± 0.274 | 1.481 ± 0.262 | 1.852 ± 0.512 | 3.333 ± 0.423 | 1.185 ± 0.256 | 1.185 ± 0.266 | 3.333 ± 0.529 | 1.926 ± 0.346 | 2.815 ± 0.534 | 2.148 ± 0.406 | 3.778 ± 0.505 | 4.148 ± 0.544 | 0.963 ± 0.262 | 0.815 ± 0.266 | 0.0 ± 0.0 |
| Leu | 9.926 ± 0.964 | 0.444 ± 0.172 | 6.074 ± 0.543 | 4.148 ± 0.496 | 2.296 ± 0.476 | 6.0 ± 0.691 | 1.481 ± 0.351 | 5.185 ± 1.22 | 4.296 ± 0.535 | 6.815 ± 0.729 | 0.963 ± 0.304 | 3.778 ± 0.432 | 5.111 ± 0.737 | 2.444 ± 0.526 | 4.593 ± 0.758 | 4.815 ± 0.509 | 5.778 ± 0.709 | 5.778 ± 0.929 | 1.185 ± 0.256 | 1.778 ± 0.3 | 0.0 ± 0.0 |
| Met | 2.444 ± 0.371 | 0.37 ± 0.147 | 2.074 ± 0.368 | 1.556 ± 0.348 | 0.963 ± 0.282 | 1.778 ± 0.386 | 0.519 ± 0.205 | 1.037 ± 0.333 | 1.185 ± 0.336 | 1.63 ± 0.412 | 0.815 ± 0.235 | 0.667 ± 0.224 | 1.481 ± 0.366 | 0.593 ± 0.217 | 1.037 ± 0.275 | 2.0 ± 0.321 | 1.704 ± 0.336 | 1.407 ± 0.323 | 0.37 ± 0.156 | 0.519 ± 0.2 | 0.0 ± 0.0 |
| Asn | 3.778 ± 0.742 | 0.296 ± 0.177 | 2.296 ± 0.373 | 1.556 ± 0.376 | 1.259 ± 0.267 | 3.037 ± 0.575 | 0.741 ± 0.24 | 1.259 ± 0.262 | 1.333 ± 0.3 | 2.444 ± 0.424 | 0.444 ± 0.149 | 1.407 ± 0.391 | 1.926 ± 0.363 | 1.037 ± 0.233 | 1.63 ± 0.415 | 2.963 ± 0.511 | 2.444 ± 0.422 | 2.074 ± 0.357 | 0.815 ± 0.298 | 1.259 ± 0.325 | 0.0 ± 0.0 |
| Pro | 6.963 ± 0.857 | 0.222 ± 0.141 | 3.926 ± 0.66 | 3.185 ± 0.622 | 1.037 ± 0.283 | 5.111 ± 0.748 | 0.963 ± 0.228 | 2.296 ± 0.467 | 2.815 ± 0.456 | 2.741 ± 0.478 | 1.259 ± 0.363 | 1.852 ± 0.466 | 1.63 ± 0.392 | 2.593 ± 0.805 | 2.741 ± 0.448 | 3.037 ± 0.437 | 4.296 ± 0.513 | 4.741 ± 0.603 | 0.963 ± 0.251 | 1.704 ± 0.424 | 0.0 ± 0.0 |
| Gln | 5.259 ± 0.593 | 0.074 ± 0.068 | 2.519 ± 0.359 | 3.333 ± 0.554 | 0.889 ± 0.346 | 4.0 ± 0.708 | 0.741 ± 0.263 | 2.0 ± 0.564 | 1.556 ± 0.345 | 3.778 ± 0.701 | 1.259 ± 0.338 | 1.407 ± 0.353 | 2.0 ± 0.414 | 2.519 ± 0.667 | 2.222 ± 0.37 | 2.37 ± 0.588 | 1.852 ± 0.34 | 3.407 ± 0.514 | 0.889 ± 0.327 | 1.037 ± 0.292 | 0.0 ± 0.0 |
| Arg | 5.037 ± 0.761 | 0.741 ± 0.254 | 3.556 ± 0.51 | 2.963 ± 0.545 | 2.148 ± 0.36 | 4.0 ± 0.736 | 1.037 ± 0.302 | 3.037 ± 0.547 | 3.111 ± 0.562 | 5.63 ± 0.774 | 2.296 ± 0.41 | 1.704 ± 0.355 | 3.111 ± 0.536 | 2.593 ± 0.434 | 4.074 ± 0.801 | 3.333 ± 0.547 | 3.407 ± 0.582 | 5.259 ± 0.684 | 1.481 ± 0.374 | 1.111 ± 0.372 | 0.0 ± 0.0 |
| Ser | 5.926 ± 0.86 | 0.519 ± 0.196 | 3.407 ± 0.46 | 2.667 ± 0.483 | 1.259 ± 0.314 | 3.704 ± 0.674 | 0.815 ± 0.203 | 2.963 ± 0.524 | 2.296 ± 0.386 | 4.741 ± 0.732 | 1.556 ± 0.325 | 2.0 ± 0.472 | 2.889 ± 0.363 | 2.222 ± 0.375 | 2.741 ± 0.537 | 3.333 ± 0.649 | 3.481 ± 0.547 | 3.852 ± 0.718 | 1.037 ± 0.346 | 2.815 ± 0.468 | 0.0 ± 0.0 |
| Thr | 5.852 ± 0.936 | 0.222 ± 0.111 | 3.037 ± 0.514 | 3.259 ± 0.56 | 2.815 ± 0.601 | 5.778 ± 0.515 | 1.037 ± 0.317 | 3.185 ± 0.454 | 2.593 ± 0.461 | 7.037 ± 0.632 | 1.407 ± 0.255 | 2.222 ± 0.381 | 3.481 ± 0.431 | 2.519 ± 0.519 | 4.667 ± 0.637 | 4.296 ± 0.673 | 4.667 ± 1.111 | 5.852 ± 0.68 | 1.704 ± 0.395 | 1.926 ± 0.391 | 0.0 ± 0.0 |
| Val | 8.074 ± 0.711 | 0.222 ± 0.118 | 5.407 ± 0.551 | 4.963 ± 0.688 | 2.0 ± 0.417 | 5.778 ± 0.8 | 1.481 ± 0.329 | 4.519 ± 0.629 | 3.111 ± 0.487 | 5.704 ± 0.647 | 1.481 ± 0.392 | 2.0 ± 0.559 | 4.0 ± 0.587 | 3.407 ± 0.609 | 3.852 ± 0.575 | 4.0 ± 0.583 | 4.889 ± 0.639 | 4.37 ± 0.64 | 1.556 ± 0.351 | 2.37 ± 0.475 | 0.0 ± 0.0 |
| Trp | 1.556 ± 0.339 | 0.074 ± 0.068 | 1.333 ± 0.229 | 1.111 ± 0.315 | 0.741 ± 0.24 | 0.889 ± 0.313 | 0.519 ± 0.186 | 1.037 ± 0.238 | 0.667 ± 0.196 | 1.63 ± 0.337 | 0.37 ± 0.167 | 1.259 ± 0.26 | 1.111 ± 0.295 | 0.815 ± 0.271 | 0.667 ± 0.217 | 0.741 ± 0.253 | 1.556 ± 0.407 | 1.481 ± 0.453 | 0.444 ± 0.193 | 0.519 ± 0.181 | 0.0 ± 0.0 |
| Tyr | 2.519 ± 0.378 | 0.37 ± 0.176 | 2.0 ± 0.448 | 1.481 ± 0.293 | 0.815 ± 0.211 | 2.444 ± 0.34 | 0.519 ± 0.184 | 1.037 ± 0.29 | 1.481 ± 0.353 | 1.852 ± 0.332 | 0.889 ± 0.236 | 1.333 ± 0.278 | 1.704 ± 0.388 | 1.185 ± 0.378 | 2.296 ± 0.496 | 1.556 ± 0.383 | 1.407 ± 0.409 | 2.519 ± 0.407 | 0.444 ± 0.158 | 0.815 ± 0.334 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 62 proteins (13501 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
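A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's implementation: the helper names (`frequency_permille`, `bootstrap_error`) are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation across replicates, which the note does not actually specify.

```python
import random
import statistics


def frequency_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation of the
    per-replicate frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences for illustration only.
toy = ["AAAA", "CCCC", "AACC"]
error = bootstrap_error(toy, "AA", n_rounds=100, seed=1)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. When every protein is identical, each replicate yields the same frequency and the estimated error is zero.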

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski