Amino acid dipeptide frequency for Shewanella phage S0112

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones in blue, in the table.
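As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the code used to build this database: the function names `dipeptide_frequencies` and `classify` are hypothetical, overlapping pairs are assumed to be counted within each protein, and pairs containing non-standard residues are simply skipped rather than mapped to Xaa.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return the fraction of each of the 400 standard dipeptides.

    Assumption: overlapping pairs are counted inside each protein and
    never across protein boundaries; pairs with a non-standard letter
    are ignored instead of being assigned to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    total = sum(counts[p] for p in pairs)
    return {p: (counts[p] / total if total else 0.0) for p in pairs}


def classify(frequency, expected=1 / 400):
    """Label a frequency relative to the uniform expectation of 0.25%."""
    if frequency > expected:
        return "over"
    if frequency < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKAALAA", "AAGLK"])
    print(len(freqs), classify(freqs["AA"]))
```

Under these assumptions, "over" would correspond to the red cells and "under" to the blue cells of the table.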

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
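The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and only illustrate the arithmetic; the AlaAla value of 7.8‰ is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (1% = 10 per mille)."""
    return value / 10


# Uniform expectation for 400 dipeptides, expressed in per mille (0.25%).
EXPECTED_PER_MILLE = 1000 / 400

if __name__ == "__main__":
    ala_ala = 7.8  # AlaAla entry from the table, in per mille
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
    print(ala_ala / EXPECTED_PER_MILLE)  # fold change over the uniform expectation
```

Read this way, AlaAla at 7.8‰ is 0.78%, roughly three times the 0.25% expected for randomly distributed dipeptides.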

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.8 ± 1.201
AlaCys: 1.049 ± 0.24
AlaAsp: 4.13 ± 0.618
AlaGlu: 5.899 ± 0.701
AlaPhe: 3.081 ± 0.501
AlaGly: 6.489 ± 0.799
AlaHis: 1.442 ± 0.29
AlaIle: 5.899 ± 0.508
AlaLys: 5.703 ± 0.695
AlaLeu: 7.472 ± 0.893
AlaMet: 1.704 ± 0.316
AlaAsn: 3.998 ± 0.55
AlaPro: 3.212 ± 0.44
AlaGln: 3.343 ± 0.468
AlaArg: 3.081 ± 0.391
AlaSer: 4.916 ± 0.69
AlaThr: 6.489 ± 0.992
AlaVal: 5.44 ± 0.591
AlaTrp: 1.049 ± 0.371
AlaTyr: 3.605 ± 0.477
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.721 ± 0.295
CysCys: 0.131 ± 0.094
CysAsp: 0.721 ± 0.203
CysGlu: 1.245 ± 0.278
CysPhe: 0.393 ± 0.161
CysGly: 0.918 ± 0.288
CysHis: 0.328 ± 0.141
CysIle: 0.393 ± 0.164
CysLys: 0.524 ± 0.193
CysLeu: 0.655 ± 0.175
CysMet: 0.262 ± 0.127
CysAsn: 0.524 ± 0.214
CysPro: 0.721 ± 0.224
CysGln: 0.787 ± 0.248
CysArg: 0.918 ± 0.256
CysSer: 0.787 ± 0.219
CysThr: 0.524 ± 0.197
CysVal: 0.787 ± 0.241
CysTrp: 0.131 ± 0.098
CysTyr: 0.655 ± 0.163
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.933 ± 0.361
AspCys: 0.459 ± 0.146
AspAsp: 2.753 ± 0.366
AspGlu: 3.54 ± 0.455
AspPhe: 3.277 ± 0.372
AspGly: 3.998 ± 0.463
AspHis: 0.852 ± 0.22
AspIle: 4.982 ± 0.647
AspLys: 3.54 ± 0.412
AspLeu: 6.62 ± 0.578
AspMet: 1.442 ± 0.283
AspAsn: 2.556 ± 0.463
AspPro: 3.802 ± 0.606
AspGln: 2.032 ± 0.348
AspArg: 2.753 ± 0.484
AspSer: 4.326 ± 0.545
AspThr: 3.474 ± 0.341
AspVal: 2.622 ± 0.369
AspTrp: 1.114 ± 0.277
AspTyr: 1.573 ± 0.351
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.227 ± 0.625
GluCys: 1.049 ± 0.307
GluAsp: 4.719 ± 0.521
GluGlu: 6.162 ± 0.672
GluPhe: 2.163 ± 0.383
GluGly: 4.523 ± 0.581
GluHis: 1.442 ± 0.295
GluIle: 3.605 ± 0.49
GluLys: 3.605 ± 0.571
GluLeu: 5.899 ± 0.631
GluMet: 2.229 ± 0.434
GluAsn: 1.573 ± 0.326
GluPro: 1.639 ± 0.362
GluGln: 3.081 ± 0.525
GluArg: 2.491 ± 0.432
GluSer: 4.392 ± 0.497
GluThr: 4.064 ± 0.507
GluVal: 4.392 ± 0.575
GluTrp: 0.918 ± 0.286
GluTyr: 2.032 ± 0.374
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.95 ± 0.757
PheCys: 0.655 ± 0.193
PheAsp: 3.277 ± 0.589
PheGlu: 2.819 ± 0.425
PhePhe: 1.049 ± 0.255
PheGly: 2.884 ± 0.36
PheHis: 0.787 ± 0.23
PheIle: 2.95 ± 0.49
PheLys: 1.77 ± 0.347
PheLeu: 3.015 ± 0.367
PheMet: 1.245 ± 0.324
PheAsn: 2.425 ± 0.374
PhePro: 1.377 ± 0.247
PheGln: 1.77 ± 0.437
PheArg: 2.491 ± 0.459
PheSer: 2.95 ± 0.465
PheThr: 2.687 ± 0.387
PheVal: 2.294 ± 0.408
PheTrp: 0.787 ± 0.191
PheTyr: 1.508 ± 0.282
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.834 ± 0.531
GlyCys: 0.983 ± 0.224
GlyAsp: 3.867 ± 0.451
GlyGlu: 3.671 ± 0.497
GlyPhe: 3.343 ± 0.531
GlyGly: 4.457 ± 0.592
GlyHis: 1.442 ± 0.383
GlyIle: 4.523 ± 0.448
GlyLys: 4.457 ± 0.568
GlyLeu: 6.227 ± 0.733
GlyMet: 1.704 ± 0.383
GlyAsn: 2.884 ± 0.409
GlyPro: 2.294 ± 0.375
GlyGln: 2.622 ± 0.358
GlyArg: 3.933 ± 0.519
GlySer: 4.916 ± 0.676
GlyThr: 5.047 ± 0.598
GlyVal: 5.244 ± 0.558
GlyTrp: 1.311 ± 0.354
GlyTyr: 2.556 ± 0.454
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.377 ± 0.303
HisCys: 0.262 ± 0.14
HisAsp: 0.918 ± 0.251
HisGlu: 0.918 ± 0.276
HisPhe: 0.918 ± 0.286
HisGly: 0.983 ± 0.293
HisHis: 0.524 ± 0.214
HisIle: 1.77 ± 0.378
HisLys: 1.114 ± 0.251
HisLeu: 1.639 ± 0.33
HisMet: 0.721 ± 0.226
HisAsn: 0.787 ± 0.226
HisPro: 0.721 ± 0.277
HisGln: 0.524 ± 0.167
HisArg: 0.721 ± 0.21
HisSer: 1.18 ± 0.283
HisThr: 0.852 ± 0.226
HisVal: 1.18 ± 0.3
HisTrp: 0.393 ± 0.165
HisTyr: 0.787 ± 0.261
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.178 ± 0.605
IleCys: 1.311 ± 0.301
IleAsp: 4.261 ± 0.67
IleGlu: 4.654 ± 0.504
IlePhe: 2.753 ± 0.313
IleGly: 3.343 ± 0.599
IleHis: 1.049 ± 0.302
IleIle: 4.064 ± 0.682
IleLys: 3.081 ± 0.521
IleLeu: 5.113 ± 0.522
IleMet: 1.311 ± 0.311
IleAsn: 3.605 ± 0.494
IlePro: 5.44 ± 0.55
IleGln: 3.343 ± 0.466
IleArg: 2.819 ± 0.422
IleSer: 3.408 ± 0.421
IleThr: 4.195 ± 0.716
IleVal: 3.408 ± 0.513
IleTrp: 0.328 ± 0.141
IleTyr: 2.425 ± 0.384
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.982 ± 0.668
LysCys: 0.655 ± 0.214
LysAsp: 3.802 ± 0.577
LysGlu: 5.309 ± 0.593
LysPhe: 1.639 ± 0.372
LysGly: 3.933 ± 0.463
LysHis: 1.114 ± 0.33
LysIle: 3.671 ± 0.511
LysLys: 3.933 ± 0.728
LysLeu: 5.244 ± 0.595
LysMet: 1.049 ± 0.321
LysAsn: 2.884 ± 0.479
LysPro: 2.884 ± 0.572
LysGln: 1.508 ± 0.263
LysArg: 3.277 ± 0.412
LysSer: 2.425 ± 0.385
LysThr: 2.032 ± 0.352
LysVal: 3.998 ± 0.474
LysTrp: 0.459 ± 0.171
LysTyr: 2.36 ± 0.378
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.866 ± 0.672
LeuCys: 0.983 ± 0.334
LeuAsp: 6.424 ± 0.558
LeuGlu: 6.358 ± 0.669
LeuPhe: 3.736 ± 0.495
LeuGly: 6.096 ± 0.758
LeuHis: 1.442 ± 0.333
LeuIle: 5.834 ± 0.677
LeuLys: 4.326 ± 0.501
LeuLeu: 6.162 ± 0.673
LeuMet: 1.442 ± 0.288
LeuAsn: 4.064 ± 0.512
LeuPro: 3.867 ± 0.468
LeuGln: 3.474 ± 0.508
LeuArg: 4.719 ± 0.816
LeuSer: 5.703 ± 0.446
LeuThr: 5.309 ± 0.69
LeuVal: 4.719 ± 0.536
LeuTrp: 0.983 ± 0.18
LeuTyr: 2.687 ± 0.458
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.753 ± 0.385
MetCys: 0.197 ± 0.117
MetAsp: 1.049 ± 0.221
MetGlu: 0.983 ± 0.278
MetPhe: 0.852 ± 0.261
MetGly: 1.245 ± 0.253
MetHis: 0.459 ± 0.153
MetIle: 0.918 ± 0.234
MetLys: 1.835 ± 0.348
MetLeu: 1.901 ± 0.346
MetMet: 0.328 ± 0.14
MetAsn: 0.983 ± 0.291
MetPro: 1.377 ± 0.319
MetGln: 0.59 ± 0.164
MetArg: 1.377 ± 0.327
MetSer: 1.508 ± 0.252
MetThr: 1.508 ± 0.305
MetVal: 1.704 ± 0.337
MetTrp: 0.262 ± 0.143
MetTyr: 0.459 ± 0.165
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.392 ± 0.58
AsnCys: 0.459 ± 0.168
AsnAsp: 1.77 ± 0.384
AsnGlu: 1.966 ± 0.43
AsnPhe: 2.032 ± 0.346
AsnGly: 4.654 ± 0.418
AsnHis: 0.721 ± 0.201
AsnIle: 3.015 ± 0.502
AsnLys: 2.229 ± 0.417
AsnLeu: 3.736 ± 0.442
AsnMet: 1.049 ± 0.233
AsnAsn: 1.442 ± 0.382
AsnPro: 2.556 ± 0.432
AsnGln: 2.229 ± 0.385
AsnArg: 2.032 ± 0.344
AsnSer: 2.687 ± 0.366
AsnThr: 2.819 ± 0.536
AsnVal: 2.425 ± 0.426
AsnTrp: 0.459 ± 0.164
AsnTyr: 1.639 ± 0.342
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.392 ± 0.529
ProCys: 0.59 ± 0.193
ProAsp: 2.556 ± 0.335
ProGlu: 3.015 ± 0.458
ProPhe: 2.032 ± 0.356
ProGly: 3.933 ± 0.519
ProHis: 0.721 ± 0.193
ProIle: 3.015 ± 0.399
ProLys: 3.212 ± 0.534
ProLeu: 3.605 ± 0.408
ProMet: 0.655 ± 0.196
ProAsn: 1.901 ± 0.313
ProPro: 2.425 ± 0.395
ProGln: 1.966 ± 0.335
ProArg: 2.098 ± 0.29
ProSer: 2.95 ± 0.428
ProThr: 2.884 ± 0.488
ProVal: 3.212 ± 0.48
ProTrp: 0.983 ± 0.254
ProTyr: 2.098 ± 0.405
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.605 ± 0.584
GlnCys: 0.459 ± 0.184
GlnAsp: 1.966 ± 0.312
GlnGlu: 2.95 ± 0.422
GlnPhe: 1.377 ± 0.239
GlnGly: 2.556 ± 0.48
GlnHis: 0.59 ± 0.204
GlnIle: 2.032 ± 0.366
GlnLys: 2.36 ± 0.451
GlnLeu: 3.671 ± 0.704
GlnMet: 1.245 ± 0.271
GlnAsn: 1.508 ± 0.309
GlnPro: 1.901 ± 0.359
GlnGln: 1.508 ± 0.259
GlnArg: 2.163 ± 0.429
GlnSer: 1.77 ± 0.304
GlnThr: 2.556 ± 0.508
GlnVal: 2.753 ± 0.407
GlnTrp: 0.59 ± 0.168
GlnTyr: 1.377 ± 0.276
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.95 ± 0.431
ArgCys: 0.59 ± 0.205
ArgAsp: 2.687 ± 0.427
ArgGlu: 2.884 ± 0.486
ArgPhe: 2.36 ± 0.359
ArgGly: 3.212 ± 0.469
ArgHis: 0.59 ± 0.218
ArgIle: 3.54 ± 0.436
ArgLys: 3.867 ± 0.55
ArgLeu: 4.064 ± 0.572
ArgMet: 1.245 ± 0.269
ArgAsn: 1.966 ± 0.406
ArgPro: 2.622 ± 0.364
ArgGln: 3.081 ± 0.385
ArgArg: 3.081 ± 0.464
ArgSer: 2.95 ± 0.492
ArgThr: 1.77 ± 0.377
ArgVal: 3.671 ± 0.421
ArgTrp: 0.721 ± 0.181
ArgTyr: 1.77 ± 0.281
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.506 ± 0.677
SerCys: 0.262 ± 0.136
SerAsp: 3.474 ± 0.4
SerGlu: 3.212 ± 0.424
SerPhe: 2.95 ± 0.553
SerGly: 5.44 ± 0.532
SerHis: 1.245 ± 0.297
SerIle: 4.392 ± 0.663
SerLys: 3.081 ± 0.469
SerLeu: 4.457 ± 0.737
SerMet: 1.18 ± 0.305
SerAsn: 2.95 ± 0.444
SerPro: 3.54 ± 0.498
SerGln: 1.835 ± 0.306
SerArg: 2.95 ± 0.329
SerSer: 3.867 ± 0.555
SerThr: 3.474 ± 0.485
SerVal: 4.654 ± 0.497
SerTrp: 0.983 ± 0.286
SerTyr: 2.229 ± 0.474
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.834 ± 0.745
ThrCys: 0.787 ± 0.248
ThrAsp: 3.343 ± 0.493
ThrGlu: 3.736 ± 0.515
ThrPhe: 2.95 ± 0.428
ThrGly: 5.572 ± 0.672
ThrHis: 0.918 ± 0.246
ThrIle: 3.867 ± 0.498
ThrLys: 3.408 ± 0.53
ThrLeu: 5.637 ± 0.687
ThrMet: 1.114 ± 0.276
ThrAsn: 3.015 ± 0.428
ThrPro: 3.277 ± 0.518
ThrGln: 1.835 ± 0.318
ThrArg: 2.294 ± 0.401
ThrSer: 3.998 ± 0.636
ThrThr: 3.343 ± 0.538
ThrVal: 2.884 ± 0.361
ThrTrp: 0.918 ± 0.249
ThrTyr: 1.704 ± 0.446
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.572 ± 0.695
ValCys: 0.524 ± 0.205
ValAsp: 4.588 ± 0.598
ValGlu: 4.523 ± 0.555
ValPhe: 3.277 ± 0.429
ValGly: 4.064 ± 0.511
ValHis: 1.245 ± 0.302
ValIle: 3.212 ± 0.554
ValLys: 3.146 ± 0.531
ValLeu: 5.375 ± 0.496
ValMet: 1.311 ± 0.316
ValAsn: 2.95 ± 0.47
ValPro: 2.229 ± 0.293
ValGln: 1.639 ± 0.399
ValArg: 3.408 ± 0.416
ValSer: 4.195 ± 0.509
ValThr: 4.392 ± 0.636
ValVal: 3.671 ± 0.519
ValTrp: 0.328 ± 0.147
ValTyr: 2.229 ± 0.351
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.983 ± 0.221
TrpCys: 0.197 ± 0.105
TrpAsp: 1.245 ± 0.307
TrpGlu: 0.787 ± 0.265
TrpPhe: 0.524 ± 0.175
TrpGly: 0.787 ± 0.23
TrpHis: 0.393 ± 0.159
TrpIle: 0.459 ± 0.155
TrpLys: 0.852 ± 0.283
TrpLeu: 1.377 ± 0.273
TrpMet: 0.197 ± 0.102
TrpAsn: 0.787 ± 0.202
TrpPro: 0.459 ± 0.158
TrpGln: 0.655 ± 0.238
TrpArg: 1.049 ± 0.207
TrpSer: 0.59 ± 0.172
TrpThr: 0.393 ± 0.165
TrpVal: 0.918 ± 0.223
TrpTrp: 0.393 ± 0.177
TrpTyr: 0.59 ± 0.167
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.95 ± 0.517
TyrCys: 0.524 ± 0.224
TyrAsp: 2.098 ± 0.407
TyrGlu: 1.573 ± 0.311
TyrPhe: 1.049 ± 0.227
TyrGly: 2.229 ± 0.4
TyrHis: 0.983 ± 0.202
TyrIle: 2.95 ± 0.47
TyrLys: 1.114 ± 0.254
TyrLeu: 4.326 ± 0.593
TyrMet: 0.787 ± 0.259
TyrAsn: 1.508 ± 0.241
TyrPro: 1.901 ± 0.333
TyrGln: 1.114 ± 0.26
TyrArg: 1.966 ± 0.383
TyrSer: 2.163 ± 0.34
TyrThr: 2.622 ± 0.469
TyrVal: 1.901 ± 0.34
TyrTrp: 0.524 ± 0.225
TyrTyr: 1.18 ± 0.301
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (15257 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
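A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the error is taken to be the standard deviation of the replicate estimates (the page does not say which spread measure is reported).

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    count = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == dipeptide:
                count += 1
    return 1000 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the reported error is the standard deviation across
    replicates; requires at least two replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKAALAA", "AAGLK", "MLLKAAG", "GGSAA"]
    print(dipeptide_per_mille(toy_proteome, "AA"))
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.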

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski