Amino acid dipeptide frequency for Salmonella phage SE1 (in:Nonagvirus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent residues.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would have a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
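The counting behind such a table can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names, the toy sequences, and the choice to normalise by the total number of overlapping dipeptide windows are assumptions, and the comparison uses only the uniform 0.25% expectation described above, which may differ from the rule used to colour the table.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping windows.

    Windows never span two proteins. Any non-standard letter is mapped to
    'X' (Xaa). Normalising by the total window count is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

EXPECTED_UNIFORM = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

def classify(freq_per_mille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# Toy input (hypothetical sequences, not taken from the phage proteome)
freqs = dipeptide_per_mille(["MAAGK", "MKAA"])
```

With the values from the table, `classify(8.867)` (AlaAla) gives "over" and `classify(0.919)` (AlaCys) gives "under" relative to the uniform expectation.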

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
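As a small worked conversion between the units used on this page, the sketch below turns per mille values into fractions and percentages. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10.0

def percent_to_per_mille(value):
    """Convert percent to per mille."""
    return value * 10.0

# AlaAla from the table: 8.867 per mille
ala_ala_fraction = per_mille_to_fraction(8.867)   # about 0.008867
ala_ala_percent = per_mille_to_percent(8.867)     # about 0.8867 %

# The uniform expectation of 0.25 % expressed in the table's units
uniform_per_mille = percent_to_per_mille(0.25)    # 2.5 per mille
```

In the table's units, the uniform expectation of 0.25% therefore corresponds to 2.5‰.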

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.867 ± 1.255
AlaCys: 0.919 ± 0.227
AlaAsp: 4.055 ± 0.465
AlaGlu: 4.92 ± 0.642
AlaPhe: 2.812 ± 0.48
AlaGly: 5.677 ± 0.636
AlaHis: 1.081 ± 0.223
AlaIle: 5.839 ± 0.627
AlaLys: 4.65 ± 0.705
AlaLeu: 5.353 ± 0.564
AlaMet: 2.541 ± 0.378
AlaAsn: 4.001 ± 0.47
AlaPro: 2.163 ± 0.276
AlaGln: 4.866 ± 0.685
AlaArg: 4.001 ± 0.531
AlaSer: 4.055 ± 0.597
AlaThr: 4.488 ± 0.567
AlaVal: 4.488 ± 0.51
AlaTrp: 1.135 ± 0.26
AlaTyr: 3.244 ± 0.47
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.757 ± 0.211
CysCys: 0.378 ± 0.182
CysAsp: 0.811 ± 0.192
CysGlu: 1.244 ± 0.26
CysPhe: 0.541 ± 0.173
CysGly: 1.352 ± 0.335
CysHis: 0.378 ± 0.146
CysIle: 1.027 ± 0.246
CysLys: 1.135 ± 0.226
CysLeu: 0.595 ± 0.184
CysMet: 0.216 ± 0.117
CysAsn: 0.865 ± 0.197
CysPro: 0.757 ± 0.195
CysGln: 0.541 ± 0.192
CysArg: 0.595 ± 0.199
CysSer: 0.703 ± 0.188
CysThr: 0.919 ± 0.271
CysVal: 1.19 ± 0.27
CysTrp: 0.27 ± 0.119
CysTyr: 0.541 ± 0.17
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.596 ± 0.608
AspCys: 0.541 ± 0.166
AspAsp: 3.028 ± 0.381
AspGlu: 4.325 ± 0.467
AspPhe: 2.271 ± 0.341
AspGly: 6.11 ± 0.587
AspHis: 1.514 ± 0.256
AspIle: 3.839 ± 0.414
AspLys: 3.731 ± 0.441
AspLeu: 4.65 ± 0.421
AspMet: 1.244 ± 0.232
AspAsn: 2.703 ± 0.439
AspPro: 1.73 ± 0.317
AspGln: 1.73 ± 0.258
AspArg: 1.892 ± 0.263
AspSer: 2.812 ± 0.443
AspThr: 2.812 ± 0.398
AspVal: 4.163 ± 0.371
AspTrp: 0.757 ± 0.201
AspTyr: 2.109 ± 0.426
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.92 ± 0.667
GluCys: 1.135 ± 0.312
GluAsp: 3.839 ± 0.432
GluGlu: 5.407 ± 0.574
GluPhe: 2.271 ± 0.39
GluGly: 4.38 ± 0.554
GluHis: 1.135 ± 0.251
GluIle: 4.271 ± 0.473
GluLys: 3.893 ± 0.557
GluLeu: 5.677 ± 0.628
GluMet: 2.812 ± 0.427
GluAsn: 2.758 ± 0.386
GluPro: 2.109 ± 0.474
GluGln: 4.109 ± 0.513
GluArg: 5.082 ± 0.363
GluSer: 3.352 ± 0.447
GluThr: 3.082 ± 0.406
GluVal: 4.271 ± 0.499
GluTrp: 0.757 ± 0.279
GluTyr: 2.487 ± 0.361
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.433 ± 0.408
PheCys: 0.703 ± 0.182
PheAsp: 3.028 ± 0.417
PheGlu: 1.892 ± 0.335
PhePhe: 1.19 ± 0.279
PheGly: 2.595 ± 0.423
PheHis: 1.081 ± 0.226
PheIle: 2.595 ± 0.43
PheLys: 2.595 ± 0.344
PheLeu: 2.109 ± 0.285
PheMet: 1.027 ± 0.249
PheAsn: 3.028 ± 0.556
PhePro: 1.46 ± 0.244
PheGln: 1.135 ± 0.264
PheArg: 2.055 ± 0.432
PheSer: 2.217 ± 0.355
PheThr: 1.892 ± 0.386
PheVal: 1.838 ± 0.311
PheTrp: 0.27 ± 0.121
PheTyr: 0.973 ± 0.252
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.488 ± 0.685
GlyCys: 1.027 ± 0.249
GlyAsp: 3.298 ± 0.393
GlyGlu: 4.217 ± 0.45
GlyPhe: 3.46 ± 0.493
GlyGly: 5.569 ± 0.757
GlyHis: 1.622 ± 0.309
GlyIle: 4.055 ± 0.475
GlyLys: 4.542 ± 0.478
GlyLeu: 6.056 ± 0.409
GlyMet: 2.541 ± 0.356
GlyAsn: 2.866 ± 0.432
GlyPro: 1.514 ± 0.225
GlyGln: 2.974 ± 0.459
GlyArg: 3.514 ± 0.363
GlySer: 5.623 ± 0.542
GlyThr: 4.325 ± 0.572
GlyVal: 6.056 ± 0.584
GlyTrp: 1.135 ± 0.258
GlyTyr: 2.541 ± 0.365
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.352 ± 0.288
HisCys: 0.324 ± 0.145
HisAsp: 1.135 ± 0.269
HisGlu: 1.73 ± 0.343
HisPhe: 0.811 ± 0.233
HisGly: 1.298 ± 0.235
HisHis: 0.487 ± 0.168
HisIle: 1.568 ± 0.338
HisLys: 0.757 ± 0.185
HisLeu: 1.622 ± 0.306
HisMet: 0.703 ± 0.215
HisAsn: 1.027 ± 0.254
HisPro: 0.757 ± 0.159
HisGln: 0.973 ± 0.294
HisArg: 0.703 ± 0.194
HisSer: 1.081 ± 0.218
HisThr: 0.865 ± 0.215
HisVal: 1.46 ± 0.248
HisTrp: 0.541 ± 0.187
HisTyr: 0.865 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.299 ± 0.501
IleCys: 0.973 ± 0.214
IleAsp: 4.109 ± 0.466
IleGlu: 4.055 ± 0.488
IlePhe: 1.406 ± 0.26
IleGly: 3.785 ± 0.503
IleHis: 1.135 ± 0.204
IleIle: 4.38 ± 0.472
IleLys: 4.596 ± 0.478
IleLeu: 3.893 ± 0.442
IleMet: 1.298 ± 0.245
IleAsn: 3.839 ± 0.384
IlePro: 2.595 ± 0.304
IleGln: 2.758 ± 0.372
IleArg: 3.46 ± 0.437
IleSer: 3.244 ± 0.452
IleThr: 3.677 ± 0.419
IleVal: 3.947 ± 0.572
IleTrp: 0.865 ± 0.244
IleTyr: 2.325 ± 0.352
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.461 ± 0.678
LysCys: 0.811 ± 0.25
LysAsp: 3.406 ± 0.381
LysGlu: 4.92 ± 0.554
LysPhe: 2.758 ± 0.462
LysGly: 3.19 ± 0.375
LysHis: 1.46 ± 0.303
LysIle: 3.893 ± 0.451
LysLys: 4.001 ± 0.513
LysLeu: 4.163 ± 0.593
LysMet: 1.622 ± 0.334
LysAsn: 2.055 ± 0.33
LysPro: 2.703 ± 0.437
LysGln: 3.028 ± 0.42
LysArg: 3.244 ± 0.438
LysSer: 3.514 ± 0.389
LysThr: 3.136 ± 0.379
LysVal: 4.109 ± 0.495
LysTrp: 0.865 ± 0.233
LysTyr: 2.541 ± 0.411
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.028 ± 0.522
LeuCys: 1.135 ± 0.243
LeuAsp: 4.38 ± 0.48
LeuGlu: 4.488 ± 0.526
LeuPhe: 2.379 ± 0.277
LeuGly: 4.866 ± 0.471
LeuHis: 1.676 ± 0.312
LeuIle: 3.947 ± 0.506
LeuLys: 4.325 ± 0.522
LeuLeu: 5.353 ± 0.524
LeuMet: 2.109 ± 0.323
LeuAsn: 3.785 ± 0.397
LeuPro: 4.163 ± 0.504
LeuGln: 3.947 ± 0.457
LeuArg: 4.866 ± 0.508
LeuSer: 4.596 ± 0.584
LeuThr: 5.515 ± 0.549
LeuVal: 5.461 ± 0.531
LeuTrp: 0.595 ± 0.169
LeuTyr: 2.541 ± 0.417
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.595 ± 0.43
MetCys: 0.595 ± 0.245
MetAsp: 1.784 ± 0.313
MetGlu: 2.325 ± 0.357
MetPhe: 0.811 ± 0.202
MetGly: 1.784 ± 0.31
MetHis: 0.595 ± 0.173
MetIle: 1.081 ± 0.249
MetLys: 1.622 ± 0.337
MetLeu: 2.595 ± 0.34
MetMet: 0.865 ± 0.301
MetAsn: 1.298 ± 0.266
MetPro: 1.135 ± 0.296
MetGln: 1.784 ± 0.334
MetArg: 1.892 ± 0.24
MetSer: 2.649 ± 0.42
MetThr: 1.73 ± 0.332
MetVal: 1.244 ± 0.241
MetTrp: 0.324 ± 0.126
MetTyr: 0.973 ± 0.235
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.434 ± 0.528
AsnCys: 0.757 ± 0.188
AsnAsp: 2.812 ± 0.327
AsnGlu: 3.406 ± 0.422
AsnPhe: 0.919 ± 0.208
AsnGly: 5.137 ± 0.632
AsnHis: 0.973 ± 0.201
AsnIle: 3.136 ± 0.368
AsnLys: 3.136 ± 0.352
AsnLeu: 4.055 ± 0.46
AsnMet: 0.919 ± 0.251
AsnAsn: 2.487 ± 0.38
AsnPro: 1.838 ± 0.314
AsnGln: 1.568 ± 0.317
AsnArg: 2.163 ± 0.317
AsnSer: 3.352 ± 0.499
AsnThr: 2.379 ± 0.329
AsnVal: 3.136 ± 0.43
AsnTrp: 0.919 ± 0.237
AsnTyr: 1.946 ± 0.445
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.946 ± 0.325
ProCys: 0.433 ± 0.149
ProAsp: 2.758 ± 0.411
ProGlu: 3.082 ± 0.385
ProPhe: 1.892 ± 0.375
ProGly: 3.136 ± 0.475
ProHis: 1.298 ± 0.341
ProIle: 1.838 ± 0.309
ProLys: 1.838 ± 0.315
ProLeu: 3.028 ± 0.385
ProMet: 1.298 ± 0.194
ProAsn: 1.838 ± 0.318
ProPro: 1.892 ± 0.361
ProGln: 1.676 ± 0.604
ProArg: 1.946 ± 0.348
ProSer: 2.487 ± 0.422
ProThr: 1.892 ± 0.421
ProVal: 3.19 ± 0.424
ProTrp: 0.541 ± 0.188
ProTyr: 1.298 ± 0.265
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.271 ± 0.709
GlnCys: 0.487 ± 0.21
GlnAsp: 2.109 ± 0.305
GlnGlu: 3.082 ± 0.404
GlnPhe: 1.73 ± 0.311
GlnGly: 2.055 ± 0.352
GlnHis: 0.487 ± 0.193
GlnIle: 2.92 ± 0.378
GlnLys: 2.055 ± 0.337
GlnLeu: 4.109 ± 0.363
GlnMet: 1.676 ± 0.313
GlnAsn: 2.217 ± 0.397
GlnPro: 2.433 ± 0.863
GlnGln: 3.731 ± 1.618
GlnArg: 2.271 ± 0.451
GlnSer: 2.703 ± 0.395
GlnThr: 3.19 ± 0.388
GlnVal: 3.082 ± 0.355
GlnTrp: 0.703 ± 0.208
GlnTyr: 1.676 ± 0.291
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.623 ± 0.458
ArgCys: 0.703 ± 0.172
ArgAsp: 2.649 ± 0.316
ArgGlu: 3.298 ± 0.546
ArgPhe: 1.946 ± 0.319
ArgGly: 3.082 ± 0.411
ArgHis: 1.135 ± 0.287
ArgIle: 3.244 ± 0.493
ArgLys: 3.082 ± 0.438
ArgLeu: 4.542 ± 0.481
ArgMet: 2.055 ± 0.396
ArgAsn: 3.082 ± 0.468
ArgPro: 2.325 ± 0.364
ArgGln: 2.541 ± 0.364
ArgArg: 2.649 ± 0.343
ArgSer: 3.082 ± 0.464
ArgThr: 2.812 ± 0.432
ArgVal: 3.785 ± 0.379
ArgTrp: 0.919 ± 0.239
ArgTyr: 2.217 ± 0.393
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.92 ± 0.517
SerCys: 1.081 ± 0.232
SerAsp: 3.298 ± 0.549
SerGlu: 3.46 ± 0.446
SerPhe: 2.595 ± 0.433
SerGly: 5.245 ± 0.697
SerHis: 1.19 ± 0.238
SerIle: 2.92 ± 0.405
SerLys: 3.623 ± 0.44
SerLeu: 4.488 ± 0.53
SerMet: 2.001 ± 0.345
SerAsn: 2.595 ± 0.422
SerPro: 1.73 ± 0.309
SerGln: 2.703 ± 0.405
SerArg: 2.92 ± 0.381
SerSer: 3.731 ± 0.467
SerThr: 4.001 ± 0.422
SerVal: 3.893 ± 0.427
SerTrp: 0.919 ± 0.225
SerTyr: 2.595 ± 0.405
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.758 ± 0.609
ThrCys: 0.595 ± 0.199
ThrAsp: 3.46 ± 0.509
ThrGlu: 3.46 ± 0.506
ThrPhe: 2.433 ± 0.422
ThrGly: 4.109 ± 0.516
ThrHis: 0.973 ± 0.311
ThrIle: 4.001 ± 0.446
ThrLys: 2.541 ± 0.348
ThrLeu: 4.38 ± 0.599
ThrMet: 1.622 ± 0.306
ThrAsn: 3.244 ± 0.371
ThrPro: 3.514 ± 0.466
ThrGln: 1.73 ± 0.414
ThrArg: 2.595 ± 0.471
ThrSer: 3.569 ± 0.522
ThrThr: 3.136 ± 0.468
ThrVal: 4.596 ± 0.64
ThrTrp: 1.027 ± 0.229
ThrTyr: 1.298 ± 0.233
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.353 ± 0.649
ValCys: 1.027 ± 0.224
ValAsp: 3.947 ± 0.443
ValGlu: 5.082 ± 0.551
ValPhe: 2.109 ± 0.302
ValGly: 4.758 ± 0.615
ValHis: 1.081 ± 0.288
ValIle: 4.325 ± 0.486
ValLys: 4.758 ± 0.524
ValLeu: 4.325 ± 0.501
ValMet: 1.946 ± 0.344
ValAsn: 2.92 ± 0.434
ValPro: 2.703 ± 0.345
ValGln: 3.569 ± 0.547
ValArg: 3.677 ± 0.336
ValSer: 4.163 ± 0.449
ValThr: 4.38 ± 0.502
ValVal: 6.002 ± 0.629
ValTrp: 1.352 ± 0.301
ValTyr: 2.541 ± 0.37
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.568 ± 0.265
TrpCys: 0.378 ± 0.146
TrpAsp: 0.973 ± 0.205
TrpGlu: 0.865 ± 0.232
TrpPhe: 0.595 ± 0.16
TrpGly: 0.919 ± 0.226
TrpHis: 0.108 ± 0.078
TrpIle: 0.757 ± 0.185
TrpLys: 0.973 ± 0.231
TrpLeu: 1.298 ± 0.224
TrpMet: 0.162 ± 0.092
TrpAsn: 0.811 ± 0.217
TrpPro: 0.378 ± 0.136
TrpGln: 0.595 ± 0.183
TrpArg: 0.919 ± 0.268
TrpSer: 0.757 ± 0.183
TrpThr: 0.649 ± 0.192
TrpVal: 1.19 ± 0.274
TrpTrp: 0.27 ± 0.106
TrpTyr: 0.487 ± 0.126
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.487 ± 0.389
TyrCys: 0.811 ± 0.187
TyrAsp: 1.676 ± 0.286
TyrGlu: 2.487 ± 0.389
TyrPhe: 1.19 ± 0.246
TyrGly: 2.379 ± 0.341
TyrHis: 0.649 ± 0.163
TyrIle: 2.001 ± 0.353
TyrLys: 3.028 ± 0.416
TyrLeu: 2.866 ± 0.496
TyrMet: 1.027 ± 0.231
TyrAsn: 2.055 ± 0.288
TyrPro: 1.406 ± 0.284
TyrGln: 1.027 ± 0.218
TyrArg: 2.325 ± 0.356
TyrSer: 2.217 ± 0.373
TyrThr: 2.163 ± 0.311
TyrVal: 2.92 ± 0.413
TyrTrp: 0.487 ± 0.156
TyrTyr: 1.406 ± 0.273
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (18496 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
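A protein-level bootstrap of this kind can be sketched as follows. The page states only that the error comes from 100 bootstrap replicates at the protein level; everything else here is an assumption made for illustration: resampling whole proteins with replacement, recomputing the per mille frequency on each resample, reporting the standard deviation of the replicates, and the function names themselves.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of a dipeptide's per mille frequency over resamples.

    Each replicate draws len(proteins) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from the phage proteome)
toy = ["MAAGK", "MKAA", "MGGAAK"]
aa_error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than single residues keeps the dependence between neighbouring positions within a protein intact, which is presumably why the estimate is made at the protein level.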

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski