Amino acid dipepetide frequency for Gordonia phage Remus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.551AlaAla: 9.551 ± 1.103
0.801AlaCys: 0.801 ± 0.2
5.608AlaAsp: 5.608 ± 0.649
6.655AlaGlu: 6.655 ± 0.846
3.636AlaPhe: 3.636 ± 0.512
7.518AlaGly: 7.518 ± 0.719
1.417AlaHis: 1.417 ± 0.249
4.56AlaIle: 4.56 ± 0.484
4.622AlaLys: 4.622 ± 0.62
6.594AlaLeu: 6.594 ± 0.694
2.342AlaMet: 2.342 ± 0.317
2.157AlaAsn: 2.157 ± 0.349
4.683AlaPro: 4.683 ± 0.696
3.697AlaGln: 3.697 ± 0.44
5.299AlaArg: 5.299 ± 0.509
5.546AlaSer: 5.546 ± 0.55
6.039AlaThr: 6.039 ± 0.7
7.087AlaVal: 7.087 ± 0.582
2.034AlaTrp: 2.034 ± 0.333
2.034AlaTyr: 2.034 ± 0.363
0.0AlaXaa: 0.0 ± 0.0
Cys
0.801CysAla: 0.801 ± 0.232
0.185CysCys: 0.185 ± 0.135
0.739CysAsp: 0.739 ± 0.204
0.246CysGlu: 0.246 ± 0.112
0.246CysPhe: 0.246 ± 0.136
0.678CysGly: 0.678 ± 0.21
0.185CysHis: 0.185 ± 0.103
0.185CysIle: 0.185 ± 0.11
0.431CysLys: 0.431 ± 0.183
0.801CysLeu: 0.801 ± 0.231
0.37CysMet: 0.37 ± 0.177
0.493CysAsn: 0.493 ± 0.154
0.678CysPro: 0.678 ± 0.243
0.185CysGln: 0.185 ± 0.139
0.431CysArg: 0.431 ± 0.169
0.739CysSer: 0.739 ± 0.189
0.246CysThr: 0.246 ± 0.122
0.431CysVal: 0.431 ± 0.184
0.246CysTrp: 0.246 ± 0.116
0.246CysTyr: 0.246 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
5.361AspAla: 5.361 ± 0.673
0.555AspCys: 0.555 ± 0.196
4.991AspAsp: 4.991 ± 0.686
4.56AspGlu: 4.56 ± 0.642
2.157AspPhe: 2.157 ± 0.288
5.854AspGly: 5.854 ± 0.649
1.602AspHis: 1.602 ± 0.423
3.204AspIle: 3.204 ± 0.468
2.157AspLys: 2.157 ± 0.457
5.977AspLeu: 5.977 ± 0.651
1.109AspMet: 1.109 ± 0.213
2.28AspAsn: 2.28 ± 0.348
4.868AspPro: 4.868 ± 0.548
2.034AspGln: 2.034 ± 0.365
3.451AspArg: 3.451 ± 0.446
3.451AspSer: 3.451 ± 0.423
4.005AspThr: 4.005 ± 0.503
4.005AspVal: 4.005 ± 0.495
1.294AspTrp: 1.294 ± 0.285
2.711AspTyr: 2.711 ± 0.446
0.0AspXaa: 0.0 ± 0.0
Glu
8.935GluAla: 8.935 ± 0.815
0.062GluCys: 0.062 ± 0.061
4.868GluAsp: 4.868 ± 0.554
3.882GluGlu: 3.882 ± 0.693
2.28GluPhe: 2.28 ± 0.381
5.176GluGly: 5.176 ± 0.548
1.602GluHis: 1.602 ± 0.317
4.437GluIle: 4.437 ± 0.496
3.081GluLys: 3.081 ± 0.507
6.532GluLeu: 6.532 ± 0.581
1.787GluMet: 1.787 ± 0.359
1.479GluAsn: 1.479 ± 0.273
2.342GluPro: 2.342 ± 0.397
2.157GluGln: 2.157 ± 0.364
3.759GluArg: 3.759 ± 0.492
3.143GluSer: 3.143 ± 0.353
4.19GluThr: 4.19 ± 0.55
5.608GluVal: 5.608 ± 0.609
1.479GluTrp: 1.479 ± 0.288
1.972GluTyr: 1.972 ± 0.376
0.0GluXaa: 0.0 ± 0.0
Phe
3.266PheAla: 3.266 ± 0.489
0.308PheCys: 0.308 ± 0.144
2.958PheAsp: 2.958 ± 0.406
3.266PheGlu: 3.266 ± 0.509
0.863PhePhe: 0.863 ± 0.225
3.389PheGly: 3.389 ± 0.355
0.801PheHis: 0.801 ± 0.22
1.171PheIle: 1.171 ± 0.25
1.294PheLys: 1.294 ± 0.314
2.095PheLeu: 2.095 ± 0.393
1.048PheMet: 1.048 ± 0.262
1.787PheAsn: 1.787 ± 0.334
1.725PhePro: 1.725 ± 0.341
0.924PheGln: 0.924 ± 0.206
1.541PheArg: 1.541 ± 0.265
2.218PheSer: 2.218 ± 0.416
2.218PheThr: 2.218 ± 0.32
2.403PheVal: 2.403 ± 0.383
0.431PheTrp: 0.431 ± 0.174
0.739PheTyr: 0.739 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
5.977GlyAla: 5.977 ± 0.594
0.924GlyCys: 0.924 ± 0.232
5.792GlyAsp: 5.792 ± 0.654
5.361GlyGlu: 5.361 ± 0.622
3.328GlyPhe: 3.328 ± 0.531
8.565GlyGly: 8.565 ± 1.671
2.157GlyHis: 2.157 ± 0.336
3.944GlyIle: 3.944 ± 0.598
3.821GlyLys: 3.821 ± 0.467
6.47GlyLeu: 6.47 ± 0.8
1.972GlyMet: 1.972 ± 0.314
2.958GlyAsn: 2.958 ± 0.4
3.451GlyPro: 3.451 ± 0.478
2.526GlyGln: 2.526 ± 0.417
4.622GlyArg: 4.622 ± 0.46
4.683GlySer: 4.683 ± 0.745
5.731GlyThr: 5.731 ± 0.711
5.176GlyVal: 5.176 ± 0.554
2.218GlyTrp: 2.218 ± 0.33
2.896GlyTyr: 2.896 ± 0.38
0.0GlyXaa: 0.0 ± 0.0
His
1.787HisAla: 1.787 ± 0.313
0.308HisCys: 0.308 ± 0.123
1.109HisAsp: 1.109 ± 0.218
1.356HisGlu: 1.356 ± 0.31
0.616HisPhe: 0.616 ± 0.206
1.602HisGly: 1.602 ± 0.28
0.37HisHis: 0.37 ± 0.148
1.109HisIle: 1.109 ± 0.249
0.616HisLys: 0.616 ± 0.193
1.972HisLeu: 1.972 ± 0.379
0.308HisMet: 0.308 ± 0.141
0.924HisAsn: 0.924 ± 0.228
0.863HisPro: 0.863 ± 0.193
0.616HisGln: 0.616 ± 0.203
1.91HisArg: 1.91 ± 0.307
0.863HisSer: 0.863 ± 0.258
1.294HisThr: 1.294 ± 0.257
1.294HisVal: 1.294 ± 0.277
0.308HisTrp: 0.308 ± 0.146
0.739HisTyr: 0.739 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
4.498IleAla: 4.498 ± 0.506
0.555IleCys: 0.555 ± 0.165
3.944IleAsp: 3.944 ± 0.37
4.19IleGlu: 4.19 ± 0.557
1.602IlePhe: 1.602 ± 0.292
3.759IleGly: 3.759 ± 0.587
1.048IleHis: 1.048 ± 0.268
2.403IleIle: 2.403 ± 0.378
1.787IleLys: 1.787 ± 0.326
3.759IleLeu: 3.759 ± 0.42
0.924IleMet: 0.924 ± 0.212
2.095IleAsn: 2.095 ± 0.451
3.512IlePro: 3.512 ± 0.448
1.602IleGln: 1.602 ± 0.275
3.512IleArg: 3.512 ± 0.41
2.958IleSer: 2.958 ± 0.337
3.636IleThr: 3.636 ± 0.441
3.266IleVal: 3.266 ± 0.356
0.739IleTrp: 0.739 ± 0.165
1.048IleTyr: 1.048 ± 0.204
0.0IleXaa: 0.0 ± 0.0
Lys
4.93LysAla: 4.93 ± 0.673
0.246LysCys: 0.246 ± 0.158
2.465LysAsp: 2.465 ± 0.314
3.204LysGlu: 3.204 ± 0.439
1.356LysPhe: 1.356 ± 0.314
3.019LysGly: 3.019 ± 0.38
0.801LysHis: 0.801 ± 0.199
2.28LysIle: 2.28 ± 0.369
3.204LysLys: 3.204 ± 0.552
4.868LysLeu: 4.868 ± 0.461
1.171LysMet: 1.171 ± 0.246
0.986LysAsn: 0.986 ± 0.192
2.465LysPro: 2.465 ± 0.368
2.034LysGln: 2.034 ± 0.28
2.588LysArg: 2.588 ± 0.407
2.465LysSer: 2.465 ± 0.466
2.095LysThr: 2.095 ± 0.344
3.882LysVal: 3.882 ± 0.513
0.924LysTrp: 0.924 ± 0.215
1.171LysTyr: 1.171 ± 0.296
0.0LysXaa: 0.0 ± 0.0
Leu
8.072LeuAla: 8.072 ± 0.712
0.616LeuCys: 0.616 ± 0.169
5.361LeuAsp: 5.361 ± 0.592
7.087LeuGlu: 7.087 ± 0.595
2.711LeuPhe: 2.711 ± 0.492
6.532LeuGly: 6.532 ± 0.718
1.602LeuHis: 1.602 ± 0.347
4.19LeuIle: 4.19 ± 0.527
3.081LeuLys: 3.081 ± 0.537
6.039LeuLeu: 6.039 ± 0.626
2.218LeuMet: 2.218 ± 0.38
2.465LeuAsn: 2.465 ± 0.347
4.067LeuPro: 4.067 ± 0.479
1.972LeuGln: 1.972 ± 0.32
5.361LeuArg: 5.361 ± 0.484
4.498LeuSer: 4.498 ± 0.553
4.683LeuThr: 4.683 ± 0.493
6.655LeuVal: 6.655 ± 0.803
1.787LeuTrp: 1.787 ± 0.291
2.588LeuTyr: 2.588 ± 0.481
0.0LeuXaa: 0.0 ± 0.0
Met
2.588MetAla: 2.588 ± 0.343
0.185MetCys: 0.185 ± 0.141
0.924MetAsp: 0.924 ± 0.227
1.232MetGlu: 1.232 ± 0.224
1.356MetPhe: 1.356 ± 0.296
1.417MetGly: 1.417 ± 0.26
0.37MetHis: 0.37 ± 0.169
1.171MetIle: 1.171 ± 0.253
1.048MetLys: 1.048 ± 0.248
1.725MetLeu: 1.725 ± 0.29
0.739MetMet: 0.739 ± 0.21
0.678MetAsn: 0.678 ± 0.213
1.664MetPro: 1.664 ± 0.3
0.801MetGln: 0.801 ± 0.227
1.91MetArg: 1.91 ± 0.301
2.28MetSer: 2.28 ± 0.364
2.835MetThr: 2.835 ± 0.393
1.171MetVal: 1.171 ± 0.268
0.185MetTrp: 0.185 ± 0.089
0.801MetTyr: 0.801 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
2.835AsnAla: 2.835 ± 0.384
0.493AsnCys: 0.493 ± 0.21
1.725AsnAsp: 1.725 ± 0.306
2.034AsnGlu: 2.034 ± 0.33
0.924AsnPhe: 0.924 ± 0.184
3.944AsnGly: 3.944 ± 0.475
0.739AsnHis: 0.739 ± 0.189
1.602AsnIle: 1.602 ± 0.336
1.664AsnLys: 1.664 ± 0.335
3.143AsnLeu: 3.143 ± 0.46
0.739AsnMet: 0.739 ± 0.221
1.232AsnAsn: 1.232 ± 0.275
2.28AsnPro: 2.28 ± 0.331
1.417AsnGln: 1.417 ± 0.275
1.787AsnArg: 1.787 ± 0.34
1.972AsnSer: 1.972 ± 0.282
2.342AsnThr: 2.342 ± 0.4
1.664AsnVal: 1.664 ± 0.28
0.555AsnTrp: 0.555 ± 0.18
1.479AsnTyr: 1.479 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
4.067ProAla: 4.067 ± 0.514
0.308ProCys: 0.308 ± 0.155
3.882ProAsp: 3.882 ± 0.495
4.437ProGlu: 4.437 ± 0.503
1.664ProPhe: 1.664 ± 0.262
4.93ProGly: 4.93 ± 0.601
0.678ProHis: 0.678 ± 0.198
3.143ProIle: 3.143 ± 0.463
2.835ProLys: 2.835 ± 0.455
2.465ProLeu: 2.465 ± 0.443
1.479ProMet: 1.479 ± 0.294
1.972ProAsn: 1.972 ± 0.345
2.095ProPro: 2.095 ± 0.373
1.479ProGln: 1.479 ± 0.313
2.773ProArg: 2.773 ± 0.46
2.711ProSer: 2.711 ± 0.386
4.252ProThr: 4.252 ± 0.421
3.944ProVal: 3.944 ± 0.398
0.678ProTrp: 0.678 ± 0.215
1.725ProTyr: 1.725 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
3.389GlnAla: 3.389 ± 0.472
0.185GlnCys: 0.185 ± 0.099
1.479GlnAsp: 1.479 ± 0.359
2.28GlnGlu: 2.28 ± 0.406
1.232GlnPhe: 1.232 ± 0.284
2.157GlnGly: 2.157 ± 0.319
0.678GlnHis: 0.678 ± 0.197
1.91GlnIle: 1.91 ± 0.244
1.294GlnLys: 1.294 ± 0.296
3.574GlnLeu: 3.574 ± 0.4
1.232GlnMet: 1.232 ± 0.251
1.171GlnAsn: 1.171 ± 0.272
1.171GlnPro: 1.171 ± 0.253
1.048GlnGln: 1.048 ± 0.245
2.465GlnArg: 2.465 ± 0.36
1.541GlnSer: 1.541 ± 0.34
1.972GlnThr: 1.972 ± 0.332
2.28GlnVal: 2.28 ± 0.368
0.801GlnTrp: 0.801 ± 0.192
1.294GlnTyr: 1.294 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
4.745ArgAla: 4.745 ± 0.509
0.678ArgCys: 0.678 ± 0.26
3.821ArgAsp: 3.821 ± 0.394
3.697ArgGlu: 3.697 ± 0.55
2.403ArgPhe: 2.403 ± 0.41
4.19ArgGly: 4.19 ± 0.567
1.417ArgHis: 1.417 ± 0.279
3.821ArgIle: 3.821 ± 0.479
2.588ArgLys: 2.588 ± 0.37
4.991ArgLeu: 4.991 ± 0.508
1.91ArgMet: 1.91 ± 0.289
2.465ArgAsn: 2.465 ± 0.344
2.342ArgPro: 2.342 ± 0.388
1.602ArgGln: 1.602 ± 0.27
5.053ArgArg: 5.053 ± 0.621
3.759ArgSer: 3.759 ± 0.467
3.204ArgThr: 3.204 ± 0.485
3.821ArgVal: 3.821 ± 0.36
1.048ArgTrp: 1.048 ± 0.241
2.157ArgTyr: 2.157 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
5.546SerAla: 5.546 ± 0.607
0.37SerCys: 0.37 ± 0.164
2.65SerAsp: 2.65 ± 0.452
3.512SerGlu: 3.512 ± 0.346
1.972SerPhe: 1.972 ± 0.341
5.792SerGly: 5.792 ± 0.65
0.986SerHis: 0.986 ± 0.215
2.588SerIle: 2.588 ± 0.355
2.711SerLys: 2.711 ± 0.378
5.608SerLeu: 5.608 ± 0.662
1.972SerMet: 1.972 ± 0.306
2.342SerAsn: 2.342 ± 0.493
2.157SerPro: 2.157 ± 0.349
2.711SerGln: 2.711 ± 0.36
3.143SerArg: 3.143 ± 0.391
3.574SerSer: 3.574 ± 0.568
3.389SerThr: 3.389 ± 0.522
4.252SerVal: 4.252 ± 0.593
1.294SerTrp: 1.294 ± 0.271
1.541SerTyr: 1.541 ± 0.301
0.0SerXaa: 0.0 ± 0.0
Thr
5.484ThrAla: 5.484 ± 0.597
0.616ThrCys: 0.616 ± 0.192
4.622ThrAsp: 4.622 ± 0.484
3.697ThrGlu: 3.697 ± 0.441
2.095ThrPhe: 2.095 ± 0.344
4.991ThrGly: 4.991 ± 0.479
0.863ThrHis: 0.863 ± 0.23
3.328ThrIle: 3.328 ± 0.428
4.129ThrLys: 4.129 ± 0.703
5.299ThrLeu: 5.299 ± 0.446
1.171ThrMet: 1.171 ± 0.273
1.972ThrAsn: 1.972 ± 0.343
4.498ThrPro: 4.498 ± 0.533
2.095ThrGln: 2.095 ± 0.422
2.526ThrArg: 2.526 ± 0.395
4.005ThrSer: 4.005 ± 0.497
4.19ThrThr: 4.19 ± 0.58
5.484ThrVal: 5.484 ± 0.563
1.541ThrTrp: 1.541 ± 0.262
2.711ThrTyr: 2.711 ± 0.38
0.0ThrXaa: 0.0 ± 0.0
Val
5.977ValAla: 5.977 ± 0.6
0.801ValCys: 0.801 ± 0.238
5.176ValAsp: 5.176 ± 0.549
4.807ValGlu: 4.807 ± 0.515
2.218ValPhe: 2.218 ± 0.421
4.868ValGly: 4.868 ± 0.62
1.479ValHis: 1.479 ± 0.276
3.512ValIle: 3.512 ± 0.569
3.697ValLys: 3.697 ± 0.505
5.669ValLeu: 5.669 ± 0.712
1.109ValMet: 1.109 ± 0.24
2.773ValAsn: 2.773 ± 0.37
4.375ValPro: 4.375 ± 0.588
2.403ValGln: 2.403 ± 0.393
4.314ValArg: 4.314 ± 0.432
4.067ValSer: 4.067 ± 0.551
5.361ValThr: 5.361 ± 0.676
4.807ValVal: 4.807 ± 0.65
1.109ValTrp: 1.109 ± 0.251
2.403ValTyr: 2.403 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
1.541TrpAla: 1.541 ± 0.342
0.37TrpCys: 0.37 ± 0.179
1.479TrpAsp: 1.479 ± 0.286
1.602TrpGlu: 1.602 ± 0.294
0.801TrpPhe: 0.801 ± 0.212
1.417TrpGly: 1.417 ± 0.262
0.431TrpHis: 0.431 ± 0.166
1.048TrpIle: 1.048 ± 0.235
0.678TrpLys: 0.678 ± 0.198
1.109TrpLeu: 1.109 ± 0.256
0.185TrpMet: 0.185 ± 0.113
1.048TrpAsn: 1.048 ± 0.266
0.801TrpPro: 0.801 ± 0.224
0.863TrpGln: 0.863 ± 0.2
1.048TrpArg: 1.048 ± 0.262
1.417TrpSer: 1.417 ± 0.296
1.294TrpThr: 1.294 ± 0.276
1.479TrpVal: 1.479 ± 0.301
0.678TrpTrp: 0.678 ± 0.204
0.431TrpTyr: 0.431 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.526TyrAla: 2.526 ± 0.406
0.0TyrCys: 0.0 ± 0.0
2.034TyrAsp: 2.034 ± 0.303
1.664TyrGlu: 1.664 ± 0.309
0.986TyrPhe: 0.986 ± 0.259
2.773TyrGly: 2.773 ± 0.35
0.739TyrHis: 0.739 ± 0.209
1.232TyrIle: 1.232 ± 0.284
1.417TyrLys: 1.417 ± 0.335
2.896TyrLeu: 2.896 ± 0.429
1.232TyrMet: 1.232 ± 0.301
1.294TyrAsn: 1.294 ± 0.271
1.479TyrPro: 1.479 ± 0.286
0.986TyrGln: 0.986 ± 0.256
2.218TyrArg: 2.218 ± 0.358
2.218TyrSer: 2.218 ± 0.381
2.342TyrThr: 2.342 ± 0.361
2.218TyrVal: 2.218 ± 0.363
0.431TyrTrp: 0.431 ± 0.174
0.801TyrTyr: 0.801 ± 0.207
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (16229 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski