Amino acid dipepetide frequency for Rhizobium phage 16-3 (Bacteriophage 16-3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.384AlaAla: 17.384 ± 1.213
1.074AlaCys: 1.074 ± 0.244
7.209AlaAsp: 7.209 ± 0.625
7.976AlaGlu: 7.976 ± 0.605
3.221AlaPhe: 3.221 ± 0.393
8.999AlaGly: 8.999 ± 0.762
2.505AlaHis: 2.505 ± 0.347
5.778AlaIle: 5.778 ± 0.61
6.442AlaLys: 6.442 ± 0.629
7.465AlaLeu: 7.465 ± 0.704
4.346AlaMet: 4.346 ± 0.537
4.09AlaAsn: 4.09 ± 0.538
4.244AlaPro: 4.244 ± 0.587
3.63AlaGln: 3.63 ± 0.499
7.158AlaArg: 7.158 ± 0.766
6.698AlaSer: 6.698 ± 0.621
5.42AlaThr: 5.42 ± 0.664
7.516AlaVal: 7.516 ± 0.574
2.352AlaTrp: 2.352 ± 0.352
3.17AlaTyr: 3.17 ± 0.351
0.0AlaXaa: 0.0 ± 0.0
Cys
1.074CysAla: 1.074 ± 0.293
0.46CysCys: 0.46 ± 0.148
0.818CysAsp: 0.818 ± 0.208
0.92CysGlu: 0.92 ± 0.229
0.358CysPhe: 0.358 ± 0.141
1.074CysGly: 1.074 ± 0.258
0.358CysHis: 0.358 ± 0.121
0.358CysIle: 0.358 ± 0.145
0.614CysLys: 0.614 ± 0.201
1.023CysLeu: 1.023 ± 0.309
0.307CysMet: 0.307 ± 0.121
0.307CysAsn: 0.307 ± 0.152
0.46CysPro: 0.46 ± 0.169
0.614CysGln: 0.614 ± 0.265
1.278CysArg: 1.278 ± 0.288
0.716CysSer: 0.716 ± 0.185
0.511CysThr: 0.511 ± 0.146
0.869CysVal: 0.869 ± 0.232
0.205CysTrp: 0.205 ± 0.093
0.153CysTyr: 0.153 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
5.931AspAla: 5.931 ± 0.592
1.023AspCys: 1.023 ± 0.265
3.681AspAsp: 3.681 ± 0.48
4.96AspGlu: 4.96 ± 0.594
2.199AspPhe: 2.199 ± 0.301
7.056AspGly: 7.056 ± 0.669
1.278AspHis: 1.278 ± 0.302
2.608AspIle: 2.608 ± 0.393
2.352AspLys: 2.352 ± 0.357
4.704AspLeu: 4.704 ± 0.499
1.278AspMet: 1.278 ± 0.237
2.761AspAsn: 2.761 ± 0.307
2.71AspPro: 2.71 ± 0.343
1.329AspGln: 1.329 ± 0.29
2.966AspArg: 2.966 ± 0.449
3.17AspSer: 3.17 ± 0.417
3.323AspThr: 3.323 ± 0.413
3.784AspVal: 3.784 ± 0.416
1.636AspTrp: 1.636 ± 0.252
2.301AspTyr: 2.301 ± 0.367
0.0AspXaa: 0.0 ± 0.0
Glu
8.794GluAla: 8.794 ± 0.708
0.46GluCys: 0.46 ± 0.172
3.886GluAsp: 3.886 ± 0.43
4.551GluGlu: 4.551 ± 0.437
2.863GluPhe: 2.863 ± 0.331
4.193GluGly: 4.193 ± 0.449
0.869GluHis: 0.869 ± 0.2
3.068GluIle: 3.068 ± 0.425
3.681GluLys: 3.681 ± 0.522
5.113GluLeu: 5.113 ± 0.553
1.636GluMet: 1.636 ± 0.288
1.79GluAsn: 1.79 ± 0.299
2.352GluPro: 2.352 ± 0.403
2.352GluGln: 2.352 ± 0.284
5.573GluArg: 5.573 ± 0.656
2.812GluSer: 2.812 ± 0.478
3.375GluThr: 3.375 ± 0.385
3.477GluVal: 3.477 ± 0.446
1.176GluTrp: 1.176 ± 0.249
2.352GluTyr: 2.352 ± 0.339
0.0GluXaa: 0.0 ± 0.0
Phe
3.528PheAla: 3.528 ± 0.404
0.716PheCys: 0.716 ± 0.195
2.147PheAsp: 2.147 ± 0.299
2.096PheGlu: 2.096 ± 0.352
1.125PhePhe: 1.125 ± 0.228
3.579PheGly: 3.579 ± 0.404
0.562PheHis: 0.562 ± 0.168
1.432PheIle: 1.432 ± 0.243
1.432PheLys: 1.432 ± 0.291
2.608PheLeu: 2.608 ± 0.376
0.562PheMet: 0.562 ± 0.162
1.329PheAsn: 1.329 ± 0.247
1.227PhePro: 1.227 ± 0.271
0.92PheGln: 0.92 ± 0.277
2.556PheArg: 2.556 ± 0.349
2.25PheSer: 2.25 ± 0.362
1.79PheThr: 1.79 ± 0.264
2.761PheVal: 2.761 ± 0.455
0.818PheTrp: 0.818 ± 0.192
1.023PheTyr: 1.023 ± 0.225
0.0PheXaa: 0.0 ± 0.0
Gly
8.334GlyAla: 8.334 ± 0.973
0.818GlyCys: 0.818 ± 0.236
5.369GlyAsp: 5.369 ± 0.491
5.675GlyGlu: 5.675 ± 0.643
2.914GlyPhe: 2.914 ± 0.472
8.539GlyGly: 8.539 ± 1.068
1.585GlyHis: 1.585 ± 0.31
3.732GlyIle: 3.732 ± 0.466
4.704GlyLys: 4.704 ± 0.528
5.829GlyLeu: 5.829 ± 0.76
1.738GlyMet: 1.738 ± 0.296
2.659GlyAsn: 2.659 ± 0.461
2.556GlyPro: 2.556 ± 0.406
3.221GlyGln: 3.221 ± 0.405
5.88GlyArg: 5.88 ± 0.528
4.96GlySer: 4.96 ± 0.581
4.908GlyThr: 4.908 ± 0.524
5.931GlyVal: 5.931 ± 0.466
1.176GlyTrp: 1.176 ± 0.254
2.301GlyTyr: 2.301 ± 0.318
0.0GlyXaa: 0.0 ± 0.0
His
2.147HisAla: 2.147 ± 0.286
0.511HisCys: 0.511 ± 0.219
1.329HisAsp: 1.329 ± 0.314
1.176HisGlu: 1.176 ± 0.267
0.511HisPhe: 0.511 ± 0.144
1.994HisGly: 1.994 ± 0.348
1.074HisHis: 1.074 ± 0.25
1.329HisIle: 1.329 ± 0.273
0.716HisLys: 0.716 ± 0.163
2.096HisLeu: 2.096 ± 0.429
0.307HisMet: 0.307 ± 0.117
0.562HisAsn: 0.562 ± 0.157
1.023HisPro: 1.023 ± 0.267
0.665HisGln: 0.665 ± 0.156
1.227HisArg: 1.227 ± 0.252
1.687HisSer: 1.687 ± 0.26
0.716HisThr: 0.716 ± 0.18
1.636HisVal: 1.636 ± 0.319
0.358HisTrp: 0.358 ± 0.145
0.869HisTyr: 0.869 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
6.34IleAla: 6.34 ± 0.539
0.256IleCys: 0.256 ± 0.121
4.09IleAsp: 4.09 ± 0.411
4.602IleGlu: 4.602 ± 0.499
1.534IlePhe: 1.534 ± 0.276
3.528IleGly: 3.528 ± 0.38
0.716IleHis: 0.716 ± 0.227
2.454IleIle: 2.454 ± 0.352
2.352IleLys: 2.352 ± 0.349
2.25IleLeu: 2.25 ± 0.379
0.818IleMet: 0.818 ± 0.183
1.125IleAsn: 1.125 ± 0.211
1.943IlePro: 1.943 ± 0.301
1.687IleGln: 1.687 ± 0.315
3.272IleArg: 3.272 ± 0.31
2.505IleSer: 2.505 ± 0.323
2.71IleThr: 2.71 ± 0.41
3.17IleVal: 3.17 ± 0.42
0.869IleTrp: 0.869 ± 0.232
0.869IleTyr: 0.869 ± 0.204
0.0IleXaa: 0.0 ± 0.0
Lys
5.675LysAla: 5.675 ± 0.612
0.665LysCys: 0.665 ± 0.197
2.659LysAsp: 2.659 ± 0.46
3.426LysGlu: 3.426 ± 0.421
1.585LysPhe: 1.585 ± 0.258
3.272LysGly: 3.272 ± 0.376
1.534LysHis: 1.534 ± 0.311
1.892LysIle: 1.892 ± 0.33
2.608LysLys: 2.608 ± 0.487
4.295LysLeu: 4.295 ± 0.443
1.074LysMet: 1.074 ± 0.246
1.483LysAsn: 1.483 ± 0.234
2.659LysPro: 2.659 ± 0.466
1.483LysGln: 1.483 ± 0.238
4.346LysArg: 4.346 ± 0.586
3.119LysSer: 3.119 ± 0.342
3.017LysThr: 3.017 ± 0.325
2.71LysVal: 2.71 ± 0.42
0.818LysTrp: 0.818 ± 0.196
1.278LysTyr: 1.278 ± 0.289
0.0LysXaa: 0.0 ± 0.0
Leu
10.379LeuAla: 10.379 ± 0.833
1.329LeuCys: 1.329 ± 0.317
4.551LeuAsp: 4.551 ± 0.503
4.755LeuGlu: 4.755 ± 0.599
1.994LeuPhe: 1.994 ± 0.331
6.596LeuGly: 6.596 ± 0.843
1.79LeuHis: 1.79 ± 0.329
2.812LeuIle: 2.812 ± 0.361
2.863LeuLys: 2.863 ± 0.392
6.033LeuLeu: 6.033 ± 0.587
1.585LeuMet: 1.585 ± 0.263
1.943LeuAsn: 1.943 ± 0.386
4.142LeuPro: 4.142 ± 0.442
2.608LeuGln: 2.608 ± 0.388
6.033LeuArg: 6.033 ± 0.606
4.448LeuSer: 4.448 ± 0.491
3.988LeuThr: 3.988 ± 0.459
5.471LeuVal: 5.471 ± 0.485
1.278LeuTrp: 1.278 ± 0.269
1.738LeuTyr: 1.738 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
3.477MetAla: 3.477 ± 0.337
0.205MetCys: 0.205 ± 0.098
1.074MetAsp: 1.074 ± 0.265
1.176MetGlu: 1.176 ± 0.252
0.716MetPhe: 0.716 ± 0.194
1.381MetGly: 1.381 ± 0.292
0.511MetHis: 0.511 ± 0.193
1.176MetIle: 1.176 ± 0.271
0.716MetLys: 0.716 ± 0.184
2.25MetLeu: 2.25 ± 0.404
0.665MetMet: 0.665 ± 0.176
0.767MetAsn: 0.767 ± 0.188
1.483MetPro: 1.483 ± 0.241
0.562MetGln: 0.562 ± 0.173
1.892MetArg: 1.892 ± 0.292
1.585MetSer: 1.585 ± 0.286
1.636MetThr: 1.636 ± 0.261
1.534MetVal: 1.534 ± 0.3
0.205MetTrp: 0.205 ± 0.093
0.511MetTyr: 0.511 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
3.477AsnAla: 3.477 ± 0.41
0.358AsnCys: 0.358 ± 0.154
2.863AsnAsp: 2.863 ± 0.31
1.636AsnGlu: 1.636 ± 0.282
1.074AsnPhe: 1.074 ± 0.207
2.556AsnGly: 2.556 ± 0.393
0.716AsnHis: 0.716 ± 0.166
1.278AsnIle: 1.278 ± 0.277
1.381AsnLys: 1.381 ± 0.294
2.045AsnLeu: 2.045 ± 0.323
0.767AsnMet: 0.767 ± 0.186
1.074AsnAsn: 1.074 ± 0.237
2.096AsnPro: 2.096 ± 0.29
0.92AsnGln: 0.92 ± 0.188
2.301AsnArg: 2.301 ± 0.272
1.381AsnSer: 1.381 ± 0.302
1.329AsnThr: 1.329 ± 0.318
2.812AsnVal: 2.812 ± 0.348
0.767AsnTrp: 0.767 ± 0.211
1.074AsnTyr: 1.074 ± 0.211
0.0AsnXaa: 0.0 ± 0.0
Pro
4.96ProAla: 4.96 ± 0.59
0.46ProCys: 0.46 ± 0.156
3.068ProAsp: 3.068 ± 0.364
2.505ProGlu: 2.505 ± 0.375
1.738ProPhe: 1.738 ± 0.271
3.323ProGly: 3.323 ± 0.371
0.818ProHis: 0.818 ± 0.226
2.25ProIle: 2.25 ± 0.306
2.556ProLys: 2.556 ± 0.348
3.732ProLeu: 3.732 ± 0.41
1.023ProMet: 1.023 ± 0.22
1.585ProAsn: 1.585 ± 0.276
3.119ProPro: 3.119 ± 0.48
1.227ProGln: 1.227 ± 0.3
2.966ProArg: 2.966 ± 0.409
2.812ProSer: 2.812 ± 0.382
2.659ProThr: 2.659 ± 0.314
3.426ProVal: 3.426 ± 0.421
0.716ProTrp: 0.716 ± 0.216
1.483ProTyr: 1.483 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
3.528GlnAla: 3.528 ± 0.513
0.511GlnCys: 0.511 ± 0.152
1.329GlnAsp: 1.329 ± 0.297
1.892GlnGlu: 1.892 ± 0.353
1.074GlnPhe: 1.074 ± 0.275
2.199GlnGly: 2.199 ± 0.315
0.869GlnHis: 0.869 ± 0.204
1.79GlnIle: 1.79 ± 0.316
2.096GlnLys: 2.096 ± 0.377
2.454GlnLeu: 2.454 ± 0.377
0.869GlnMet: 0.869 ± 0.198
1.636GlnAsn: 1.636 ± 0.278
1.943GlnPro: 1.943 ± 0.288
1.943GlnGln: 1.943 ± 0.482
2.352GlnArg: 2.352 ± 0.338
2.096GlnSer: 2.096 ± 0.38
1.943GlnThr: 1.943 ± 0.35
2.045GlnVal: 2.045 ± 0.343
0.46GlnTrp: 0.46 ± 0.145
0.511GlnTyr: 0.511 ± 0.168
0.0GlnXaa: 0.0 ± 0.0
Arg
7.414ArgAla: 7.414 ± 0.627
0.92ArgCys: 0.92 ± 0.279
3.017ArgAsp: 3.017 ± 0.378
3.835ArgGlu: 3.835 ± 0.47
2.659ArgPhe: 2.659 ± 0.295
5.573ArgGly: 5.573 ± 0.493
2.454ArgHis: 2.454 ± 0.401
4.09ArgIle: 4.09 ± 0.521
4.09ArgLys: 4.09 ± 0.506
6.698ArgLeu: 6.698 ± 0.734
1.534ArgMet: 1.534 ± 0.323
1.892ArgAsn: 1.892 ± 0.32
2.812ArgPro: 2.812 ± 0.413
3.068ArgGln: 3.068 ± 0.427
6.289ArgArg: 6.289 ± 0.735
3.681ArgSer: 3.681 ± 0.425
2.761ArgThr: 2.761 ± 0.376
4.499ArgVal: 4.499 ± 0.519
1.176ArgTrp: 1.176 ± 0.265
2.045ArgTyr: 2.045 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
6.033SerAla: 6.033 ± 0.58
0.665SerCys: 0.665 ± 0.221
3.272SerAsp: 3.272 ± 0.377
3.579SerGlu: 3.579 ± 0.371
2.147SerPhe: 2.147 ± 0.321
6.187SerGly: 6.187 ± 0.518
0.971SerHis: 0.971 ± 0.218
2.608SerIle: 2.608 ± 0.355
2.403SerLys: 2.403 ± 0.373
5.471SerLeu: 5.471 ± 0.585
0.971SerMet: 0.971 ± 0.229
2.199SerAsn: 2.199 ± 0.368
2.914SerPro: 2.914 ± 0.323
1.943SerGln: 1.943 ± 0.31
3.886SerArg: 3.886 ± 0.506
4.295SerSer: 4.295 ± 0.472
3.528SerThr: 3.528 ± 0.317
4.193SerVal: 4.193 ± 0.463
0.716SerTrp: 0.716 ± 0.213
1.432SerTyr: 1.432 ± 0.272
0.0SerXaa: 0.0 ± 0.0
Thr
6.596ThrAla: 6.596 ± 0.727
0.818ThrCys: 0.818 ± 0.194
3.426ThrAsp: 3.426 ± 0.422
3.17ThrGlu: 3.17 ± 0.441
2.045ThrPhe: 2.045 ± 0.339
4.142ThrGly: 4.142 ± 0.373
0.562ThrHis: 0.562 ± 0.187
2.966ThrIle: 2.966 ± 0.441
3.068ThrLys: 3.068 ± 0.392
3.988ThrLeu: 3.988 ± 0.49
1.023ThrMet: 1.023 ± 0.232
1.329ThrAsn: 1.329 ± 0.318
2.812ThrPro: 2.812 ± 0.311
1.841ThrGln: 1.841 ± 0.26
2.659ThrArg: 2.659 ± 0.352
3.17ThrSer: 3.17 ± 0.369
2.914ThrThr: 2.914 ± 0.535
4.602ThrVal: 4.602 ± 0.488
0.818ThrTrp: 0.818 ± 0.189
1.381ThrTyr: 1.381 ± 0.265
0.0ThrXaa: 0.0 ± 0.0
Val
7.26ValAla: 7.26 ± 0.647
0.767ValCys: 0.767 ± 0.171
4.551ValAsp: 4.551 ± 0.429
4.295ValGlu: 4.295 ± 0.433
2.659ValPhe: 2.659 ± 0.419
5.624ValGly: 5.624 ± 0.504
1.534ValHis: 1.534 ± 0.309
3.119ValIle: 3.119 ± 0.371
3.426ValLys: 3.426 ± 0.462
4.142ValLeu: 4.142 ± 0.517
1.79ValMet: 1.79 ± 0.333
1.738ValAsn: 1.738 ± 0.245
4.09ValPro: 4.09 ± 0.393
1.841ValGln: 1.841 ± 0.353
4.244ValArg: 4.244 ± 0.605
5.266ValSer: 5.266 ± 0.628
4.448ValThr: 4.448 ± 0.471
5.982ValVal: 5.982 ± 0.711
1.381ValTrp: 1.381 ± 0.208
1.534ValTyr: 1.534 ± 0.287
0.0ValXaa: 0.0 ± 0.0
Trp
1.585TrpAla: 1.585 ± 0.305
0.153TrpCys: 0.153 ± 0.096
0.92TrpAsp: 0.92 ± 0.199
0.614TrpGlu: 0.614 ± 0.171
0.92TrpPhe: 0.92 ± 0.276
0.767TrpGly: 0.767 ± 0.166
0.767TrpHis: 0.767 ± 0.208
0.869TrpIle: 0.869 ± 0.219
0.665TrpLys: 0.665 ± 0.192
2.352TrpLeu: 2.352 ± 0.385
0.818TrpMet: 0.818 ± 0.182
0.46TrpAsn: 0.46 ± 0.164
0.869TrpPro: 0.869 ± 0.216
0.869TrpGln: 0.869 ± 0.227
1.432TrpArg: 1.432 ± 0.26
1.278TrpSer: 1.278 ± 0.325
0.869TrpThr: 0.869 ± 0.231
1.023TrpVal: 1.023 ± 0.275
0.511TrpTrp: 0.511 ± 0.162
0.358TrpTyr: 0.358 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.812TyrAla: 2.812 ± 0.348
0.307TyrCys: 0.307 ± 0.115
1.79TyrAsp: 1.79 ± 0.259
1.534TyrGlu: 1.534 ± 0.312
1.125TyrPhe: 1.125 ± 0.221
2.25TyrGly: 2.25 ± 0.361
0.46TyrHis: 0.46 ± 0.128
1.483TyrIle: 1.483 ± 0.34
1.432TyrLys: 1.432 ± 0.263
1.79TyrLeu: 1.79 ± 0.275
0.358TyrMet: 0.358 ± 0.121
1.176TyrAsn: 1.176 ± 0.255
0.971TyrPro: 0.971 ± 0.223
0.818TyrGln: 0.818 ± 0.183
2.199TyrArg: 2.199 ± 0.329
1.483TyrSer: 1.483 ± 0.241
1.483TyrThr: 1.483 ± 0.297
2.25TyrVal: 2.25 ± 0.368
0.665TyrTrp: 0.665 ± 0.219
0.358TyrTyr: 0.358 ± 0.159
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 110 proteins (19559 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski