Amino acid dipepetide frequency for Shewanella phage SppYZU05

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.672AlaAla: 11.672 ± 1.682
1.297AlaCys: 1.297 ± 0.309
4.694AlaAsp: 4.694 ± 0.407
5.991AlaGlu: 5.991 ± 0.966
2.717AlaPhe: 2.717 ± 0.393
6.979AlaGly: 6.979 ± 0.696
1.606AlaHis: 1.606 ± 0.271
4.817AlaIle: 4.817 ± 0.465
5.497AlaLys: 5.497 ± 0.646
8.276AlaLeu: 8.276 ± 0.636
2.223AlaMet: 2.223 ± 0.403
3.582AlaAsn: 3.582 ± 0.476
4.447AlaPro: 4.447 ± 0.666
4.57AlaGln: 4.57 ± 0.458
6.423AlaArg: 6.423 ± 0.623
5.435AlaSer: 5.435 ± 0.754
7.473AlaThr: 7.473 ± 0.8
6.114AlaVal: 6.114 ± 0.554
1.05AlaTrp: 1.05 ± 0.221
3.088AlaTyr: 3.088 ± 0.446
0.0AlaXaa: 0.0 ± 0.0
Cys
0.803CysAla: 0.803 ± 0.195
0.185CysCys: 0.185 ± 0.101
0.494CysAsp: 0.494 ± 0.177
0.741CysGlu: 0.741 ± 0.252
0.741CysPhe: 0.741 ± 0.195
1.297CysGly: 1.297 ± 0.296
0.309CysHis: 0.309 ± 0.141
0.556CysIle: 0.556 ± 0.171
0.679CysLys: 0.679 ± 0.175
1.112CysLeu: 1.112 ± 0.217
0.247CysMet: 0.247 ± 0.133
0.741CysAsn: 0.741 ± 0.222
0.062CysPro: 0.062 ± 0.074
0.432CysGln: 0.432 ± 0.172
0.741CysArg: 0.741 ± 0.213
0.556CysSer: 0.556 ± 0.193
0.803CysThr: 0.803 ± 0.205
0.926CysVal: 0.926 ± 0.246
0.185CysTrp: 0.185 ± 0.096
0.371CysTyr: 0.371 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
5.929AspAla: 5.929 ± 0.591
0.432AspCys: 0.432 ± 0.143
3.15AspAsp: 3.15 ± 0.412
4.508AspGlu: 4.508 ± 0.505
2.47AspPhe: 2.47 ± 0.425
4.632AspGly: 4.632 ± 0.563
1.235AspHis: 1.235 ± 0.31
3.582AspIle: 3.582 ± 0.55
2.779AspLys: 2.779 ± 0.486
6.238AspLeu: 6.238 ± 0.724
1.482AspMet: 1.482 ± 0.339
2.656AspAsn: 2.656 ± 0.496
3.335AspPro: 3.335 ± 0.56
1.173AspGln: 1.173 ± 0.244
2.779AspArg: 2.779 ± 0.379
2.47AspSer: 2.47 ± 0.384
3.088AspThr: 3.088 ± 0.429
4.138AspVal: 4.138 ± 0.516
1.359AspTrp: 1.359 ± 0.34
1.359AspTyr: 1.359 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
6.114GluAla: 6.114 ± 0.732
0.247GluCys: 0.247 ± 0.113
3.026GluAsp: 3.026 ± 0.463
4.385GluGlu: 4.385 ± 0.627
1.853GluPhe: 1.853 ± 0.293
3.644GluGly: 3.644 ± 0.671
1.235GluHis: 1.235 ± 0.297
4.2GluIle: 4.2 ± 0.389
2.841GluLys: 2.841 ± 0.528
5.805GluLeu: 5.805 ± 0.562
2.162GluMet: 2.162 ± 0.35
2.223GluAsn: 2.223 ± 0.3
2.656GluPro: 2.656 ± 0.454
2.594GluGln: 2.594 ± 0.43
2.964GluArg: 2.964 ± 0.348
3.767GluSer: 3.767 ± 0.41
3.582GluThr: 3.582 ± 0.407
4.817GluVal: 4.817 ± 0.632
1.173GluTrp: 1.173 ± 0.266
2.717GluTyr: 2.717 ± 0.356
0.0GluXaa: 0.0 ± 0.0
Phe
2.409PheAla: 2.409 ± 0.369
0.926PheCys: 0.926 ± 0.228
2.656PheAsp: 2.656 ± 0.459
2.409PheGlu: 2.409 ± 0.415
1.173PhePhe: 1.173 ± 0.261
2.841PheGly: 2.841 ± 0.418
1.05PheHis: 1.05 ± 0.237
1.544PheIle: 1.544 ± 0.36
1.544PheLys: 1.544 ± 0.366
2.532PheLeu: 2.532 ± 0.306
1.544PheMet: 1.544 ± 0.346
1.976PheAsn: 1.976 ± 0.389
1.235PhePro: 1.235 ± 0.28
1.173PheGln: 1.173 ± 0.246
1.235PheArg: 1.235 ± 0.315
1.915PheSer: 1.915 ± 0.279
2.347PheThr: 2.347 ± 0.354
2.162PheVal: 2.162 ± 0.426
0.556PheTrp: 0.556 ± 0.171
0.679PheTyr: 0.679 ± 0.21
0.0PheXaa: 0.0 ± 0.0
Gly
5.373GlyAla: 5.373 ± 0.684
1.235GlyCys: 1.235 ± 0.334
4.385GlyAsp: 4.385 ± 0.524
4.014GlyGlu: 4.014 ± 0.446
2.903GlyPhe: 2.903 ± 0.401
4.941GlyGly: 4.941 ± 0.68
1.173GlyHis: 1.173 ± 0.291
4.076GlyIle: 4.076 ± 0.499
3.829GlyLys: 3.829 ± 0.559
6.176GlyLeu: 6.176 ± 0.64
1.791GlyMet: 1.791 ± 0.302
3.088GlyAsn: 3.088 ± 0.717
1.112GlyPro: 1.112 ± 0.258
2.903GlyGln: 2.903 ± 0.41
3.767GlyArg: 3.767 ± 0.436
3.335GlySer: 3.335 ± 0.481
4.57GlyThr: 4.57 ± 0.64
5.867GlyVal: 5.867 ± 0.769
0.926GlyTrp: 0.926 ± 0.258
2.656GlyTyr: 2.656 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
2.1HisAla: 2.1 ± 0.325
0.432HisCys: 0.432 ± 0.133
1.729HisAsp: 1.729 ± 0.316
1.112HisGlu: 1.112 ± 0.259
0.618HisPhe: 0.618 ± 0.179
1.915HisGly: 1.915 ± 0.344
0.309HisHis: 0.309 ± 0.155
1.235HisIle: 1.235 ± 0.255
0.988HisLys: 0.988 ± 0.248
1.544HisLeu: 1.544 ± 0.296
0.679HisMet: 0.679 ± 0.212
0.988HisAsn: 0.988 ± 0.227
0.865HisPro: 0.865 ± 0.23
0.926HisGln: 0.926 ± 0.248
0.926HisArg: 0.926 ± 0.202
1.235HisSer: 1.235 ± 0.223
1.173HisThr: 1.173 ± 0.268
1.482HisVal: 1.482 ± 0.271
0.247HisTrp: 0.247 ± 0.12
0.865HisTyr: 0.865 ± 0.202
0.0HisXaa: 0.0 ± 0.0
Ile
6.176IleAla: 6.176 ± 0.602
0.679IleCys: 0.679 ± 0.2
4.57IleAsp: 4.57 ± 0.353
2.656IleGlu: 2.656 ± 0.369
1.359IlePhe: 1.359 ± 0.316
3.15IleGly: 3.15 ± 0.539
1.112IleHis: 1.112 ± 0.299
2.717IleIle: 2.717 ± 0.495
2.47IleLys: 2.47 ± 0.518
3.088IleLeu: 3.088 ± 0.395
0.926IleMet: 0.926 ± 0.254
2.47IleAsn: 2.47 ± 0.357
2.964IlePro: 2.964 ± 0.525
2.47IleGln: 2.47 ± 0.455
3.15IleArg: 3.15 ± 0.394
3.273IleSer: 3.273 ± 0.416
4.138IleThr: 4.138 ± 0.511
3.953IleVal: 3.953 ± 0.509
0.741IleTrp: 0.741 ± 0.242
0.926IleTyr: 0.926 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
4.385LysAla: 4.385 ± 0.622
0.556LysCys: 0.556 ± 0.231
3.273LysAsp: 3.273 ± 0.595
3.088LysGlu: 3.088 ± 0.444
2.1LysPhe: 2.1 ± 0.395
2.47LysGly: 2.47 ± 0.397
1.482LysHis: 1.482 ± 0.279
2.223LysIle: 2.223 ± 0.415
2.779LysLys: 2.779 ± 0.381
4.879LysLeu: 4.879 ± 0.569
1.606LysMet: 1.606 ± 0.323
1.297LysAsn: 1.297 ± 0.246
2.409LysPro: 2.409 ± 0.339
2.223LysGln: 2.223 ± 0.357
3.52LysArg: 3.52 ± 0.515
2.903LysSer: 2.903 ± 0.403
3.211LysThr: 3.211 ± 0.556
3.644LysVal: 3.644 ± 0.552
0.618LysTrp: 0.618 ± 0.307
1.544LysTyr: 1.544 ± 0.303
0.0LysXaa: 0.0 ± 0.0
Leu
8.09LeuAla: 8.09 ± 0.739
1.112LeuCys: 1.112 ± 0.27
5.064LeuAsp: 5.064 ± 0.497
6.423LeuGlu: 6.423 ± 0.697
2.409LeuPhe: 2.409 ± 0.391
4.941LeuGly: 4.941 ± 0.508
1.853LeuHis: 1.853 ± 0.386
3.458LeuIle: 3.458 ± 0.454
3.891LeuLys: 3.891 ± 0.514
5.682LeuLeu: 5.682 ± 0.692
2.594LeuMet: 2.594 ± 0.327
4.076LeuAsn: 4.076 ± 0.443
4.694LeuPro: 4.694 ± 0.389
3.767LeuGln: 3.767 ± 0.503
6.361LeuArg: 6.361 ± 0.53
5.126LeuSer: 5.126 ± 0.64
5.188LeuThr: 5.188 ± 0.802
5.929LeuVal: 5.929 ± 0.53
1.112LeuTrp: 1.112 ± 0.322
1.976LeuTyr: 1.976 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
2.47MetAla: 2.47 ± 0.383
0.247MetCys: 0.247 ± 0.114
0.803MetAsp: 0.803 ± 0.243
1.482MetGlu: 1.482 ± 0.276
0.988MetPhe: 0.988 ± 0.254
1.359MetGly: 1.359 ± 0.294
0.494MetHis: 0.494 ± 0.173
1.235MetIle: 1.235 ± 0.253
1.544MetLys: 1.544 ± 0.308
2.223MetLeu: 2.223 ± 0.322
1.05MetMet: 1.05 ± 0.199
1.235MetAsn: 1.235 ± 0.239
1.297MetPro: 1.297 ± 0.26
1.606MetGln: 1.606 ± 0.308
1.853MetArg: 1.853 ± 0.326
2.223MetSer: 2.223 ± 0.328
1.853MetThr: 1.853 ± 0.289
1.482MetVal: 1.482 ± 0.33
0.309MetTrp: 0.309 ± 0.128
1.297MetTyr: 1.297 ± 0.239
0.0MetXaa: 0.0 ± 0.0
Asn
4.508AsnAla: 4.508 ± 0.561
0.618AsnCys: 0.618 ± 0.163
2.347AsnAsp: 2.347 ± 0.304
1.544AsnGlu: 1.544 ± 0.373
1.482AsnPhe: 1.482 ± 0.266
3.953AsnGly: 3.953 ± 0.568
0.988AsnHis: 0.988 ± 0.26
2.409AsnIle: 2.409 ± 0.403
1.791AsnLys: 1.791 ± 0.424
3.458AsnLeu: 3.458 ± 0.567
0.865AsnMet: 0.865 ± 0.229
1.482AsnAsn: 1.482 ± 0.279
2.285AsnPro: 2.285 ± 0.495
1.853AsnGln: 1.853 ± 0.277
2.223AsnArg: 2.223 ± 0.262
1.915AsnSer: 1.915 ± 0.319
2.841AsnThr: 2.841 ± 0.45
3.52AsnVal: 3.52 ± 0.425
0.618AsnTrp: 0.618 ± 0.25
1.606AsnTyr: 1.606 ± 0.276
0.0AsnXaa: 0.0 ± 0.0
Pro
5.929ProAla: 5.929 ± 0.891
0.247ProCys: 0.247 ± 0.122
3.458ProAsp: 3.458 ± 0.414
3.52ProGlu: 3.52 ± 0.461
1.359ProPhe: 1.359 ± 0.298
1.359ProGly: 1.359 ± 0.335
0.926ProHis: 0.926 ± 0.236
2.162ProIle: 2.162 ± 0.33
2.409ProLys: 2.409 ± 0.377
4.014ProLeu: 4.014 ± 0.497
0.988ProMet: 0.988 ± 0.215
1.729ProAsn: 1.729 ± 0.257
1.297ProPro: 1.297 ± 0.331
1.173ProGln: 1.173 ± 0.318
1.915ProArg: 1.915 ± 0.293
2.964ProSer: 2.964 ± 0.381
3.458ProThr: 3.458 ± 0.439
3.706ProVal: 3.706 ± 0.496
0.309ProTrp: 0.309 ± 0.126
1.667ProTyr: 1.667 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
4.138GlnAla: 4.138 ± 0.533
0.371GlnCys: 0.371 ± 0.156
2.038GlnAsp: 2.038 ± 0.329
1.791GlnGlu: 1.791 ± 0.38
2.1GlnPhe: 2.1 ± 0.376
2.594GlnGly: 2.594 ± 0.552
0.865GlnHis: 0.865 ± 0.195
2.038GlnIle: 2.038 ± 0.342
1.544GlnLys: 1.544 ± 0.359
3.706GlnLeu: 3.706 ± 0.496
0.926GlnMet: 0.926 ± 0.229
1.976GlnAsn: 1.976 ± 0.348
1.976GlnPro: 1.976 ± 0.376
2.1GlnGln: 2.1 ± 0.435
3.211GlnArg: 3.211 ± 0.353
2.038GlnSer: 2.038 ± 0.39
1.915GlnThr: 1.915 ± 0.345
3.15GlnVal: 3.15 ± 0.469
0.988GlnTrp: 0.988 ± 0.229
2.038GlnTyr: 2.038 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
5.188ArgAla: 5.188 ± 0.6
0.803ArgCys: 0.803 ± 0.27
4.138ArgAsp: 4.138 ± 0.553
3.582ArgGlu: 3.582 ± 0.545
1.853ArgPhe: 1.853 ± 0.452
4.261ArgGly: 4.261 ± 0.414
1.235ArgHis: 1.235 ± 0.326
4.447ArgIle: 4.447 ± 0.47
3.026ArgLys: 3.026 ± 0.355
4.138ArgLeu: 4.138 ± 0.527
1.853ArgMet: 1.853 ± 0.308
2.47ArgAsn: 2.47 ± 0.381
1.667ArgPro: 1.667 ± 0.3
2.285ArgGln: 2.285 ± 0.371
3.088ArgArg: 3.088 ± 0.433
3.52ArgSer: 3.52 ± 0.381
2.717ArgThr: 2.717 ± 0.295
4.57ArgVal: 4.57 ± 0.53
1.297ArgTrp: 1.297 ± 0.295
1.976ArgTyr: 1.976 ± 0.352
0.0ArgXaa: 0.0 ± 0.0
Ser
5.558SerAla: 5.558 ± 0.745
0.865SerCys: 0.865 ± 0.252
3.582SerAsp: 3.582 ± 0.648
3.026SerGlu: 3.026 ± 0.413
1.976SerPhe: 1.976 ± 0.386
4.755SerGly: 4.755 ± 0.576
1.173SerHis: 1.173 ± 0.312
2.594SerIle: 2.594 ± 0.343
3.088SerLys: 3.088 ± 0.513
5.744SerLeu: 5.744 ± 0.616
1.235SerMet: 1.235 ± 0.247
2.409SerAsn: 2.409 ± 0.405
2.47SerPro: 2.47 ± 0.399
2.223SerGln: 2.223 ± 0.314
2.841SerArg: 2.841 ± 0.339
3.15SerSer: 3.15 ± 0.456
4.941SerThr: 4.941 ± 0.591
3.767SerVal: 3.767 ± 0.477
0.926SerTrp: 0.926 ± 0.214
1.915SerTyr: 1.915 ± 0.294
0.0SerXaa: 0.0 ± 0.0
Thr
5.867ThrAla: 5.867 ± 0.903
0.556ThrCys: 0.556 ± 0.215
2.594ThrAsp: 2.594 ± 0.43
3.891ThrGlu: 3.891 ± 0.498
2.162ThrPhe: 2.162 ± 0.357
5.373ThrGly: 5.373 ± 0.729
1.297ThrHis: 1.297 ± 0.288
4.385ThrIle: 4.385 ± 0.577
3.273ThrLys: 3.273 ± 0.389
4.694ThrLeu: 4.694 ± 0.498
1.42ThrMet: 1.42 ± 0.315
3.273ThrAsn: 3.273 ± 0.387
5.002ThrPro: 5.002 ± 0.553
2.1ThrGln: 2.1 ± 0.301
3.767ThrArg: 3.767 ± 0.402
3.767ThrSer: 3.767 ± 0.539
3.953ThrThr: 3.953 ± 0.586
5.126ThrVal: 5.126 ± 0.511
0.556ThrTrp: 0.556 ± 0.184
2.223ThrTyr: 2.223 ± 0.413
0.0ThrXaa: 0.0 ± 0.0
Val
7.288ValAla: 7.288 ± 0.766
0.679ValCys: 0.679 ± 0.201
3.953ValAsp: 3.953 ± 0.38
4.632ValGlu: 4.632 ± 0.561
2.285ValPhe: 2.285 ± 0.338
4.385ValGly: 4.385 ± 0.689
1.297ValHis: 1.297 ± 0.299
3.273ValIle: 3.273 ± 0.389
4.138ValLys: 4.138 ± 0.432
6.299ValLeu: 6.299 ± 0.712
1.976ValMet: 1.976 ± 0.284
2.47ValAsn: 2.47 ± 0.371
2.779ValPro: 2.779 ± 0.458
3.644ValGln: 3.644 ± 0.551
4.755ValArg: 4.755 ± 0.516
4.694ValSer: 4.694 ± 0.532
5.188ValThr: 5.188 ± 0.586
6.361ValVal: 6.361 ± 0.781
1.235ValTrp: 1.235 ± 0.269
2.285ValTyr: 2.285 ± 0.353
0.0ValXaa: 0.0 ± 0.0
Trp
1.359TrpAla: 1.359 ± 0.245
0.185TrpCys: 0.185 ± 0.094
0.988TrpAsp: 0.988 ± 0.246
0.803TrpGlu: 0.803 ± 0.253
0.371TrpPhe: 0.371 ± 0.164
0.926TrpGly: 0.926 ± 0.252
0.371TrpHis: 0.371 ± 0.13
0.494TrpIle: 0.494 ± 0.166
0.618TrpLys: 0.618 ± 0.2
1.359TrpLeu: 1.359 ± 0.236
0.494TrpMet: 0.494 ± 0.195
0.494TrpAsn: 0.494 ± 0.186
0.556TrpPro: 0.556 ± 0.298
1.05TrpGln: 1.05 ± 0.201
0.803TrpArg: 0.803 ± 0.217
0.865TrpSer: 0.865 ± 0.271
0.371TrpThr: 0.371 ± 0.148
1.173TrpVal: 1.173 ± 0.29
0.185TrpTrp: 0.185 ± 0.097
1.173TrpTyr: 1.173 ± 0.218
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.594TyrAla: 2.594 ± 0.379
0.309TyrCys: 0.309 ± 0.15
1.853TyrAsp: 1.853 ± 0.334
2.409TyrGlu: 2.409 ± 0.432
0.988TyrPhe: 0.988 ± 0.232
2.347TyrGly: 2.347 ± 0.428
1.359TyrHis: 1.359 ± 0.288
1.544TyrIle: 1.544 ± 0.319
1.482TyrLys: 1.482 ± 0.334
2.841TyrLeu: 2.841 ± 0.463
0.926TyrMet: 0.926 ± 0.219
1.667TyrAsn: 1.667 ± 0.329
1.482TyrPro: 1.482 ± 0.26
1.235TyrGln: 1.235 ± 0.257
1.791TyrArg: 1.791 ± 0.36
3.088TyrSer: 3.088 ± 0.448
2.47TyrThr: 2.47 ± 0.383
1.729TyrVal: 1.729 ± 0.262
0.247TyrTrp: 0.247 ± 0.139
1.235TyrTyr: 1.235 ± 0.377
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (16193 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski