Amino acid dipepetide frequency for Arthrobacter phage Herb

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.228AlaAla: 12.228 ± 1.232
0.715AlaCys: 0.715 ± 0.3
5.435AlaAsp: 5.435 ± 0.53
6.865AlaGlu: 6.865 ± 0.753
2.574AlaPhe: 2.574 ± 0.461
8.367AlaGly: 8.367 ± 1.106
1.073AlaHis: 1.073 ± 0.282
5.363AlaIle: 5.363 ± 0.797
5.792AlaLys: 5.792 ± 0.852
8.867AlaLeu: 8.867 ± 1.305
2.574AlaMet: 2.574 ± 0.465
3.79AlaAsn: 3.79 ± 0.467
4.434AlaPro: 4.434 ± 0.556
4.434AlaGln: 4.434 ± 0.581
5.792AlaArg: 5.792 ± 0.765
6.364AlaSer: 6.364 ± 0.656
6.364AlaThr: 6.364 ± 0.712
5.935AlaVal: 5.935 ± 0.6
2.002AlaTrp: 2.002 ± 0.412
3.003AlaTyr: 3.003 ± 0.508
0.0AlaXaa: 0.0 ± 0.0
Cys
0.429CysAla: 0.429 ± 0.183
0.0CysCys: 0.0 ± 0.0
0.501CysAsp: 0.501 ± 0.195
0.572CysGlu: 0.572 ± 0.208
0.215CysPhe: 0.215 ± 0.125
1.216CysGly: 1.216 ± 0.375
0.215CysHis: 0.215 ± 0.13
0.429CysIle: 0.429 ± 0.16
0.286CysLys: 0.286 ± 0.147
0.358CysLeu: 0.358 ± 0.148
0.0CysMet: 0.0 ± 0.0
0.143CysAsn: 0.143 ± 0.104
0.358CysPro: 0.358 ± 0.146
0.501CysGln: 0.501 ± 0.184
0.572CysArg: 0.572 ± 0.24
0.358CysSer: 0.358 ± 0.15
0.501CysThr: 0.501 ± 0.209
0.215CysVal: 0.215 ± 0.116
0.358CysTrp: 0.358 ± 0.173
0.215CysTyr: 0.215 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
6.078AspAla: 6.078 ± 0.597
0.572AspCys: 0.572 ± 0.233
3.289AspAsp: 3.289 ± 0.441
4.934AspGlu: 4.934 ± 0.734
1.859AspPhe: 1.859 ± 0.417
5.292AspGly: 5.292 ± 0.582
1.073AspHis: 1.073 ± 0.356
2.217AspIle: 2.217 ± 0.373
2.002AspLys: 2.002 ± 0.354
5.649AspLeu: 5.649 ± 0.646
1.287AspMet: 1.287 ± 0.286
1.359AspAsn: 1.359 ± 0.359
3.576AspPro: 3.576 ± 0.437
2.217AspGln: 2.217 ± 0.588
3.719AspArg: 3.719 ± 0.545
3.218AspSer: 3.218 ± 0.419
3.361AspThr: 3.361 ± 0.627
2.646AspVal: 2.646 ± 0.439
1.144AspTrp: 1.144 ± 0.26
1.716AspTyr: 1.716 ± 0.334
0.0AspXaa: 0.0 ± 0.0
Glu
7.008GluAla: 7.008 ± 0.872
0.644GluCys: 0.644 ± 0.248
3.933GluAsp: 3.933 ± 0.706
4.076GluGlu: 4.076 ± 0.526
2.36GluPhe: 2.36 ± 0.499
6.436GluGly: 6.436 ± 0.57
1.287GluHis: 1.287 ± 0.334
2.431GluIle: 2.431 ± 0.426
3.075GluLys: 3.075 ± 0.551
5.935GluLeu: 5.935 ± 0.66
1.43GluMet: 1.43 ± 0.373
2.074GluAsn: 2.074 ± 0.278
2.503GluPro: 2.503 ± 0.388
3.218GluGln: 3.218 ± 0.576
4.219GluArg: 4.219 ± 0.59
2.789GluSer: 2.789 ± 0.418
3.504GluThr: 3.504 ± 0.495
4.291GluVal: 4.291 ± 0.632
1.43GluTrp: 1.43 ± 0.343
1.216GluTyr: 1.216 ± 0.225
0.0GluXaa: 0.0 ± 0.0
Phe
2.86PheAla: 2.86 ± 0.574
0.143PheCys: 0.143 ± 0.103
2.574PheAsp: 2.574 ± 0.434
2.074PheGlu: 2.074 ± 0.381
1.573PhePhe: 1.573 ± 0.539
3.146PheGly: 3.146 ± 0.555
0.644PheHis: 0.644 ± 0.228
1.001PheIle: 1.001 ± 0.354
1.502PheLys: 1.502 ± 0.294
1.716PheLeu: 1.716 ± 0.259
0.858PheMet: 0.858 ± 0.31
1.287PheAsn: 1.287 ± 0.327
0.572PhePro: 0.572 ± 0.209
1.359PheGln: 1.359 ± 0.379
2.217PheArg: 2.217 ± 0.362
1.859PheSer: 1.859 ± 0.558
2.002PheThr: 2.002 ± 0.475
1.788PheVal: 1.788 ± 0.422
0.215PheTrp: 0.215 ± 0.128
0.787PheTyr: 0.787 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
8.367GlyAla: 8.367 ± 0.946
0.644GlyCys: 0.644 ± 0.238
4.505GlyAsp: 4.505 ± 0.613
5.649GlyGlu: 5.649 ± 0.684
3.218GlyPhe: 3.218 ± 0.736
5.935GlyGly: 5.935 ± 0.55
2.145GlyHis: 2.145 ± 0.529
4.362GlyIle: 4.362 ± 0.554
3.576GlyLys: 3.576 ± 0.411
7.223GlyLeu: 7.223 ± 0.828
2.074GlyMet: 2.074 ± 0.367
4.148GlyAsn: 4.148 ± 0.836
4.148GlyPro: 4.148 ± 0.508
3.504GlyGln: 3.504 ± 0.453
4.934GlyArg: 4.934 ± 0.576
6.436GlySer: 6.436 ± 0.838
6.078GlyThr: 6.078 ± 0.999
7.509GlyVal: 7.509 ± 0.783
1.788GlyTrp: 1.788 ± 0.401
2.217GlyTyr: 2.217 ± 0.429
0.0GlyXaa: 0.0 ± 0.0
His
1.859HisAla: 1.859 ± 0.438
0.215HisCys: 0.215 ± 0.132
0.858HisAsp: 0.858 ± 0.238
0.787HisGlu: 0.787 ± 0.254
0.644HisPhe: 0.644 ± 0.18
1.931HisGly: 1.931 ± 0.395
0.501HisHis: 0.501 ± 0.194
1.359HisIle: 1.359 ± 0.307
0.858HisLys: 0.858 ± 0.233
1.788HisLeu: 1.788 ± 0.401
0.501HisMet: 0.501 ± 0.184
0.787HisAsn: 0.787 ± 0.239
0.715HisPro: 0.715 ± 0.246
0.572HisGln: 0.572 ± 0.213
0.644HisArg: 0.644 ± 0.216
1.073HisSer: 1.073 ± 0.257
1.144HisThr: 1.144 ± 0.318
0.787HisVal: 0.787 ± 0.229
0.429HisTrp: 0.429 ± 0.17
1.144HisTyr: 1.144 ± 0.331
0.0HisXaa: 0.0 ± 0.0
Ile
4.72IleAla: 4.72 ± 0.676
0.215IleCys: 0.215 ± 0.132
2.86IleAsp: 2.86 ± 0.474
2.789IleGlu: 2.789 ± 0.445
1.144IlePhe: 1.144 ± 0.347
3.862IleGly: 3.862 ± 0.642
0.858IleHis: 0.858 ± 0.26
2.002IleIle: 2.002 ± 0.592
2.646IleLys: 2.646 ± 0.518
3.647IleLeu: 3.647 ± 0.514
0.93IleMet: 0.93 ± 0.389
2.145IleAsn: 2.145 ± 0.318
1.931IlePro: 1.931 ± 0.405
2.789IleGln: 2.789 ± 0.345
3.504IleArg: 3.504 ± 0.512
3.719IleSer: 3.719 ± 0.663
3.003IleThr: 3.003 ± 0.499
3.289IleVal: 3.289 ± 0.556
0.644IleTrp: 0.644 ± 0.198
0.93IleTyr: 0.93 ± 0.214
0.0IleXaa: 0.0 ± 0.0
Lys
5.864LysAla: 5.864 ± 0.897
0.572LysCys: 0.572 ± 0.206
2.789LysAsp: 2.789 ± 0.488
2.646LysGlu: 2.646 ± 0.41
1.716LysPhe: 1.716 ± 0.38
4.076LysGly: 4.076 ± 0.594
1.359LysHis: 1.359 ± 0.374
2.646LysIle: 2.646 ± 0.408
2.074LysLys: 2.074 ± 0.434
4.72LysLeu: 4.72 ± 0.635
1.287LysMet: 1.287 ± 0.274
0.715LysAsn: 0.715 ± 0.222
2.789LysPro: 2.789 ± 0.475
1.573LysGln: 1.573 ± 0.32
2.574LysArg: 2.574 ± 0.462
2.717LysSer: 2.717 ± 0.517
2.932LysThr: 2.932 ± 0.517
3.933LysVal: 3.933 ± 0.638
0.501LysTrp: 0.501 ± 0.19
1.216LysTyr: 1.216 ± 0.23
0.0LysXaa: 0.0 ± 0.0
Leu
9.01LeuAla: 9.01 ± 0.962
0.644LeuCys: 0.644 ± 0.226
5.721LeuAsp: 5.721 ± 0.514
4.076LeuGlu: 4.076 ± 0.556
2.431LeuPhe: 2.431 ± 0.367
8.295LeuGly: 8.295 ± 1.128
1.502LeuHis: 1.502 ± 0.361
4.219LeuIle: 4.219 ± 0.502
2.932LeuLys: 2.932 ± 0.465
7.008LeuLeu: 7.008 ± 0.878
1.788LeuMet: 1.788 ± 0.438
3.647LeuAsn: 3.647 ± 0.522
5.077LeuPro: 5.077 ± 0.685
3.361LeuGln: 3.361 ± 0.363
5.864LeuArg: 5.864 ± 0.659
4.219LeuSer: 4.219 ± 0.557
5.22LeuThr: 5.22 ± 0.422
6.364LeuVal: 6.364 ± 0.616
1.502LeuTrp: 1.502 ± 0.373
2.002LeuTyr: 2.002 ± 0.308
0.0LeuXaa: 0.0 ± 0.0
Met
2.789MetAla: 2.789 ± 0.475
0.072MetCys: 0.072 ± 0.066
1.073MetAsp: 1.073 ± 0.353
1.716MetGlu: 1.716 ± 0.336
0.286MetPhe: 0.286 ± 0.153
1.502MetGly: 1.502 ± 0.323
0.286MetHis: 0.286 ± 0.137
1.788MetIle: 1.788 ± 0.387
1.216MetLys: 1.216 ± 0.289
1.216MetLeu: 1.216 ± 0.248
0.358MetMet: 0.358 ± 0.167
1.001MetAsn: 1.001 ± 0.234
1.788MetPro: 1.788 ± 0.404
0.501MetGln: 0.501 ± 0.257
0.644MetArg: 0.644 ± 0.208
1.788MetSer: 1.788 ± 0.321
1.573MetThr: 1.573 ± 0.358
2.503MetVal: 2.503 ± 0.407
0.358MetTrp: 0.358 ± 0.181
0.501MetTyr: 0.501 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
2.86AsnAla: 2.86 ± 0.351
0.358AsnCys: 0.358 ± 0.157
1.716AsnAsp: 1.716 ± 0.391
1.716AsnGlu: 1.716 ± 0.34
0.93AsnPhe: 0.93 ± 0.353
4.076AsnGly: 4.076 ± 0.597
0.858AsnHis: 0.858 ± 0.342
1.359AsnIle: 1.359 ± 0.283
1.859AsnLys: 1.859 ± 0.329
3.218AsnLeu: 3.218 ± 0.432
0.501AsnMet: 0.501 ± 0.196
1.359AsnAsn: 1.359 ± 0.478
1.931AsnPro: 1.931 ± 0.369
1.859AsnGln: 1.859 ± 0.393
1.716AsnArg: 1.716 ± 0.362
1.788AsnSer: 1.788 ± 0.366
2.86AsnThr: 2.86 ± 0.391
3.075AsnVal: 3.075 ± 0.495
1.073AsnTrp: 1.073 ± 0.275
1.073AsnTyr: 1.073 ± 0.271
0.0AsnXaa: 0.0 ± 0.0
Pro
5.506ProAla: 5.506 ± 0.654
0.501ProCys: 0.501 ± 0.232
2.932ProAsp: 2.932 ± 0.45
3.289ProGlu: 3.289 ± 0.506
1.073ProPhe: 1.073 ± 0.322
4.863ProGly: 4.863 ± 0.699
0.501ProHis: 0.501 ± 0.231
2.145ProIle: 2.145 ± 0.41
3.003ProLys: 3.003 ± 0.542
3.146ProLeu: 3.146 ± 0.489
1.216ProMet: 1.216 ± 0.301
1.859ProAsn: 1.859 ± 0.359
1.645ProPro: 1.645 ± 0.287
1.073ProGln: 1.073 ± 0.232
2.431ProArg: 2.431 ± 0.473
3.933ProSer: 3.933 ± 0.609
3.719ProThr: 3.719 ± 0.66
5.077ProVal: 5.077 ± 0.718
1.502ProTrp: 1.502 ± 0.415
0.572ProTyr: 0.572 ± 0.247
0.0ProXaa: 0.0 ± 0.0
Gln
3.576GlnAla: 3.576 ± 0.474
0.143GlnCys: 0.143 ± 0.106
2.217GlnAsp: 2.217 ± 0.422
2.36GlnGlu: 2.36 ± 0.39
1.287GlnPhe: 1.287 ± 0.297
2.789GlnGly: 2.789 ± 0.446
0.501GlnHis: 0.501 ± 0.141
1.859GlnIle: 1.859 ± 0.326
2.217GlnLys: 2.217 ± 0.328
5.077GlnLeu: 5.077 ± 0.725
1.216GlnMet: 1.216 ± 0.306
1.645GlnAsn: 1.645 ± 0.374
2.145GlnPro: 2.145 ± 0.436
1.859GlnGln: 1.859 ± 0.457
2.002GlnArg: 2.002 ± 0.52
1.573GlnSer: 1.573 ± 0.346
2.145GlnThr: 2.145 ± 0.367
3.432GlnVal: 3.432 ± 0.505
1.001GlnTrp: 1.001 ± 0.273
0.858GlnTyr: 0.858 ± 0.254
0.0GlnXaa: 0.0 ± 0.0
Arg
5.721ArgAla: 5.721 ± 0.707
0.572ArgCys: 0.572 ± 0.211
3.647ArgAsp: 3.647 ± 0.613
3.576ArgGlu: 3.576 ± 0.534
1.645ArgPhe: 1.645 ± 0.352
3.647ArgGly: 3.647 ± 0.44
1.144ArgHis: 1.144 ± 0.333
3.146ArgIle: 3.146 ± 0.444
4.076ArgLys: 4.076 ± 0.688
5.649ArgLeu: 5.649 ± 0.737
1.43ArgMet: 1.43 ± 0.416
2.36ArgAsn: 2.36 ± 0.501
2.86ArgPro: 2.86 ± 0.431
2.36ArgGln: 2.36 ± 0.392
3.79ArgArg: 3.79 ± 0.62
3.218ArgSer: 3.218 ± 0.382
3.218ArgThr: 3.218 ± 0.465
3.79ArgVal: 3.79 ± 0.56
1.001ArgTrp: 1.001 ± 0.373
1.359ArgTyr: 1.359 ± 0.382
0.0ArgXaa: 0.0 ± 0.0
Ser
4.362SerAla: 4.362 ± 0.61
0.429SerCys: 0.429 ± 0.199
3.075SerAsp: 3.075 ± 0.444
4.791SerGlu: 4.791 ± 0.473
1.645SerPhe: 1.645 ± 0.365
8.081SerGly: 8.081 ± 0.993
1.144SerHis: 1.144 ± 0.244
3.576SerIle: 3.576 ± 0.563
2.36SerLys: 2.36 ± 0.367
5.006SerLeu: 5.006 ± 0.814
1.502SerMet: 1.502 ± 0.39
1.645SerAsn: 1.645 ± 0.336
3.146SerPro: 3.146 ± 0.56
1.645SerGln: 1.645 ± 0.357
2.932SerArg: 2.932 ± 0.479
3.361SerSer: 3.361 ± 0.555
4.291SerThr: 4.291 ± 0.501
4.72SerVal: 4.72 ± 0.569
1.073SerTrp: 1.073 ± 0.42
2.288SerTyr: 2.288 ± 0.407
0.0SerXaa: 0.0 ± 0.0
Thr
7.366ThrAla: 7.366 ± 0.978
0.358ThrCys: 0.358 ± 0.162
3.289ThrAsp: 3.289 ± 0.482
4.434ThrGlu: 4.434 ± 0.447
2.002ThrPhe: 2.002 ± 0.455
5.292ThrGly: 5.292 ± 1.056
0.93ThrHis: 0.93 ± 0.246
2.36ThrIle: 2.36 ± 0.456
3.075ThrLys: 3.075 ± 0.592
5.006ThrLeu: 5.006 ± 0.643
1.216ThrMet: 1.216 ± 0.29
2.145ThrAsn: 2.145 ± 0.389
3.862ThrPro: 3.862 ± 0.468
2.431ThrGln: 2.431 ± 0.441
3.576ThrArg: 3.576 ± 0.514
4.72ThrSer: 4.72 ± 0.623
5.506ThrThr: 5.506 ± 0.939
4.434ThrVal: 4.434 ± 0.579
0.93ThrTrp: 0.93 ± 0.297
1.788ThrTyr: 1.788 ± 0.402
0.0ThrXaa: 0.0 ± 0.0
Val
6.436ValAla: 6.436 ± 0.675
0.286ValCys: 0.286 ± 0.126
3.79ValAsp: 3.79 ± 0.548
4.863ValGlu: 4.863 ± 0.567
2.002ValPhe: 2.002 ± 0.564
5.721ValGly: 5.721 ± 0.797
1.573ValHis: 1.573 ± 0.315
3.576ValIle: 3.576 ± 0.678
3.862ValLys: 3.862 ± 0.537
5.864ValLeu: 5.864 ± 0.749
2.145ValMet: 2.145 ± 0.368
2.717ValAsn: 2.717 ± 0.433
4.291ValPro: 4.291 ± 0.471
3.576ValGln: 3.576 ± 0.467
3.933ValArg: 3.933 ± 0.397
5.149ValSer: 5.149 ± 0.615
4.005ValThr: 4.005 ± 0.521
4.934ValVal: 4.934 ± 0.71
1.859ValTrp: 1.859 ± 0.358
1.716ValTyr: 1.716 ± 0.345
0.0ValXaa: 0.0 ± 0.0
Trp
2.717TrpAla: 2.717 ± 0.415
0.072TrpCys: 0.072 ± 0.076
1.359TrpAsp: 1.359 ± 0.432
1.001TrpGlu: 1.001 ± 0.338
0.787TrpPhe: 0.787 ± 0.235
1.43TrpGly: 1.43 ± 0.328
0.358TrpHis: 0.358 ± 0.135
0.858TrpIle: 0.858 ± 0.246
1.001TrpLys: 1.001 ± 0.253
1.859TrpLeu: 1.859 ± 0.336
0.358TrpMet: 0.358 ± 0.165
0.644TrpAsn: 0.644 ± 0.238
0.787TrpPro: 0.787 ± 0.377
0.429TrpGln: 0.429 ± 0.177
1.073TrpArg: 1.073 ± 0.303
1.502TrpSer: 1.502 ± 0.325
1.144TrpThr: 1.144 ± 0.339
1.931TrpVal: 1.931 ± 0.317
0.787TrpTrp: 0.787 ± 0.201
0.286TrpTyr: 0.286 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.217TyrAla: 2.217 ± 0.438
0.286TyrCys: 0.286 ± 0.146
1.645TyrAsp: 1.645 ± 0.284
1.645TyrGlu: 1.645 ± 0.307
0.787TyrPhe: 0.787 ± 0.187
2.503TyrGly: 2.503 ± 0.531
0.644TyrHis: 0.644 ± 0.19
0.93TyrIle: 0.93 ± 0.224
1.287TyrLys: 1.287 ± 0.343
1.931TyrLeu: 1.931 ± 0.415
0.215TyrMet: 0.215 ± 0.126
0.715TyrAsn: 0.715 ± 0.231
1.43TyrPro: 1.43 ± 0.283
0.644TyrGln: 0.644 ± 0.232
2.074TyrArg: 2.074 ± 0.341
1.287TyrSer: 1.287 ± 0.314
2.145TyrThr: 2.145 ± 0.451
1.645TyrVal: 1.645 ± 0.291
0.858TyrTrp: 0.858 ± 0.226
0.501TyrTyr: 0.501 ± 0.168
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (13985 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski