Amino acid dipeptide frequency for Streptomyces phage Rima

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
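As a rough illustration of how such frequencies can be obtained, the Python sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function names, the toy sequences and the choice to divide by the total number of dipeptides are assumptions made for this example; the database's own counting and normalisation are not described on this page, so the results need not reproduce the table.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes for the 20 standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to its three-letter code; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs; pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[three_letter(first) + three_letter(second)] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille (assumed normalisation)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    codes = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(codes, repeat=2)
    }


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MAAK", "AAXL"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(freqs["AlaAla"], 3))
```

The toy input contains six dipeptides in total, two of which are AlaAla, so that entry comes out as one third of 1000‰.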

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every pair equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
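The unit conversion and the comparison against the uniform expectation can be sketched in a few lines of Python. The helper names are hypothetical, and the 1/400 baseline is simply the one named in the text; the exact baseline behind the red and blue marking is not stated on this page, and switching to 441 is a one-argument change.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_vs_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    expected_per_mille = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value_per_mille / expected_per_mille


ala_ala = 10.757  # AlaAla value from the table, in per mille
cys_cys = 0.172   # CysCys value from the table, in per mille

fraction = per_mille_to_fraction(ala_ala)  # about 0.010757, i.e. roughly 1.08%
print(fraction, fold_vs_uniform(ala_ala), fold_vs_uniform(cys_cys))
```

Under this baseline AlaAla is roughly 4.3 times more frequent than expected, while CysCys falls well below 1 and would count as underrepresented.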

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.757 ± 1.589
AlaCys: 0.286 ± 0.132
AlaAsp: 4.978 ± 0.709
AlaGlu: 6.809 ± 0.596
AlaPhe: 2.689 ± 0.375
AlaGly: 7.038 ± 1.409
AlaHis: 1.373 ± 0.338
AlaIle: 4.749 ± 0.447
AlaLys: 7.095 ± 0.859
AlaLeu: 8.525 ± 1.498
AlaMet: 3.776 ± 0.539
AlaAsn: 2.918 ± 0.334
AlaPro: 2.861 ± 0.451
AlaGln: 3.49 ± 0.447
AlaArg: 4.234 ± 0.596
AlaSer: 3.834 ± 0.426
AlaThr: 5.779 ± 0.603
AlaVal: 5.493 ± 0.515
AlaTrp: 1.144 ± 0.35
AlaTyr: 3.433 ± 0.455
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.458 ± 0.183
CysCys: 0.172 ± 0.11
CysAsp: 0.343 ± 0.147
CysGlu: 0.343 ± 0.148
CysPhe: 0.229 ± 0.114
CysGly: 0.401 ± 0.159
CysHis: 0.114 ± 0.087
CysIle: 0.172 ± 0.113
CysLys: 0.172 ± 0.091
CysLeu: 0.744 ± 0.26
CysMet: 0.114 ± 0.096
CysAsn: 0.286 ± 0.137
CysPro: 0.343 ± 0.171
CysGln: 0.172 ± 0.101
CysArg: 0.172 ± 0.105
CysSer: 0.401 ± 0.15
CysThr: 0.401 ± 0.16
CysVal: 0.343 ± 0.161
CysTrp: 0.172 ± 0.089
CysTyr: 0.172 ± 0.117
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.493 ± 0.711
AspCys: 0.401 ± 0.168
AspAsp: 4.062 ± 0.571
AspGlu: 6.408 ± 0.764
AspPhe: 2.575 ± 0.461
AspGly: 5.493 ± 0.609
AspHis: 0.973 ± 0.255
AspIle: 3.891 ± 0.577
AspLys: 3.261 ± 0.426
AspLeu: 4.234 ± 0.523
AspMet: 1.602 ± 0.316
AspAsn: 2.403 ± 0.409
AspPro: 3.948 ± 0.759
AspGln: 1.659 ± 0.395
AspArg: 3.605 ± 0.591
AspSer: 3.433 ± 0.464
AspThr: 2.746 ± 0.421
AspVal: 3.261 ± 0.403
AspTrp: 1.488 ± 0.296
AspTyr: 1.774 ± 0.427
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.008 ± 0.76
GluCys: 0.229 ± 0.119
GluAsp: 5.264 ± 0.745
GluGlu: 6.466 ± 0.88
GluPhe: 3.433 ± 0.542
GluGly: 5.15 ± 0.53
GluHis: 1.087 ± 0.263
GluIle: 3.834 ± 0.455
GluLys: 4.806 ± 0.488
GluLeu: 5.951 ± 0.636
GluMet: 2.975 ± 0.397
GluAsn: 3.261 ± 0.415
GluPro: 2.575 ± 0.395
GluGln: 2.804 ± 0.382
GluArg: 4.291 ± 0.631
GluSer: 3.605 ± 0.478
GluThr: 3.662 ± 0.434
GluVal: 5.493 ± 0.605
GluTrp: 1.202 ± 0.238
GluTyr: 1.945 ± 0.378
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.376 ± 0.425
PheCys: 0.343 ± 0.163
PheAsp: 3.376 ± 0.494
PheGlu: 2.918 ± 0.42
PhePhe: 1.545 ± 0.281
PheGly: 2.861 ± 0.559
PheHis: 0.458 ± 0.154
PheIle: 1.774 ± 0.419
PheLys: 1.945 ± 0.384
PheLeu: 2.06 ± 0.456
PheMet: 0.973 ± 0.233
PheAsn: 1.717 ± 0.327
PhePro: 1.202 ± 0.236
PheGln: 1.373 ± 0.278
PheArg: 2.232 ± 0.334
PheSer: 1.774 ± 0.466
PheThr: 2.346 ± 0.422
PheVal: 2.174 ± 0.338
PheTrp: 0.572 ± 0.206
PheTyr: 0.858 ± 0.247
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.58 ± 1.053
GlyCys: 0.401 ± 0.175
GlyAsp: 4.52 ± 0.498
GlyGlu: 4.177 ± 0.46
GlyPhe: 3.261 ± 0.621
GlyGly: 5.035 ± 0.743
GlyHis: 0.973 ± 0.252
GlyIle: 4.52 ± 0.893
GlyLys: 4.291 ± 0.602
GlyLeu: 6.065 ± 1.479
GlyMet: 2.346 ± 0.323
GlyAsn: 2.746 ± 0.47
GlyPro: 2.518 ± 0.391
GlyGln: 1.945 ± 0.415
GlyArg: 3.09 ± 0.352
GlySer: 4.635 ± 0.699
GlyThr: 6.122 ± 0.551
GlyVal: 6.122 ± 0.885
GlyTrp: 1.03 ± 0.211
GlyTyr: 3.319 ± 0.636
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.973 ± 0.279
HisCys: 0.229 ± 0.13
HisAsp: 1.316 ± 0.35
HisGlu: 1.43 ± 0.299
HisPhe: 0.401 ± 0.171
HisGly: 1.602 ± 0.345
HisHis: 0.572 ± 0.167
HisIle: 1.259 ± 0.309
HisLys: 1.373 ± 0.32
HisLeu: 1.316 ± 0.3
HisMet: 0.458 ± 0.185
HisAsn: 0.515 ± 0.165
HisPro: 0.915 ± 0.191
HisGln: 0.572 ± 0.21
HisArg: 0.801 ± 0.252
HisSer: 1.259 ± 0.269
HisThr: 0.515 ± 0.159
HisVal: 1.43 ± 0.283
HisTrp: 0.343 ± 0.176
HisTyr: 1.316 ± 0.279
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.749 ± 0.494
IleCys: 0.515 ± 0.205
IleAsp: 3.948 ± 0.48
IleGlu: 4.291 ± 0.412
IlePhe: 1.945 ± 0.335
IleGly: 3.548 ± 0.775
IleHis: 1.259 ± 0.311
IleIle: 2.06 ± 0.474
IleLys: 3.204 ± 0.523
IleLeu: 4.864 ± 0.398
IleMet: 0.915 ± 0.218
IleAsn: 1.945 ± 0.307
IlePro: 2.346 ± 0.279
IleGln: 1.717 ± 0.281
IleArg: 2.918 ± 0.407
IleSer: 3.776 ± 0.495
IleThr: 3.662 ± 0.528
IleVal: 3.548 ± 0.363
IleTrp: 0.286 ± 0.154
IleTyr: 1.43 ± 0.244
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.637 ± 0.774
LysCys: 0.172 ± 0.119
LysAsp: 3.719 ± 0.614
LysGlu: 4.062 ± 0.692
LysPhe: 1.888 ± 0.287
LysGly: 5.035 ± 1.016
LysHis: 1.259 ± 0.319
LysIle: 3.49 ± 0.531
LysLys: 6.866 ± 0.958
LysLeu: 6.752 ± 0.701
LysMet: 2.117 ± 0.641
LysAsn: 3.147 ± 0.482
LysPro: 2.518 ± 0.51
LysGln: 2.518 ± 0.378
LysArg: 3.261 ± 0.507
LysSer: 4.577 ± 0.539
LysThr: 4.406 ± 0.674
LysVal: 4.291 ± 0.474
LysTrp: 0.858 ± 0.229
LysTyr: 2.003 ± 0.239
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.667 ± 1.328
LeuCys: 0.343 ± 0.141
LeuAsp: 4.749 ± 0.502
LeuGlu: 6.695 ± 0.557
LeuPhe: 2.518 ± 0.419
LeuGly: 5.951 ± 1.004
LeuHis: 1.774 ± 0.303
LeuIle: 3.948 ± 0.672
LeuLys: 6.351 ± 0.736
LeuLeu: 5.378 ± 0.61
LeuMet: 2.06 ± 0.256
LeuAsn: 2.918 ± 0.515
LeuPro: 2.46 ± 0.394
LeuGln: 2.575 ± 0.43
LeuArg: 4.062 ± 0.501
LeuSer: 4.52 ± 0.52
LeuThr: 5.779 ± 0.68
LeuVal: 6.008 ± 0.725
LeuTrp: 0.973 ± 0.202
LeuTyr: 2.518 ± 0.347
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.46 ± 0.315
MetCys: 0.114 ± 0.07
MetAsp: 2.117 ± 0.237
MetGlu: 1.43 ± 0.292
MetPhe: 1.373 ± 0.254
MetGly: 1.717 ± 0.264
MetHis: 0.229 ± 0.097
MetIle: 1.316 ± 0.345
MetLys: 1.659 ± 0.269
MetLeu: 2.804 ± 0.436
MetMet: 0.572 ± 0.181
MetAsn: 1.602 ± 0.317
MetPro: 1.373 ± 0.286
MetGln: 0.629 ± 0.207
MetArg: 1.545 ± 0.238
MetSer: 2.117 ± 0.38
MetThr: 2.174 ± 0.368
MetVal: 1.202 ± 0.26
MetTrp: 0.401 ± 0.148
MetTyr: 0.801 ± 0.219
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.605 ± 0.376
AsnCys: 0.286 ± 0.157
AsnAsp: 2.632 ± 0.361
AsnGlu: 3.147 ± 0.434
AsnPhe: 0.858 ± 0.299
AsnGly: 2.918 ± 0.4
AsnHis: 1.087 ± 0.254
AsnIle: 1.774 ± 0.255
AsnLys: 2.46 ± 0.39
AsnLeu: 2.403 ± 0.486
AsnMet: 0.915 ± 0.197
AsnAsn: 2.003 ± 0.379
AsnPro: 2.403 ± 0.433
AsnGln: 1.373 ± 0.262
AsnArg: 2.346 ± 0.492
AsnSer: 3.09 ± 0.444
AsnThr: 2.46 ± 0.368
AsnVal: 2.746 ± 0.341
AsnTrp: 0.343 ± 0.138
AsnTyr: 1.259 ± 0.231
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.918 ± 0.472
ProCys: 0.401 ± 0.135
ProAsp: 3.548 ± 0.555
ProGlu: 3.548 ± 0.558
ProPhe: 1.545 ± 0.309
ProGly: 2.861 ± 0.473
ProHis: 0.687 ± 0.232
ProIle: 1.717 ± 0.356
ProLys: 3.319 ± 0.426
ProLeu: 2.632 ± 0.383
ProMet: 0.858 ± 0.239
ProAsn: 1.545 ± 0.319
ProPro: 2.117 ± 0.434
ProGln: 1.03 ± 0.278
ProArg: 1.945 ± 0.392
ProSer: 2.632 ± 0.52
ProThr: 2.975 ± 0.479
ProVal: 3.605 ± 0.438
ProTrp: 0.343 ± 0.135
ProTyr: 1.144 ± 0.241
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.033 ± 0.439
GlnCys: 0.286 ± 0.131
GlnAsp: 1.717 ± 0.307
GlnGlu: 1.888 ± 0.417
GlnPhe: 1.202 ± 0.224
GlnGly: 2.06 ± 0.39
GlnHis: 0.915 ± 0.272
GlnIle: 2.403 ± 0.446
GlnLys: 2.289 ± 0.326
GlnLeu: 2.804 ± 0.479
GlnMet: 0.744 ± 0.241
GlnAsn: 1.202 ± 0.287
GlnPro: 1.202 ± 0.323
GlnGln: 1.545 ± 0.369
GlnArg: 1.659 ± 0.266
GlnSer: 2.06 ± 0.332
GlnThr: 2.174 ± 0.332
GlnVal: 2.575 ± 0.43
GlnTrp: 0.401 ± 0.133
GlnTyr: 1.144 ± 0.227
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.177 ± 0.591
ArgCys: 0.458 ± 0.135
ArgAsp: 2.46 ± 0.474
ArgGlu: 3.433 ± 0.552
ArgPhe: 0.973 ± 0.197
ArgGly: 3.147 ± 0.496
ArgHis: 0.915 ± 0.3
ArgIle: 2.804 ± 0.473
ArgLys: 4.406 ± 0.568
ArgLeu: 4.864 ± 0.723
ArgMet: 1.488 ± 0.294
ArgAsn: 1.831 ± 0.308
ArgPro: 2.117 ± 0.39
ArgGln: 2.174 ± 0.439
ArgArg: 4.12 ± 0.694
ArgSer: 3.147 ± 0.532
ArgThr: 3.261 ± 0.416
ArgVal: 2.861 ± 0.412
ArgTrp: 0.858 ± 0.303
ArgTyr: 1.545 ± 0.352
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.607 ± 0.699
SerCys: 0.229 ± 0.119
SerAsp: 3.548 ± 0.56
SerGlu: 4.463 ± 0.441
SerPhe: 2.289 ± 0.466
SerGly: 4.864 ± 0.539
SerHis: 1.144 ± 0.317
SerIle: 3.033 ± 0.46
SerLys: 4.062 ± 0.629
SerLeu: 4.749 ± 0.608
SerMet: 1.373 ± 0.245
SerAsn: 2.403 ± 0.397
SerPro: 2.918 ± 0.408
SerGln: 1.945 ± 0.371
SerArg: 2.403 ± 0.496
SerSer: 4.406 ± 0.681
SerThr: 4.005 ± 0.522
SerVal: 4.177 ± 0.604
SerTrp: 1.087 ± 0.255
SerTyr: 1.602 ± 0.329
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.122 ± 0.841
ThrCys: 0.229 ± 0.119
ThrAsp: 3.605 ± 0.416
ThrGlu: 4.12 ± 0.562
ThrPhe: 2.632 ± 0.322
ThrGly: 5.722 ± 0.556
ThrHis: 1.488 ± 0.316
ThrIle: 3.948 ± 0.497
ThrLys: 4.005 ± 0.447
ThrLeu: 4.749 ± 0.516
ThrMet: 1.488 ± 0.292
ThrAsn: 1.602 ± 0.288
ThrPro: 3.319 ± 0.433
ThrGln: 1.545 ± 0.274
ThrArg: 2.861 ± 0.399
ThrSer: 3.776 ± 0.717
ThrThr: 5.607 ± 0.775
ThrVal: 5.665 ± 0.675
ThrTrp: 1.087 ± 0.315
ThrTyr: 2.003 ± 0.347
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.752 ± 0.614
ValCys: 0.343 ± 0.142
ValAsp: 3.719 ± 0.432
ValGlu: 4.921 ± 0.726
ValPhe: 2.518 ± 0.448
ValGly: 4.978 ± 0.588
ValHis: 1.144 ± 0.241
ValIle: 3.719 ± 0.392
ValLys: 5.092 ± 0.509
ValLeu: 5.55 ± 0.638
ValMet: 1.717 ± 0.345
ValAsn: 2.975 ± 0.381
ValPro: 2.575 ± 0.45
ValGln: 2.06 ± 0.314
ValArg: 3.433 ± 0.435
ValSer: 3.776 ± 0.461
ValThr: 5.092 ± 0.561
ValVal: 4.635 ± 0.626
ValTrp: 1.316 ± 0.311
ValTyr: 2.975 ± 0.509
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.144 ± 0.239
TrpCys: 0.114 ± 0.083
TrpAsp: 1.202 ± 0.298
TrpGlu: 1.316 ± 0.248
TrpPhe: 0.401 ± 0.159
TrpGly: 1.43 ± 0.256
TrpHis: 0.343 ± 0.143
TrpIle: 0.515 ± 0.179
TrpLys: 0.801 ± 0.215
TrpLeu: 0.801 ± 0.224
TrpMet: 0.629 ± 0.213
TrpAsn: 1.144 ± 0.293
TrpPro: 0.286 ± 0.136
TrpGln: 0.343 ± 0.124
TrpArg: 0.286 ± 0.18
TrpSer: 0.973 ± 0.25
TrpThr: 0.858 ± 0.247
TrpVal: 1.259 ± 0.347
TrpTrp: 0.114 ± 0.08
TrpTyr: 0.515 ± 0.196
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.804 ± 0.527
TyrCys: 0.172 ± 0.095
TyrAsp: 1.888 ± 0.346
TyrGlu: 2.174 ± 0.352
TyrPhe: 1.602 ± 0.377
TyrGly: 1.888 ± 0.373
TyrHis: 0.801 ± 0.259
TyrIle: 2.003 ± 0.403
TyrLys: 2.117 ± 0.394
TyrLeu: 2.003 ± 0.353
TyrMet: 0.458 ± 0.149
TyrAsn: 1.945 ± 0.347
TyrPro: 1.373 ± 0.285
TyrGln: 1.831 ± 0.314
TyrArg: 1.774 ± 0.271
TyrSer: 2.518 ± 0.426
TyrThr: 1.545 ± 0.385
TyrVal: 2.403 ± 0.322
TyrTrp: 0.458 ± 0.198
TyrTyr: 1.488 ± 0.371
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 87 proteins (17478 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
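A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. This is only an illustration under stated assumptions; the function names, the use of the sample standard deviation as the error measure, the fixed seed and the toy sequences are all choices made here, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of all overlapping residue pairs equal to `dipeptide` (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation (in per mille) over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the phage.
toy_proteome = ["MAAK", "GGGG", "AAAA", "KLKL"]
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins instead of treating every dipeptide as independent.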

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski