Amino acid dipeptide frequency for Gordonia phage NatB6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
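As a rough illustration of the calculation, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of counted pairs are assumptions made for this example; the page does not state the exact normalisation used by the database.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the same order as the table header.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 combinations."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is treated as Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())  # assumed normalisation: all counted pairs
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


toy = ["MAAK", "AAC"]  # hypothetical toy sequences, not taken from NatB6
freqs = dipeptide_frequencies(toy)
print(freqs["AlaAla"])  # 400.0 (2 of the 5 counted pairs are AlaAla)
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table on this page, where absent dipeptides are listed as 0.0.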

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
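The comparison against the uniform expectation can be sketched as follows. The threshold rule (a plain comparison with 2.5‰) and the function name `classify` are assumptions for illustration; the page does not say exactly how the red and blue marking is decided. The three example values are taken from the table on this page.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "red"      # more frequent than expected
    if freq_per_mille < expected:
        return "blue"     # underrepresented
    return "neutral"      # exactly at the expectation


# Values copied from the table (per mille).
examples = {"AlaAla": 12.177, "CysCys": 0.094, "LeuLys": 2.67}
for name, value in examples.items():
    ratio = value / EXPECTED_PER_MILLE
    print(f"{name}: {classify(value)} ({ratio:.2f}x expected)")
```

If Xaa is counted as a 21st residue, the same function can be called with `expected=1000.0 / 441`.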

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows: first residue. Columns: second residue. Cells: frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 12.177 ± 1.311 | 0.843 ± 0.208 | 7.634 ± 0.668 | 6.885 ± 0.611 | 3.044 ± 0.327 | 8.056 ± 0.724 | 2.014 ± 0.425 | 5.386 ± 0.588 | 5.105 ± 0.558 | 8.009 ± 0.9 | 2.201 ± 0.334 | 2.998 ± 0.327 | 4.215 ± 0.658 | 3.981 ± 0.534 | 6.136 ± 0.554 | 4.871 ± 0.529 | 6.229 ± 0.509 | 7.213 ± 0.47 | 1.686 ± 0.257 | 2.576 ± 0.338 | 0.0 ± 0.0
Cys | 0.562 ± 0.187 | 0.094 ± 0.068 | 0.937 ± 0.243 | 0.281 ± 0.105 | 0.094 ± 0.067 | 0.89 ± 0.248 | 0.375 ± 0.156 | 0.422 ± 0.14 | 0.234 ± 0.102 | 0.656 ± 0.172 | 0.0 ± 0.0 | 0.187 ± 0.078 | 0.422 ± 0.211 | 0.468 ± 0.156 | 0.515 ± 0.153 | 0.749 ± 0.241 | 0.562 ± 0.15 | 0.609 ± 0.197 | 0.187 ± 0.101 | 0.281 ± 0.13 | 0.0 ± 0.0
Asp | 6.932 ± 0.465 | 0.422 ± 0.155 | 6.651 ± 0.822 | 5.855 ± 0.73 | 2.295 ± 0.345 | 7.119 ± 0.664 | 1.358 ± 0.205 | 3.466 ± 0.432 | 2.201 ± 0.365 | 5.574 ± 0.487 | 1.592 ± 0.24 | 2.248 ± 0.338 | 5.105 ± 0.503 | 2.435 ± 0.352 | 5.105 ± 0.653 | 2.763 ± 0.351 | 3.747 ± 0.523 | 4.449 ± 0.437 | 1.265 ± 0.215 | 1.967 ± 0.3 | 0.0 ± 0.0
Glu | 5.761 ± 0.524 | 0.422 ± 0.168 | 4.309 ± 0.405 | 4.262 ± 0.633 | 2.576 ± 0.361 | 4.777 ± 0.426 | 1.405 ± 0.225 | 3.794 ± 0.359 | 3.091 ± 0.441 | 5.199 ± 0.455 | 1.686 ± 0.302 | 2.061 ± 0.304 | 3.185 ± 0.714 | 1.92 ± 0.224 | 5.527 ± 0.62 | 3.466 ± 0.374 | 3.747 ± 0.387 | 4.403 ± 0.379 | 1.452 ± 0.25 | 1.827 ± 0.272 | 0.0 ± 0.0
Phe | 2.81 ± 0.367 | 0.375 ± 0.147 | 3.044 ± 0.453 | 2.67 ± 0.329 | 0.89 ± 0.206 | 3.185 ± 0.331 | 0.796 ± 0.149 | 1.265 ± 0.222 | 0.843 ± 0.206 | 2.482 ± 0.38 | 0.562 ± 0.137 | 0.703 ± 0.152 | 1.733 ± 0.292 | 0.703 ± 0.146 | 2.482 ± 0.359 | 1.499 ± 0.274 | 1.733 ± 0.284 | 1.546 ± 0.265 | 0.749 ± 0.174 | 0.749 ± 0.166 | 0.0 ± 0.0
Gly | 7.728 ± 0.837 | 0.749 ± 0.222 | 5.667 ± 0.451 | 5.011 ± 0.463 | 2.576 ± 0.378 | 7.822 ± 1.308 | 1.873 ± 0.283 | 3.981 ± 0.605 | 3.841 ± 0.401 | 6.557 ± 0.748 | 2.201 ± 0.31 | 2.67 ± 0.389 | 4.543 ± 0.515 | 2.857 ± 0.431 | 6.744 ± 0.542 | 5.152 ± 0.545 | 5.292 ± 0.52 | 5.948 ± 0.661 | 1.639 ± 0.288 | 2.998 ± 0.402 | 0.0 ± 0.0
His | 1.452 ± 0.294 | 0.141 ± 0.087 | 0.89 ± 0.223 | 0.749 ± 0.216 | 0.609 ± 0.162 | 1.686 ± 0.293 | 0.328 ± 0.122 | 0.89 ± 0.184 | 0.562 ± 0.167 | 1.311 ± 0.311 | 0.375 ± 0.129 | 0.468 ± 0.157 | 1.78 ± 0.311 | 0.422 ± 0.144 | 2.061 ± 0.355 | 1.077 ± 0.23 | 0.796 ± 0.167 | 1.967 ± 0.34 | 0.468 ± 0.128 | 1.077 ± 0.226 | 0.0 ± 0.0
Ile | 6.089 ± 0.634 | 0.234 ± 0.123 | 3.887 ± 0.393 | 4.356 ± 0.451 | 0.89 ± 0.202 | 4.403 ± 0.639 | 0.937 ± 0.188 | 1.733 ± 0.236 | 2.154 ± 0.352 | 3.887 ± 0.407 | 0.796 ± 0.197 | 2.342 ± 0.355 | 2.529 ± 0.283 | 1.733 ± 0.296 | 2.763 ± 0.337 | 2.998 ± 0.375 | 2.67 ± 0.363 | 2.81 ± 0.401 | 0.796 ± 0.195 | 1.265 ± 0.278 | 0.0 ± 0.0
Lys | 3.325 ± 0.418 | 0.328 ± 0.134 | 3.138 ± 0.421 | 1.827 ± 0.32 | 1.218 ± 0.222 | 2.998 ± 0.486 | 0.562 ± 0.154 | 1.873 ± 0.305 | 2.295 ± 0.38 | 3.232 ± 0.4 | 1.171 ± 0.237 | 0.984 ± 0.205 | 2.67 ± 0.38 | 1.499 ± 0.253 | 3.372 ± 0.42 | 2.061 ± 0.275 | 2.998 ± 0.394 | 3.7 ± 0.352 | 0.656 ± 0.179 | 1.03 ± 0.218 | 0.0 ± 0.0
Leu | 8.009 ± 0.74 | 0.89 ± 0.251 | 6.51 ± 0.681 | 3.606 ± 0.371 | 2.201 ± 0.284 | 6.463 ± 0.818 | 1.077 ± 0.276 | 3.934 ± 0.466 | 2.67 ± 0.358 | 4.496 ± 0.487 | 1.78 ± 0.298 | 2.529 ± 0.296 | 4.73 ± 0.566 | 1.92 ± 0.263 | 6.136 ± 0.496 | 3.794 ± 0.383 | 5.386 ± 0.624 | 5.386 ± 0.429 | 1.124 ± 0.229 | 1.311 ± 0.255 | 0.0 ± 0.0
Met | 2.951 ± 0.515 | 0.0 ± 0.0 | 1.03 ± 0.195 | 1.639 ± 0.245 | 0.749 ± 0.173 | 0.749 ± 0.231 | 0.515 ± 0.153 | 1.077 ± 0.187 | 0.937 ± 0.212 | 1.592 ± 0.242 | 0.515 ± 0.145 | 0.843 ± 0.224 | 1.733 ± 0.339 | 0.609 ± 0.16 | 1.592 ± 0.253 | 1.499 ± 0.262 | 2.389 ± 0.341 | 1.077 ± 0.211 | 0.562 ± 0.178 | 0.562 ± 0.126 | 0.0 ± 0.0
Asn | 2.717 ± 0.271 | 0.375 ± 0.13 | 2.201 ± 0.335 | 1.733 ± 0.255 | 1.077 ± 0.19 | 3.419 ± 0.402 | 0.562 ± 0.177 | 1.171 ± 0.297 | 0.937 ± 0.208 | 1.92 ± 0.226 | 0.515 ± 0.134 | 1.03 ± 0.182 | 2.342 ± 0.342 | 0.937 ± 0.182 | 2.248 ± 0.287 | 1.405 ± 0.247 | 2.435 ± 0.368 | 2.201 ± 0.301 | 0.749 ± 0.205 | 1.124 ± 0.217 | 0.0 ± 0.0
Pro | 5.901 ± 0.725 | 0.141 ± 0.09 | 4.73 ± 0.483 | 4.684 ± 0.844 | 1.873 ± 0.245 | 4.965 ± 0.604 | 0.984 ± 0.22 | 3.091 ± 0.484 | 2.857 ± 0.353 | 4.122 ± 0.383 | 1.639 ± 0.31 | 1.827 ± 0.345 | 2.435 ± 0.382 | 1.358 ± 0.201 | 4.403 ± 0.531 | 3.138 ± 0.445 | 4.028 ± 0.512 | 4.356 ± 0.415 | 0.656 ± 0.145 | 0.796 ± 0.165 | 0.0 ± 0.0
Gln | 3.887 ± 0.427 | 0.234 ± 0.116 | 1.358 ± 0.289 | 1.592 ± 0.26 | 1.124 ± 0.238 | 1.967 ± 0.228 | 0.281 ± 0.106 | 1.92 ± 0.327 | 1.077 ± 0.233 | 2.81 ± 0.439 | 1.03 ± 0.193 | 1.171 ± 0.262 | 1.92 ± 0.31 | 1.499 ± 0.282 | 2.81 ± 0.44 | 1.592 ± 0.273 | 1.78 ± 0.321 | 2.389 ± 0.396 | 0.609 ± 0.167 | 0.609 ± 0.162 | 0.0 ± 0.0
Arg | 7.868 ± 0.85 | 1.358 ± 0.255 | 5.011 ± 0.639 | 5.105 ± 0.53 | 2.763 ± 0.492 | 6.417 ± 0.694 | 1.499 ± 0.317 | 3.841 ± 0.411 | 4.168 ± 0.406 | 4.543 ± 0.467 | 2.061 ± 0.354 | 2.108 ± 0.289 | 3.841 ± 0.521 | 2.061 ± 0.326 | 6.463 ± 0.662 | 3.325 ± 0.457 | 4.637 ± 0.559 | 5.574 ± 0.519 | 1.546 ± 0.263 | 2.108 ± 0.248 | 0.0 ± 0.0
Ser | 6.323 ± 0.53 | 0.422 ± 0.15 | 3.747 ± 0.438 | 3.466 ± 0.431 | 1.077 ± 0.178 | 4.965 ± 0.567 | 1.077 ± 0.269 | 2.717 ± 0.343 | 1.733 ± 0.299 | 3.279 ± 0.349 | 1.358 ± 0.28 | 1.686 ± 0.258 | 3.232 ± 0.379 | 1.827 ± 0.327 | 3.7 ± 0.384 | 2.998 ± 0.497 | 3.232 ± 0.342 | 3.981 ± 0.425 | 0.984 ± 0.2 | 0.749 ± 0.136 | 0.0 ± 0.0
Thr | 5.527 ± 0.486 | 0.422 ± 0.169 | 3.232 ± 0.34 | 3.7 ± 0.413 | 2.108 ± 0.251 | 6.838 ± 0.541 | 1.311 ± 0.25 | 3.887 ± 0.448 | 2.904 ± 0.366 | 5.386 ± 0.595 | 1.077 ± 0.208 | 1.405 ± 0.211 | 4.449 ± 0.514 | 1.873 ± 0.241 | 4.449 ± 0.53 | 3.325 ± 0.391 | 4.59 ± 0.512 | 5.058 ± 0.689 | 1.405 ± 0.247 | 1.546 ± 0.283 | 0.0 ± 0.0
Val | 7.353 ± 0.807 | 0.703 ± 0.21 | 5.48 ± 0.533 | 4.965 ± 0.527 | 2.201 ± 0.303 | 5.901 ± 0.577 | 1.077 ± 0.236 | 2.951 ± 0.424 | 1.873 ± 0.338 | 4.73 ± 0.386 | 1.311 ± 0.272 | 2.342 ± 0.282 | 4.215 ± 0.443 | 2.061 ± 0.34 | 5.995 ± 0.541 | 3.934 ± 0.383 | 5.48 ± 0.539 | 5.058 ± 0.459 | 1.218 ± 0.299 | 2.201 ± 0.37 | 0.0 ± 0.0
Trp | 1.827 ± 0.301 | 0.094 ± 0.07 | 1.358 ± 0.258 | 1.124 ± 0.247 | 0.656 ± 0.17 | 1.124 ± 0.211 | 0.609 ± 0.172 | 1.124 ± 0.231 | 0.796 ± 0.196 | 1.78 ± 0.346 | 0.234 ± 0.109 | 0.656 ± 0.188 | 0.703 ± 0.186 | 0.843 ± 0.231 | 1.358 ± 0.287 | 1.499 ± 0.364 | 1.171 ± 0.225 | 1.218 ± 0.213 | 0.89 ± 0.231 | 0.328 ± 0.106 | 0.0 ± 0.0
Tyr | 2.061 ± 0.348 | 0.281 ± 0.115 | 1.733 ± 0.273 | 1.358 ± 0.274 | 0.796 ± 0.212 | 2.108 ± 0.35 | 0.468 ± 0.17 | 0.749 ± 0.2 | 0.656 ± 0.16 | 2.389 ± 0.306 | 0.468 ± 0.129 | 0.843 ± 0.214 | 2.201 ± 0.363 | 0.656 ± 0.169 | 2.482 ± 0.371 | 1.405 ± 0.256 | 1.592 ± 0.247 | 2.108 ± 0.348 | 0.749 ± 0.209 | 0.609 ± 0.23 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 93 proteins (21352 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
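A minimal sketch of a protein-level bootstrap is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. Using the sample standard deviation as the error measure, the fixed random seed, the function names, and the toy sequences are all assumptions for illustration; the note above only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of a one-letter-code dipeptide such as 'AA'."""
    hits = total = 0
    for seq in proteins:
        # Overlapping pairs within a single protein.
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over resampled protein sets (assumed: stdev)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from NatB6.
toy = ["MAAKL", "AACDE", "MKLVV", "GAAGA"]
print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.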

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski