Amino acid dipeptide frequency for Ruegeria phage 45A6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
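The comparison against the uniform expectation can be sketched in Python. This is only an illustration, not the Proteome-pI implementation: the names `dipeptide_per_mille` and `classify` are hypothetical, the sequences use one-letter codes rather than the three-letter codes of the table, and it is assumed that dipeptides are counted as overlapping pairs within each protein and never across two proteins.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 0.25% = 2.5 per mille


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: overlapping windows, never spanning two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"    # would be marked in red
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"   # would be marked in blue
    return "as expected"


# Toy input, made up for illustration (not taken from the phage proteome).
toy_freqs = dipeptide_per_mille(["MAAK", "AAG"])
print(toy_freqs["AA"], classify(toy_freqs["AA"]))
```

In the toy input, AA accounts for 2 of the 5 dipeptides, so it is classified as overrepresented.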

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information see the sequence space article on Wikipedia.
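As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(14.678)  # AlaAla value from the table
uniform = 1 / 400                        # 0.0025, i.e. 0.25% or 2.5 per mille
ratio = ala_ala / uniform                # fold difference versus uniform

print(f"{ala_ala:.6f} {ratio:.2f}")      # prints "0.014678 5.87"
```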

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.678AlaAla: 14.678 ± 1.288
0.819AlaCys: 0.819 ± 0.292
8.771AlaAsp: 8.771 ± 1.454
8.888AlaGlu: 8.888 ± 0.633
3.567AlaPhe: 3.567 ± 0.488
9.649AlaGly: 9.649 ± 0.992
2.047AlaHis: 2.047 ± 0.477
4.737AlaIle: 4.737 ± 0.464
5.672AlaLys: 5.672 ± 0.478
9.824AlaLeu: 9.824 ± 0.743
4.444AlaMet: 4.444 ± 0.781
3.158AlaAsn: 3.158 ± 0.55
6.725AlaPro: 6.725 ± 0.956
5.204AlaGln: 5.204 ± 0.554
8.713AlaArg: 8.713 ± 0.852
6.198AlaSer: 6.198 ± 1.172
6.198AlaThr: 6.198 ± 0.671
5.906AlaVal: 5.906 ± 0.484
1.52AlaTrp: 1.52 ± 0.286
2.514AlaTyr: 2.514 ± 0.379
0.0AlaXaa: 0.0 ± 0.0
Cys
0.643CysAla: 0.643 ± 0.245
0.117CysCys: 0.117 ± 0.074
0.468CysAsp: 0.468 ± 0.201
0.468CysGlu: 0.468 ± 0.195
0.117CysPhe: 0.117 ± 0.088
0.76CysGly: 0.76 ± 0.262
0.117CysHis: 0.117 ± 0.079
0.292CysIle: 0.292 ± 0.157
0.409CysLys: 0.409 ± 0.164
0.351CysLeu: 0.351 ± 0.162
0.234CysMet: 0.234 ± 0.108
0.292CysAsn: 0.292 ± 0.193
0.643CysPro: 0.643 ± 0.269
0.117CysGln: 0.117 ± 0.101
0.76CysArg: 0.76 ± 0.352
0.409CysSer: 0.409 ± 0.164
0.117CysThr: 0.117 ± 0.094
0.468CysVal: 0.468 ± 0.175
0.234CysTrp: 0.234 ± 0.161
0.175CysTyr: 0.175 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
8.421AspAla: 8.421 ± 0.955
0.351AspCys: 0.351 ± 0.17
5.555AspAsp: 5.555 ± 1.122
4.269AspGlu: 4.269 ± 0.659
2.164AspPhe: 2.164 ± 0.355
7.193AspGly: 7.193 ± 0.65
1.111AspHis: 1.111 ± 0.254
3.041AspIle: 3.041 ± 0.374
2.456AspLys: 2.456 ± 0.418
6.374AspLeu: 6.374 ± 0.557
1.988AspMet: 1.988 ± 0.384
1.637AspAsn: 1.637 ± 0.362
4.678AspPro: 4.678 ± 0.564
2.865AspGln: 2.865 ± 0.622
5.438AspArg: 5.438 ± 0.747
2.456AspSer: 2.456 ± 0.368
3.392AspThr: 3.392 ± 0.428
4.327AspVal: 4.327 ± 0.646
1.345AspTrp: 1.345 ± 0.249
1.52AspTyr: 1.52 ± 0.311
0.0AspXaa: 0.0 ± 0.0
Glu
9.941GluAla: 9.941 ± 1.007
0.292GluCys: 0.292 ± 0.147
4.327GluAsp: 4.327 ± 0.498
5.029GluGlu: 5.029 ± 0.694
1.988GluPhe: 1.988 ± 0.261
5.087GluGly: 5.087 ± 0.609
1.228GluHis: 1.228 ± 0.369
4.386GluIle: 4.386 ± 0.513
2.631GluLys: 2.631 ± 0.559
4.035GluLeu: 4.035 ± 0.569
2.105GluMet: 2.105 ± 0.373
2.748GluAsn: 2.748 ± 0.301
3.099GluPro: 3.099 ± 0.41
2.982GluGln: 2.982 ± 0.452
5.555GluArg: 5.555 ± 0.684
1.813GluSer: 1.813 ± 0.339
4.093GluThr: 4.093 ± 0.511
4.327GluVal: 4.327 ± 0.667
1.17GluTrp: 1.17 ± 0.271
1.754GluTyr: 1.754 ± 0.363
0.0GluXaa: 0.0 ± 0.0
Phe
3.333PheAla: 3.333 ± 0.593
0.292PheCys: 0.292 ± 0.159
2.281PheAsp: 2.281 ± 0.462
3.099PheGlu: 3.099 ± 0.449
0.468PhePhe: 0.468 ± 0.137
3.158PheGly: 3.158 ± 0.422
0.526PheHis: 0.526 ± 0.185
1.17PheIle: 1.17 ± 0.281
1.053PheLys: 1.053 ± 0.257
2.164PheLeu: 2.164 ± 0.333
0.643PheMet: 0.643 ± 0.17
0.936PheAsn: 0.936 ± 0.24
1.754PhePro: 1.754 ± 0.325
0.819PheGln: 0.819 ± 0.201
2.398PheArg: 2.398 ± 0.523
1.871PheSer: 1.871 ± 0.402
2.105PheThr: 2.105 ± 0.474
2.339PheVal: 2.339 ± 0.382
0.585PheTrp: 0.585 ± 0.168
1.17PheTyr: 1.17 ± 0.187
0.0PheXaa: 0.0 ± 0.0
Gly
8.947GlyAla: 8.947 ± 0.848
0.351GlyCys: 0.351 ± 0.183
5.789GlyAsp: 5.789 ± 0.558
6.374GlyGlu: 6.374 ± 0.481
3.099GlyPhe: 3.099 ± 0.536
9.239GlyGly: 9.239 ± 0.923
2.047GlyHis: 2.047 ± 0.453
2.865GlyIle: 2.865 ± 0.491
4.035GlyLys: 4.035 ± 0.52
6.023GlyLeu: 6.023 ± 0.639
2.456GlyMet: 2.456 ± 0.396
2.631GlyAsn: 2.631 ± 0.452
3.976GlyPro: 3.976 ± 0.582
3.918GlyGln: 3.918 ± 0.548
6.725GlyArg: 6.725 ± 0.838
5.789GlySer: 5.789 ± 0.743
5.029GlyThr: 5.029 ± 0.615
5.204GlyVal: 5.204 ± 0.575
2.105GlyTrp: 2.105 ± 0.47
1.637GlyTyr: 1.637 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.871HisAla: 1.871 ± 0.363
0.234HisCys: 0.234 ± 0.164
1.403HisAsp: 1.403 ± 0.264
1.579HisGlu: 1.579 ± 0.486
0.702HisPhe: 0.702 ± 0.227
1.93HisGly: 1.93 ± 0.323
0.292HisHis: 0.292 ± 0.142
0.819HisIle: 0.819 ± 0.284
0.76HisLys: 0.76 ± 0.244
1.696HisLeu: 1.696 ± 0.33
0.526HisMet: 0.526 ± 0.213
0.702HisAsn: 0.702 ± 0.213
1.462HisPro: 1.462 ± 0.354
0.819HisGln: 0.819 ± 0.197
1.345HisArg: 1.345 ± 0.471
1.111HisSer: 1.111 ± 0.384
0.702HisThr: 0.702 ± 0.341
1.111HisVal: 1.111 ± 0.313
0.234HisTrp: 0.234 ± 0.2
0.292HisTyr: 0.292 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
5.614IleAla: 5.614 ± 0.698
0.351IleCys: 0.351 ± 0.163
2.982IleAsp: 2.982 ± 0.358
3.509IleGlu: 3.509 ± 0.554
1.17IlePhe: 1.17 ± 0.245
3.216IleGly: 3.216 ± 0.489
0.819IleHis: 0.819 ± 0.3
2.105IleIle: 2.105 ± 0.356
2.222IleLys: 2.222 ± 0.456
2.456IleLeu: 2.456 ± 0.393
0.643IleMet: 0.643 ± 0.192
2.047IleAsn: 2.047 ± 0.376
1.579IlePro: 1.579 ± 0.287
1.286IleGln: 1.286 ± 0.187
3.041IleArg: 3.041 ± 0.371
3.216IleSer: 3.216 ± 0.366
3.742IleThr: 3.742 ± 0.485
2.281IleVal: 2.281 ± 0.357
0.936IleTrp: 0.936 ± 0.257
1.17IleTyr: 1.17 ± 0.383
0.0IleXaa: 0.0 ± 0.0
Lys
5.789LysAla: 5.789 ± 0.82
0.351LysCys: 0.351 ± 0.142
2.339LysAsp: 2.339 ± 0.389
2.631LysGlu: 2.631 ± 0.493
0.76LysPhe: 0.76 ± 0.237
3.801LysGly: 3.801 ± 0.7
0.702LysHis: 0.702 ± 0.248
1.696LysIle: 1.696 ± 0.329
1.93LysLys: 1.93 ± 0.529
2.69LysLeu: 2.69 ± 0.353
0.994LysMet: 0.994 ± 0.264
1.286LysAsn: 1.286 ± 0.255
2.514LysPro: 2.514 ± 0.503
1.696LysGln: 1.696 ± 0.366
4.269LysArg: 4.269 ± 0.41
2.105LysSer: 2.105 ± 0.391
2.924LysThr: 2.924 ± 0.389
2.514LysVal: 2.514 ± 0.346
0.585LysTrp: 0.585 ± 0.256
0.994LysTyr: 0.994 ± 0.253
0.0LysXaa: 0.0 ± 0.0
Leu
8.771LeuAla: 8.771 ± 0.703
0.643LeuCys: 0.643 ± 0.304
5.263LeuAsp: 5.263 ± 0.611
4.386LeuGlu: 4.386 ± 0.477
3.099LeuPhe: 3.099 ± 0.435
7.017LeuGly: 7.017 ± 0.577
1.637LeuHis: 1.637 ± 0.32
3.392LeuIle: 3.392 ± 0.625
3.45LeuLys: 3.45 ± 0.486
5.029LeuLeu: 5.029 ± 0.717
1.462LeuMet: 1.462 ± 0.375
2.339LeuAsn: 2.339 ± 0.378
3.567LeuPro: 3.567 ± 0.456
3.216LeuGln: 3.216 ± 0.478
6.315LeuArg: 6.315 ± 0.703
4.327LeuSer: 4.327 ± 0.616
3.976LeuThr: 3.976 ± 0.427
4.62LeuVal: 4.62 ± 0.536
0.994LeuTrp: 0.994 ± 0.283
1.696LeuTyr: 1.696 ± 0.333
0.0LeuXaa: 0.0 ± 0.0
Met
4.152MetAla: 4.152 ± 0.469
0.058MetCys: 0.058 ± 0.065
1.17MetAsp: 1.17 ± 0.233
1.403MetGlu: 1.403 ± 0.286
0.877MetPhe: 0.877 ± 0.279
2.456MetGly: 2.456 ± 0.339
0.643MetHis: 0.643 ± 0.262
1.345MetIle: 1.345 ± 0.324
0.76MetLys: 0.76 ± 0.3
2.222MetLeu: 2.222 ± 0.348
0.468MetMet: 0.468 ± 0.136
0.76MetAsn: 0.76 ± 0.224
1.637MetPro: 1.637 ± 0.279
0.877MetGln: 0.877 ± 0.237
2.047MetArg: 2.047 ± 0.343
1.754MetSer: 1.754 ± 0.446
1.93MetThr: 1.93 ± 0.354
1.053MetVal: 1.053 ± 0.288
0.409MetTrp: 0.409 ± 0.159
0.292MetTyr: 0.292 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
3.216AsnAla: 3.216 ± 0.622
0.175AsnCys: 0.175 ± 0.1
1.345AsnAsp: 1.345 ± 0.357
1.286AsnGlu: 1.286 ± 0.204
0.994AsnPhe: 0.994 ± 0.233
2.982AsnGly: 2.982 ± 0.435
0.819AsnHis: 0.819 ± 0.227
1.403AsnIle: 1.403 ± 0.346
1.345AsnLys: 1.345 ± 0.29
2.865AsnLeu: 2.865 ± 0.5
0.819AsnMet: 0.819 ± 0.215
0.994AsnAsn: 0.994 ± 0.209
2.398AsnPro: 2.398 ± 0.369
0.994AsnGln: 0.994 ± 0.221
1.754AsnArg: 1.754 ± 0.29
1.579AsnSer: 1.579 ± 0.367
1.52AsnThr: 1.52 ± 0.286
1.462AsnVal: 1.462 ± 0.291
1.053AsnTrp: 1.053 ± 0.185
0.292AsnTyr: 0.292 ± 0.167
0.0AsnXaa: 0.0 ± 0.0
Pro
6.432ProAla: 6.432 ± 1.018
0.351ProCys: 0.351 ± 0.221
4.678ProAsp: 4.678 ± 0.428
4.912ProGlu: 4.912 ± 0.794
1.637ProPhe: 1.637 ± 0.292
4.62ProGly: 4.62 ± 0.436
1.286ProHis: 1.286 ± 0.332
2.222ProIle: 2.222 ± 0.362
1.813ProLys: 1.813 ± 0.353
4.327ProLeu: 4.327 ± 0.562
1.813ProMet: 1.813 ± 0.417
0.936ProAsn: 0.936 ± 0.208
3.041ProPro: 3.041 ± 0.514
2.164ProGln: 2.164 ± 0.319
3.275ProArg: 3.275 ± 0.376
3.041ProSer: 3.041 ± 0.363
3.041ProThr: 3.041 ± 0.548
3.333ProVal: 3.333 ± 0.347
0.526ProTrp: 0.526 ± 0.156
0.76ProTyr: 0.76 ± 0.251
0.0ProXaa: 0.0 ± 0.0
Gln
4.795GlnAla: 4.795 ± 0.685
0.117GlnCys: 0.117 ± 0.118
2.222GlnAsp: 2.222 ± 0.409
2.339GlnGlu: 2.339 ± 0.443
0.994GlnPhe: 0.994 ± 0.27
3.333GlnGly: 3.333 ± 0.451
1.17GlnHis: 1.17 ± 0.263
2.339GlnIle: 2.339 ± 0.384
1.17GlnLys: 1.17 ± 0.233
2.514GlnLeu: 2.514 ± 0.353
1.462GlnMet: 1.462 ± 0.316
0.877GlnAsn: 0.877 ± 0.264
2.573GlnPro: 2.573 ± 0.452
1.988GlnGln: 1.988 ± 0.586
3.45GlnArg: 3.45 ± 0.481
2.281GlnSer: 2.281 ± 0.449
2.573GlnThr: 2.573 ± 0.424
1.696GlnVal: 1.696 ± 0.251
0.526GlnTrp: 0.526 ± 0.167
1.053GlnTyr: 1.053 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
7.953ArgAla: 7.953 ± 0.836
0.409ArgCys: 0.409 ± 0.199
6.432ArgAsp: 6.432 ± 0.682
5.848ArgGlu: 5.848 ± 0.654
2.748ArgPhe: 2.748 ± 0.456
5.029ArgGly: 5.029 ± 0.675
1.813ArgHis: 1.813 ± 0.37
4.035ArgIle: 4.035 ± 0.407
3.742ArgLys: 3.742 ± 0.619
6.14ArgLeu: 6.14 ± 0.545
1.52ArgMet: 1.52 ± 0.391
2.69ArgAsn: 2.69 ± 0.354
3.216ArgPro: 3.216 ± 0.461
2.865ArgGln: 2.865 ± 0.335
6.959ArgArg: 6.959 ± 0.966
3.976ArgSer: 3.976 ± 0.494
3.216ArgThr: 3.216 ± 0.467
4.327ArgVal: 4.327 ± 0.388
1.345ArgTrp: 1.345 ± 0.333
1.871ArgTyr: 1.871 ± 0.305
0.0ArgXaa: 0.0 ± 0.0
Ser
7.134SerAla: 7.134 ± 1.268
0.409SerCys: 0.409 ± 0.184
3.567SerAsp: 3.567 ± 0.457
3.859SerGlu: 3.859 ± 0.483
1.754SerPhe: 1.754 ± 0.324
5.321SerGly: 5.321 ± 0.824
0.936SerHis: 0.936 ± 0.267
2.339SerIle: 2.339 ± 0.309
2.573SerLys: 2.573 ± 0.449
4.093SerLeu: 4.093 ± 0.471
1.345SerMet: 1.345 ± 0.264
0.819SerAsn: 0.819 ± 0.226
2.807SerPro: 2.807 ± 0.346
2.047SerGln: 2.047 ± 0.459
3.801SerArg: 3.801 ± 0.469
2.748SerSer: 2.748 ± 0.444
2.924SerThr: 2.924 ± 0.439
3.333SerVal: 3.333 ± 0.454
0.702SerTrp: 0.702 ± 0.282
0.877SerTyr: 0.877 ± 0.39
0.0SerXaa: 0.0 ± 0.0
Thr
7.251ThrAla: 7.251 ± 0.684
0.526ThrCys: 0.526 ± 0.214
4.561ThrAsp: 4.561 ± 0.505
2.339ThrGlu: 2.339 ± 0.481
2.339ThrPhe: 2.339 ± 0.456
5.731ThrGly: 5.731 ± 0.638
0.468ThrHis: 0.468 ± 0.189
2.339ThrIle: 2.339 ± 0.495
2.164ThrLys: 2.164 ± 0.292
3.801ThrLeu: 3.801 ± 0.491
0.994ThrMet: 0.994 ± 0.219
1.637ThrAsn: 1.637 ± 0.259
3.918ThrPro: 3.918 ± 0.685
1.871ThrGln: 1.871 ± 0.361
3.333ThrArg: 3.333 ± 0.508
3.158ThrSer: 3.158 ± 0.673
2.982ThrThr: 2.982 ± 0.439
4.561ThrVal: 4.561 ± 0.559
0.643ThrTrp: 0.643 ± 0.207
1.345ThrTyr: 1.345 ± 0.253
0.0ThrXaa: 0.0 ± 0.0
Val
6.491ValAla: 6.491 ± 0.546
0.819ValCys: 0.819 ± 0.312
4.62ValAsp: 4.62 ± 0.588
4.269ValGlu: 4.269 ± 0.473
2.573ValPhe: 2.573 ± 0.422
4.327ValGly: 4.327 ± 0.506
1.228ValHis: 1.228 ± 0.414
2.105ValIle: 2.105 ± 0.39
2.281ValLys: 2.281 ± 0.456
4.678ValLeu: 4.678 ± 0.574
1.579ValMet: 1.579 ± 0.331
1.637ValAsn: 1.637 ± 0.245
2.807ValPro: 2.807 ± 0.384
2.281ValGln: 2.281 ± 0.412
4.21ValArg: 4.21 ± 0.421
3.626ValSer: 3.626 ± 0.493
4.21ValThr: 4.21 ± 0.679
3.976ValVal: 3.976 ± 0.472
0.819ValTrp: 0.819 ± 0.212
0.819ValTyr: 0.819 ± 0.179
0.0ValXaa: 0.0 ± 0.0
Trp
1.696TrpAla: 1.696 ± 0.305
0.351TrpCys: 0.351 ± 0.163
0.76TrpAsp: 0.76 ± 0.184
0.819TrpGlu: 0.819 ± 0.238
0.526TrpPhe: 0.526 ± 0.164
0.936TrpGly: 0.936 ± 0.214
0.292TrpHis: 0.292 ± 0.142
0.936TrpIle: 0.936 ± 0.337
0.994TrpLys: 0.994 ± 0.189
1.988TrpLeu: 1.988 ± 0.478
0.351TrpMet: 0.351 ± 0.178
0.468TrpAsn: 0.468 ± 0.164
0.819TrpPro: 0.819 ± 0.259
0.526TrpGln: 0.526 ± 0.167
1.111TrpArg: 1.111 ± 0.243
0.936TrpSer: 0.936 ± 0.253
0.643TrpThr: 0.643 ± 0.198
1.228TrpVal: 1.228 ± 0.377
0.351TrpTrp: 0.351 ± 0.159
0.702TrpTyr: 0.702 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.105TyrAla: 2.105 ± 0.285
0.175TyrCys: 0.175 ± 0.12
2.105TyrAsp: 2.105 ± 0.344
1.286TyrGlu: 1.286 ± 0.289
0.585TyrPhe: 0.585 ± 0.151
2.456TyrGly: 2.456 ± 0.415
0.409TyrHis: 0.409 ± 0.145
0.819TyrIle: 0.819 ± 0.194
1.111TyrLys: 1.111 ± 0.27
1.93TyrLeu: 1.93 ± 0.353
0.409TyrMet: 0.409 ± 0.173
0.702TyrAsn: 0.702 ± 0.228
0.994TyrPro: 0.994 ± 0.209
0.936TyrGln: 0.936 ± 0.317
1.637TyrArg: 1.637 ± 0.288
0.994TyrSer: 0.994 ± 0.22
0.643TyrThr: 0.643 ± 0.197
1.286TyrVal: 1.286 ± 0.273
0.351TyrTrp: 0.351 ± 0.224
0.76TyrTyr: 0.76 ± 0.204
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (17102 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
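A protein-level bootstrap can be sketched as follows. The exact procedure used for this table is not described here, so this is an illustration under stated assumptions: whole proteins are resampled with replacement, "×100" is read as 100 resamples, and the error is taken as the standard deviation of the resampled frequencies. The names `dipeptide_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each resample draws len(sequences) whole proteins with
    replacement, so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, made up for illustration (one-letter codes).
mean, err = bootstrap_error(["MAAK", "AAGAA", "GKLV"], "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual residues.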

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski