Amino acid dipepetide frequency for Gordonia phage LittleFella

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.776AlaAla: 9.776 ± 1.154
0.391AlaCys: 0.391 ± 0.161
5.425AlaAsp: 5.425 ± 0.563
7.478AlaGlu: 7.478 ± 0.935
2.933AlaPhe: 2.933 ± 0.679
6.843AlaGly: 6.843 ± 0.718
1.808AlaHis: 1.808 ± 0.294
4.937AlaIle: 4.937 ± 0.666
5.523AlaLys: 5.523 ± 0.596
7.674AlaLeu: 7.674 ± 0.666
3.324AlaMet: 3.324 ± 0.635
3.275AlaAsn: 3.275 ± 0.508
4.008AlaPro: 4.008 ± 0.47
4.057AlaGln: 4.057 ± 0.586
5.768AlaArg: 5.768 ± 0.591
4.937AlaSer: 4.937 ± 0.592
5.132AlaThr: 5.132 ± 0.571
6.403AlaVal: 6.403 ± 0.938
1.466AlaTrp: 1.466 ± 0.298
2.884AlaTyr: 2.884 ± 0.499
0.0AlaXaa: 0.0 ± 0.0
Cys
0.244CysAla: 0.244 ± 0.102
0.049CysCys: 0.049 ± 0.06
0.489CysAsp: 0.489 ± 0.146
0.342CysGlu: 0.342 ± 0.135
0.196CysPhe: 0.196 ± 0.131
0.978CysGly: 0.978 ± 0.265
0.147CysHis: 0.147 ± 0.094
0.244CysIle: 0.244 ± 0.117
0.244CysLys: 0.244 ± 0.11
0.44CysLeu: 0.44 ± 0.162
0.098CysMet: 0.098 ± 0.064
0.293CysAsn: 0.293 ± 0.13
0.391CysPro: 0.391 ± 0.17
0.098CysGln: 0.098 ± 0.067
0.684CysArg: 0.684 ± 0.2
0.684CysSer: 0.684 ± 0.195
0.587CysThr: 0.587 ± 0.148
0.342CysVal: 0.342 ± 0.154
0.098CysTrp: 0.098 ± 0.073
0.196CysTyr: 0.196 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
4.986AspAla: 4.986 ± 0.448
0.44AspCys: 0.44 ± 0.124
5.034AspAsp: 5.034 ± 1.108
6.696AspGlu: 6.696 ± 0.871
2.395AspPhe: 2.395 ± 0.437
4.546AspGly: 4.546 ± 0.744
1.271AspHis: 1.271 ± 0.253
2.639AspIle: 2.639 ± 0.346
3.959AspLys: 3.959 ± 0.573
5.523AspLeu: 5.523 ± 0.595
1.515AspMet: 1.515 ± 0.242
2.102AspAsn: 2.102 ± 0.294
3.764AspPro: 3.764 ± 0.463
2.591AspGln: 2.591 ± 0.383
3.764AspArg: 3.764 ± 0.468
3.079AspSer: 3.079 ± 0.4
2.151AspThr: 2.151 ± 0.412
4.155AspVal: 4.155 ± 0.512
1.32AspTrp: 1.32 ± 0.289
2.102AspTyr: 2.102 ± 0.374
0.0AspXaa: 0.0 ± 0.0
Glu
7.185GluAla: 7.185 ± 0.539
0.293GluCys: 0.293 ± 0.163
5.914GluAsp: 5.914 ± 0.84
7.283GluGlu: 7.283 ± 1.117
2.688GluPhe: 2.688 ± 0.44
5.914GluGly: 5.914 ± 0.706
1.515GluHis: 1.515 ± 0.31
3.666GluIle: 3.666 ± 0.486
4.497GluLys: 4.497 ± 0.509
5.523GluLeu: 5.523 ± 0.575
3.079GluMet: 3.079 ± 0.482
2.591GluAsn: 2.591 ± 0.385
2.835GluPro: 2.835 ± 0.604
2.884GluGln: 2.884 ± 0.39
3.91GluArg: 3.91 ± 0.487
2.591GluSer: 2.591 ± 0.384
3.177GluThr: 3.177 ± 0.519
3.47GluVal: 3.47 ± 0.538
1.711GluTrp: 1.711 ± 0.309
2.151GluTyr: 2.151 ± 0.394
0.0GluXaa: 0.0 ± 0.0
Phe
3.226PheAla: 3.226 ± 0.421
0.293PheCys: 0.293 ± 0.121
2.102PheAsp: 2.102 ± 0.271
1.906PheGlu: 1.906 ± 0.284
1.222PhePhe: 1.222 ± 0.266
2.737PheGly: 2.737 ± 0.401
0.88PheHis: 0.88 ± 0.237
1.906PheIle: 1.906 ± 0.379
1.613PheLys: 1.613 ± 0.276
2.395PheLeu: 2.395 ± 0.303
0.635PheMet: 0.635 ± 0.228
1.417PheAsn: 1.417 ± 0.274
1.711PhePro: 1.711 ± 0.288
1.564PheGln: 1.564 ± 0.209
2.053PheArg: 2.053 ± 0.245
2.346PheSer: 2.346 ± 0.35
1.613PheThr: 1.613 ± 0.275
2.248PheVal: 2.248 ± 0.279
0.244PheTrp: 0.244 ± 0.113
0.978PheTyr: 0.978 ± 0.181
0.0PheXaa: 0.0 ± 0.0
Gly
6.55GlyAla: 6.55 ± 0.93
0.538GlyCys: 0.538 ± 0.222
5.523GlyAsp: 5.523 ± 1.022
5.23GlyGlu: 5.23 ± 0.435
2.591GlyPhe: 2.591 ± 0.408
7.723GlyGly: 7.723 ± 0.822
1.369GlyHis: 1.369 ± 0.278
5.328GlyIle: 5.328 ± 0.745
5.132GlyLys: 5.132 ± 0.479
6.647GlyLeu: 6.647 ± 0.66
2.151GlyMet: 2.151 ± 0.355
3.666GlyAsn: 3.666 ± 0.423
3.764GlyPro: 3.764 ± 0.618
3.861GlyGln: 3.861 ± 0.482
4.497GlyArg: 4.497 ± 0.524
5.914GlySer: 5.914 ± 0.655
4.595GlyThr: 4.595 ± 0.558
7.723GlyVal: 7.723 ± 0.587
1.417GlyTrp: 1.417 ± 0.288
2.639GlyTyr: 2.639 ± 0.469
0.0GlyXaa: 0.0 ± 0.0
His
1.613HisAla: 1.613 ± 0.27
0.098HisCys: 0.098 ± 0.073
0.978HisAsp: 0.978 ± 0.239
1.124HisGlu: 1.124 ± 0.301
0.489HisPhe: 0.489 ± 0.175
1.906HisGly: 1.906 ± 0.329
0.244HisHis: 0.244 ± 0.105
1.173HisIle: 1.173 ± 0.214
0.782HisLys: 0.782 ± 0.211
1.613HisLeu: 1.613 ± 0.28
0.44HisMet: 0.44 ± 0.157
0.831HisAsn: 0.831 ± 0.19
1.222HisPro: 1.222 ± 0.28
0.684HisGln: 0.684 ± 0.159
1.124HisArg: 1.124 ± 0.276
0.978HisSer: 0.978 ± 0.279
1.222HisThr: 1.222 ± 0.327
1.075HisVal: 1.075 ± 0.269
0.391HisTrp: 0.391 ± 0.154
0.538HisTyr: 0.538 ± 0.171
0.0HisXaa: 0.0 ± 0.0
Ile
4.301IleAla: 4.301 ± 0.661
0.391IleCys: 0.391 ± 0.155
3.324IleAsp: 3.324 ± 0.432
2.982IleGlu: 2.982 ± 0.332
1.222IlePhe: 1.222 ± 0.226
4.986IleGly: 4.986 ± 0.594
1.075IleHis: 1.075 ± 0.173
2.004IleIle: 2.004 ± 0.315
2.444IleLys: 2.444 ± 0.387
3.666IleLeu: 3.666 ± 0.444
1.222IleMet: 1.222 ± 0.244
2.102IleAsn: 2.102 ± 0.344
2.444IlePro: 2.444 ± 0.356
2.346IleGln: 2.346 ± 0.435
3.617IleArg: 3.617 ± 0.428
3.568IleSer: 3.568 ± 0.492
3.079IleThr: 3.079 ± 0.326
3.959IleVal: 3.959 ± 0.523
0.88IleTrp: 0.88 ± 0.233
1.173IleTyr: 1.173 ± 0.316
0.0IleXaa: 0.0 ± 0.0
Lys
6.745LysAla: 6.745 ± 0.592
0.44LysCys: 0.44 ± 0.159
3.128LysAsp: 3.128 ± 0.397
3.813LysGlu: 3.813 ± 0.402
2.346LysPhe: 2.346 ± 0.352
5.377LysGly: 5.377 ± 0.929
0.831LysHis: 0.831 ± 0.252
2.542LysIle: 2.542 ± 0.326
4.497LysLys: 4.497 ± 0.534
3.861LysLeu: 3.861 ± 0.412
1.466LysMet: 1.466 ± 0.267
2.102LysAsn: 2.102 ± 0.364
2.786LysPro: 2.786 ± 0.387
2.102LysGln: 2.102 ± 0.358
4.252LysArg: 4.252 ± 0.629
2.346LysSer: 2.346 ± 0.376
2.542LysThr: 2.542 ± 0.409
2.933LysVal: 2.933 ± 0.436
0.782LysTrp: 0.782 ± 0.203
1.564LysTyr: 1.564 ± 0.298
0.0LysXaa: 0.0 ± 0.0
Leu
7.381LeuAla: 7.381 ± 0.549
0.831LeuCys: 0.831 ± 0.226
5.132LeuAsp: 5.132 ± 0.483
4.399LeuGlu: 4.399 ± 0.534
1.857LeuPhe: 1.857 ± 0.328
6.305LeuGly: 6.305 ± 0.543
1.417LeuHis: 1.417 ± 0.35
3.519LeuIle: 3.519 ± 0.522
4.497LeuLys: 4.497 ± 0.488
4.595LeuLeu: 4.595 ± 0.425
1.417LeuMet: 1.417 ± 0.299
3.177LeuAsn: 3.177 ± 0.355
4.204LeuPro: 4.204 ± 0.474
3.226LeuGln: 3.226 ± 0.49
6.012LeuArg: 6.012 ± 0.468
4.79LeuSer: 4.79 ± 0.417
4.252LeuThr: 4.252 ± 0.426
4.35LeuVal: 4.35 ± 0.462
0.978LeuTrp: 0.978 ± 0.254
1.466LeuTyr: 1.466 ± 0.27
0.0LeuXaa: 0.0 ± 0.0
Met
3.959MetAla: 3.959 ± 0.439
0.147MetCys: 0.147 ± 0.089
1.173MetAsp: 1.173 ± 0.294
2.248MetGlu: 2.248 ± 0.344
1.026MetPhe: 1.026 ± 0.206
1.613MetGly: 1.613 ± 0.319
0.293MetHis: 0.293 ± 0.117
1.026MetIle: 1.026 ± 0.314
1.711MetLys: 1.711 ± 0.355
1.515MetLeu: 1.515 ± 0.372
0.635MetMet: 0.635 ± 0.194
0.88MetAsn: 0.88 ± 0.172
1.124MetPro: 1.124 ± 0.275
1.173MetGln: 1.173 ± 0.215
1.32MetArg: 1.32 ± 0.271
1.711MetSer: 1.711 ± 0.399
2.053MetThr: 2.053 ± 0.257
1.222MetVal: 1.222 ± 0.267
0.538MetTrp: 0.538 ± 0.206
0.538MetTyr: 0.538 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
4.643AsnAla: 4.643 ± 0.583
0.147AsnCys: 0.147 ± 0.073
2.053AsnAsp: 2.053 ± 0.403
2.151AsnGlu: 2.151 ± 0.268
1.613AsnPhe: 1.613 ± 0.37
3.324AsnGly: 3.324 ± 0.431
0.44AsnHis: 0.44 ± 0.164
1.76AsnIle: 1.76 ± 0.292
1.711AsnLys: 1.711 ± 0.333
2.444AsnLeu: 2.444 ± 0.348
1.075AsnMet: 1.075 ± 0.259
1.369AsnAsn: 1.369 ± 0.252
2.297AsnPro: 2.297 ± 0.452
1.564AsnGln: 1.564 ± 0.328
3.128AsnArg: 3.128 ± 0.393
2.102AsnSer: 2.102 ± 0.293
2.297AsnThr: 2.297 ± 0.325
2.933AsnVal: 2.933 ± 0.339
0.684AsnTrp: 0.684 ± 0.177
1.32AsnTyr: 1.32 ± 0.25
0.0AsnXaa: 0.0 ± 0.0
Pro
4.106ProAla: 4.106 ± 0.63
0.098ProCys: 0.098 ± 0.073
3.177ProAsp: 3.177 ± 0.395
5.083ProGlu: 5.083 ± 0.547
1.417ProPhe: 1.417 ± 0.26
4.35ProGly: 4.35 ± 0.422
1.222ProHis: 1.222 ± 0.283
2.493ProIle: 2.493 ± 0.467
2.2ProLys: 2.2 ± 0.36
3.47ProLeu: 3.47 ± 0.376
1.369ProMet: 1.369 ± 0.204
1.564ProAsn: 1.564 ± 0.328
2.102ProPro: 2.102 ± 0.357
2.248ProGln: 2.248 ± 0.367
2.151ProArg: 2.151 ± 0.315
2.835ProSer: 2.835 ± 0.43
3.421ProThr: 3.421 ± 0.507
3.177ProVal: 3.177 ± 0.458
1.124ProTrp: 1.124 ± 0.328
0.978ProTyr: 0.978 ± 0.263
0.0ProXaa: 0.0 ± 0.0
Gln
4.301GlnAla: 4.301 ± 0.614
0.196GlnCys: 0.196 ± 0.125
1.906GlnAsp: 1.906 ± 0.345
2.493GlnGlu: 2.493 ± 0.353
1.564GlnPhe: 1.564 ± 0.265
4.79GlnGly: 4.79 ± 1.132
0.684GlnHis: 0.684 ± 0.191
2.639GlnIle: 2.639 ± 0.418
1.808GlnLys: 1.808 ± 0.302
3.861GlnLeu: 3.861 ± 0.516
0.831GlnMet: 0.831 ± 0.236
1.466GlnAsn: 1.466 ± 0.262
1.515GlnPro: 1.515 ± 0.363
2.102GlnGln: 2.102 ± 0.462
2.835GlnArg: 2.835 ± 0.355
1.906GlnSer: 1.906 ± 0.248
1.417GlnThr: 1.417 ± 0.281
3.324GlnVal: 3.324 ± 0.501
0.538GlnTrp: 0.538 ± 0.155
1.711GlnTyr: 1.711 ± 0.286
0.0GlnXaa: 0.0 ± 0.0
Arg
5.328ArgAla: 5.328 ± 0.637
0.635ArgCys: 0.635 ± 0.208
4.692ArgAsp: 4.692 ± 0.483
4.546ArgGlu: 4.546 ± 0.43
2.639ArgPhe: 2.639 ± 0.375
4.986ArgGly: 4.986 ± 0.52
1.075ArgHis: 1.075 ± 0.278
3.715ArgIle: 3.715 ± 0.523
4.301ArgLys: 4.301 ± 0.601
3.91ArgLeu: 3.91 ± 0.383
1.808ArgMet: 1.808 ± 0.233
3.177ArgAsn: 3.177 ± 0.362
2.737ArgPro: 2.737 ± 0.324
2.786ArgGln: 2.786 ± 0.399
4.546ArgArg: 4.546 ± 0.749
3.275ArgSer: 3.275 ± 0.415
3.128ArgThr: 3.128 ± 0.375
4.399ArgVal: 4.399 ± 0.377
0.929ArgTrp: 0.929 ± 0.177
2.102ArgTyr: 2.102 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
4.692SerAla: 4.692 ± 0.458
0.391SerCys: 0.391 ± 0.166
3.617SerAsp: 3.617 ± 0.501
4.204SerGlu: 4.204 ± 0.514
1.417SerPhe: 1.417 ± 0.235
5.768SerGly: 5.768 ± 0.643
0.929SerHis: 0.929 ± 0.257
2.982SerIle: 2.982 ± 0.448
3.177SerLys: 3.177 ± 0.449
3.764SerLeu: 3.764 ± 0.447
1.662SerMet: 1.662 ± 0.298
1.857SerAsn: 1.857 ± 0.369
2.2SerPro: 2.2 ± 0.352
2.346SerGln: 2.346 ± 0.359
3.764SerArg: 3.764 ± 0.507
2.639SerSer: 2.639 ± 0.348
3.764SerThr: 3.764 ± 0.473
3.079SerVal: 3.079 ± 0.428
1.32SerTrp: 1.32 ± 0.281
1.857SerTyr: 1.857 ± 0.309
0.0SerXaa: 0.0 ± 0.0
Thr
5.621ThrAla: 5.621 ± 0.647
0.44ThrCys: 0.44 ± 0.186
2.737ThrAsp: 2.737 ± 0.354
3.275ThrGlu: 3.275 ± 0.517
2.004ThrPhe: 2.004 ± 0.312
5.817ThrGly: 5.817 ± 0.615
1.173ThrHis: 1.173 ± 0.226
2.835ThrIle: 2.835 ± 0.354
2.151ThrLys: 2.151 ± 0.368
3.715ThrLeu: 3.715 ± 0.394
1.124ThrMet: 1.124 ± 0.236
2.2ThrAsn: 2.2 ± 0.399
3.519ThrPro: 3.519 ± 0.462
1.857ThrGln: 1.857 ± 0.321
3.079ThrArg: 3.079 ± 0.382
3.373ThrSer: 3.373 ± 0.392
3.421ThrThr: 3.421 ± 0.485
3.47ThrVal: 3.47 ± 0.451
0.978ThrTrp: 0.978 ± 0.244
1.613ThrTyr: 1.613 ± 0.351
0.0ThrXaa: 0.0 ± 0.0
Val
6.012ValAla: 6.012 ± 1.008
0.538ValCys: 0.538 ± 0.156
3.959ValAsp: 3.959 ± 0.43
4.839ValGlu: 4.839 ± 0.537
1.857ValPhe: 1.857 ± 0.372
5.328ValGly: 5.328 ± 0.591
0.978ValHis: 0.978 ± 0.21
3.715ValIle: 3.715 ± 0.461
4.301ValLys: 4.301 ± 0.549
4.741ValLeu: 4.741 ± 0.4
1.369ValMet: 1.369 ± 0.229
2.639ValAsn: 2.639 ± 0.359
3.568ValPro: 3.568 ± 0.538
2.297ValGln: 2.297 ± 0.309
4.595ValArg: 4.595 ± 0.694
3.47ValSer: 3.47 ± 0.469
3.764ValThr: 3.764 ± 0.437
4.888ValVal: 4.888 ± 0.648
1.32ValTrp: 1.32 ± 0.256
1.76ValTyr: 1.76 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
1.369TrpAla: 1.369 ± 0.251
0.244TrpCys: 0.244 ± 0.101
1.271TrpAsp: 1.271 ± 0.25
0.88TrpGlu: 0.88 ± 0.2
0.587TrpPhe: 0.587 ± 0.17
1.124TrpGly: 1.124 ± 0.221
0.391TrpHis: 0.391 ± 0.139
0.831TrpIle: 0.831 ± 0.212
0.733TrpLys: 0.733 ± 0.223
1.662TrpLeu: 1.662 ± 0.321
0.196TrpMet: 0.196 ± 0.084
1.173TrpAsn: 1.173 ± 0.268
0.782TrpPro: 0.782 ± 0.222
0.782TrpGln: 0.782 ± 0.208
1.222TrpArg: 1.222 ± 0.271
1.222TrpSer: 1.222 ± 0.258
1.271TrpThr: 1.271 ± 0.255
0.88TrpVal: 0.88 ± 0.212
0.293TrpTrp: 0.293 ± 0.104
0.635TrpTyr: 0.635 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.004TyrAla: 2.004 ± 0.468
0.244TyrCys: 0.244 ± 0.111
2.395TyrAsp: 2.395 ± 0.358
1.906TyrGlu: 1.906 ± 0.366
1.026TyrPhe: 1.026 ± 0.288
2.444TyrGly: 2.444 ± 0.339
0.782TyrHis: 0.782 ± 0.212
0.831TyrIle: 0.831 ± 0.2
1.173TyrLys: 1.173 ± 0.247
2.835TyrLeu: 2.835 ± 0.298
0.44TyrMet: 0.44 ± 0.154
1.173TyrAsn: 1.173 ± 0.238
1.613TyrPro: 1.613 ± 0.407
1.32TyrGln: 1.32 ± 0.283
2.395TyrArg: 2.395 ± 0.326
1.662TyrSer: 1.662 ± 0.292
1.515TyrThr: 1.515 ± 0.308
1.906TyrVal: 1.906 ± 0.368
0.538TyrTrp: 0.538 ± 0.175
1.026TyrTyr: 1.026 ± 0.269
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (20460 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski