Amino acid dipepetide frequency for Pebjah virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.941AlaAla: 7.941 ± 0.907
3.387AlaCys: 3.387 ± 0.504
3.153AlaAsp: 3.153 ± 0.537
2.569AlaGlu: 2.569 ± 0.482
3.387AlaPhe: 3.387 ± 0.88
4.788AlaGly: 4.788 ± 0.574
1.635AlaHis: 1.635 ± 0.317
5.956AlaIle: 5.956 ± 0.931
3.153AlaLys: 3.153 ± 0.573
10.16AlaLeu: 10.16 ± 1.035
1.051AlaMet: 1.051 ± 0.35
2.569AlaAsn: 2.569 ± 0.392
5.839AlaPro: 5.839 ± 1.271
3.503AlaGln: 3.503 ± 0.378
5.956AlaArg: 5.956 ± 0.883
6.773AlaSer: 6.773 ± 1.201
5.255AlaThr: 5.255 ± 0.671
7.708AlaVal: 7.708 ± 0.833
1.518AlaTrp: 1.518 ± 0.387
2.92AlaTyr: 2.92 ± 0.476
0.0AlaXaa: 0.0 ± 0.0
Cys
3.036CysAla: 3.036 ± 0.402
1.635CysCys: 1.635 ± 0.587
1.635CysAsp: 1.635 ± 0.317
0.701CysGlu: 0.701 ± 0.244
1.752CysPhe: 1.752 ± 0.641
3.153CysGly: 3.153 ± 0.457
1.168CysHis: 1.168 ± 0.299
2.219CysIle: 2.219 ± 0.417
1.051CysLys: 1.051 ± 0.313
2.686CysLeu: 2.686 ± 0.564
1.168CysMet: 1.168 ± 0.534
1.285CysAsn: 1.285 ± 0.345
2.219CysPro: 2.219 ± 0.501
0.234CysGln: 0.234 ± 0.163
1.752CysArg: 1.752 ± 0.564
2.569CysSer: 2.569 ± 0.367
1.285CysThr: 1.285 ± 0.374
2.102CysVal: 2.102 ± 0.446
1.752CysTrp: 1.752 ± 0.469
1.051CysTyr: 1.051 ± 0.326
0.0CysXaa: 0.0 ± 0.0
Asp
3.854AspAla: 3.854 ± 0.802
1.168AspCys: 1.168 ± 0.296
2.336AspAsp: 2.336 ± 0.498
1.635AspGlu: 1.635 ± 0.401
2.102AspPhe: 2.102 ± 0.407
2.686AspGly: 2.686 ± 0.486
1.168AspHis: 1.168 ± 0.421
1.869AspIle: 1.869 ± 0.569
1.635AspLys: 1.635 ± 0.373
4.438AspLeu: 4.438 ± 0.671
0.817AspMet: 0.817 ± 0.23
1.635AspAsn: 1.635 ± 0.324
3.971AspPro: 3.971 ± 0.763
1.051AspGln: 1.051 ± 0.519
2.219AspArg: 2.219 ± 0.785
4.321AspSer: 4.321 ± 0.796
1.635AspThr: 1.635 ± 0.383
3.62AspVal: 3.62 ± 0.587
0.584AspTrp: 0.584 ± 0.239
1.168AspTyr: 1.168 ± 0.253
0.0AspXaa: 0.0 ± 0.0
Glu
3.27GluAla: 3.27 ± 0.468
0.467GluCys: 0.467 ± 0.125
1.051GluAsp: 1.051 ± 0.28
2.219GluGlu: 2.219 ± 0.407
0.234GluPhe: 0.234 ± 0.199
3.036GluGly: 3.036 ± 0.799
0.701GluHis: 0.701 ± 0.183
2.452GluIle: 2.452 ± 0.45
2.803GluLys: 2.803 ± 0.627
3.387GluLeu: 3.387 ± 0.555
0.701GluMet: 0.701 ± 0.314
0.701GluAsn: 0.701 ± 0.304
1.752GluPro: 1.752 ± 0.285
1.752GluGln: 1.752 ± 0.581
1.985GluArg: 1.985 ± 0.371
3.737GluSer: 3.737 ± 0.44
1.985GluThr: 1.985 ± 0.69
1.985GluVal: 1.985 ± 0.582
0.234GluTrp: 0.234 ± 0.311
1.752GluTyr: 1.752 ± 0.607
0.0GluXaa: 0.0 ± 0.0
Phe
3.971PheAla: 3.971 ± 1.027
1.985PheCys: 1.985 ± 0.47
1.401PheAsp: 1.401 ± 0.564
2.102PheGlu: 2.102 ± 0.281
2.336PhePhe: 2.336 ± 0.933
3.737PheGly: 3.737 ± 0.604
1.285PheHis: 1.285 ± 0.644
2.219PheIle: 2.219 ± 0.307
1.051PheLys: 1.051 ± 0.446
3.387PheLeu: 3.387 ± 0.791
0.234PheMet: 0.234 ± 0.306
1.051PheAsn: 1.051 ± 0.398
1.869PhePro: 1.869 ± 0.351
1.401PheGln: 1.401 ± 0.34
1.401PheArg: 1.401 ± 0.703
3.503PheSer: 3.503 ± 1.534
2.686PheThr: 2.686 ± 0.716
2.92PheVal: 2.92 ± 0.357
0.35PheTrp: 0.35 ± 0.379
1.168PheTyr: 1.168 ± 0.289
0.0PheXaa: 0.0 ± 0.0
Gly
4.554GlyAla: 4.554 ± 0.711
3.62GlyCys: 3.62 ± 0.883
3.503GlyAsp: 3.503 ± 1.047
1.051GlyGlu: 1.051 ± 0.27
3.62GlyPhe: 3.62 ± 0.432
4.438GlyGly: 4.438 ± 0.403
1.752GlyHis: 1.752 ± 0.353
3.153GlyIle: 3.153 ± 0.689
4.438GlyLys: 4.438 ± 0.907
5.839GlyLeu: 5.839 ± 0.875
0.934GlyMet: 0.934 ± 0.391
1.518GlyAsn: 1.518 ± 0.688
3.62GlyPro: 3.62 ± 0.465
1.518GlyGln: 1.518 ± 0.629
3.62GlyArg: 3.62 ± 0.555
6.423GlySer: 6.423 ± 0.811
4.087GlyThr: 4.087 ± 0.676
6.54GlyVal: 6.54 ± 1.271
0.117GlyTrp: 0.117 ± 0.209
2.686GlyTyr: 2.686 ± 0.793
0.0GlyXaa: 0.0 ± 0.0
His
2.803HisAla: 2.803 ± 0.384
1.285HisCys: 1.285 ± 0.361
0.584HisAsp: 0.584 ± 0.308
0.817HisGlu: 0.817 ± 0.224
0.817HisPhe: 0.817 ± 0.244
1.635HisGly: 1.635 ± 0.319
0.467HisHis: 0.467 ± 0.281
1.635HisIle: 1.635 ± 0.509
0.817HisLys: 0.817 ± 0.196
2.803HisLeu: 2.803 ± 0.412
0.701HisMet: 0.701 ± 0.337
0.817HisAsn: 0.817 ± 0.465
1.518HisPro: 1.518 ± 0.653
0.584HisGln: 0.584 ± 0.381
0.934HisArg: 0.934 ± 0.396
1.869HisSer: 1.869 ± 0.316
1.401HisThr: 1.401 ± 0.877
2.452HisVal: 2.452 ± 0.292
1.051HisTrp: 1.051 ± 0.346
1.051HisTyr: 1.051 ± 0.383
0.0HisXaa: 0.0 ± 0.0
Ile
5.255IleAla: 5.255 ± 0.774
2.102IleCys: 2.102 ± 1.096
1.635IleAsp: 1.635 ± 0.498
1.168IleGlu: 1.168 ± 0.37
1.635IlePhe: 1.635 ± 0.524
2.92IleGly: 2.92 ± 0.623
1.518IleHis: 1.518 ± 0.302
3.27IleIle: 3.27 ± 0.827
0.817IleLys: 0.817 ± 0.22
5.138IleLeu: 5.138 ± 0.925
0.117IleMet: 0.117 ± 0.241
1.869IleAsn: 1.869 ± 0.355
2.102IlePro: 2.102 ± 0.363
1.985IleGln: 1.985 ± 0.287
2.92IleArg: 2.92 ± 0.693
5.022IleSer: 5.022 ± 0.757
3.153IleThr: 3.153 ± 0.513
3.387IleVal: 3.387 ± 0.773
0.0IleTrp: 0.0 ± 0.0
1.518IleTyr: 1.518 ± 0.441
0.0IleXaa: 0.0 ± 0.0
Lys
3.387LysAla: 3.387 ± 0.673
1.168LysCys: 1.168 ± 0.281
0.584LysAsp: 0.584 ± 0.289
0.934LysGlu: 0.934 ± 0.307
1.168LysPhe: 1.168 ± 0.579
2.102LysGly: 2.102 ± 0.36
0.701LysHis: 0.701 ± 0.171
1.635LysIle: 1.635 ± 0.251
2.219LysLys: 2.219 ± 0.539
3.737LysLeu: 3.737 ± 0.574
0.467LysMet: 0.467 ± 0.306
1.168LysAsn: 1.168 ± 0.428
2.92LysPro: 2.92 ± 0.623
1.168LysGln: 1.168 ± 0.363
2.336LysArg: 2.336 ± 0.711
2.92LysSer: 2.92 ± 0.469
2.452LysThr: 2.452 ± 0.552
2.686LysVal: 2.686 ± 0.652
0.817LysTrp: 0.817 ± 0.267
1.401LysTyr: 1.401 ± 0.297
0.0LysXaa: 0.0 ± 0.0
Leu
12.729LeuAla: 12.729 ± 0.686
3.27LeuCys: 3.27 ± 0.613
4.087LeuAsp: 4.087 ± 0.901
3.503LeuGlu: 3.503 ± 0.602
4.204LeuPhe: 4.204 ± 0.84
5.606LeuGly: 5.606 ± 0.839
2.569LeuHis: 2.569 ± 0.398
3.036LeuIle: 3.036 ± 0.565
2.569LeuLys: 2.569 ± 0.756
15.649LeuLeu: 15.649 ± 2.536
1.401LeuMet: 1.401 ± 0.42
3.036LeuAsn: 3.036 ± 0.384
9.576LeuPro: 9.576 ± 1.406
3.503LeuGln: 3.503 ± 0.777
5.839LeuArg: 5.839 ± 1.027
9.226LeuSer: 9.226 ± 1.288
5.489LeuThr: 5.489 ± 0.82
8.058LeuVal: 8.058 ± 0.758
1.285LeuTrp: 1.285 ± 0.246
3.854LeuTyr: 3.854 ± 1.117
0.0LeuXaa: 0.0 ± 0.0
Met
1.168MetAla: 1.168 ± 0.834
0.701MetCys: 0.701 ± 0.352
0.467MetAsp: 0.467 ± 0.218
0.701MetGlu: 0.701 ± 0.232
0.584MetPhe: 0.584 ± 0.222
1.752MetGly: 1.752 ± 0.362
0.584MetHis: 0.584 ± 0.186
1.051MetIle: 1.051 ± 0.377
0.467MetLys: 0.467 ± 0.309
1.869MetLeu: 1.869 ± 1.041
0.117MetMet: 0.117 ± 0.241
0.234MetAsn: 0.234 ± 0.163
0.467MetPro: 0.467 ± 0.194
0.35MetGln: 0.35 ± 0.16
0.701MetArg: 0.701 ± 0.197
0.817MetSer: 0.817 ± 0.327
0.35MetThr: 0.35 ± 0.12
1.401MetVal: 1.401 ± 0.259
0.35MetTrp: 0.35 ± 0.12
0.35MetTyr: 0.35 ± 0.348
0.0MetXaa: 0.0 ± 0.0
Asn
2.569AsnAla: 2.569 ± 0.663
1.168AsnCys: 1.168 ± 0.511
2.102AsnAsp: 2.102 ± 0.419
1.985AsnGlu: 1.985 ± 0.431
0.817AsnPhe: 0.817 ± 0.637
2.336AsnGly: 2.336 ± 0.427
1.285AsnHis: 1.285 ± 0.89
1.285AsnIle: 1.285 ± 0.621
0.584AsnLys: 0.584 ± 0.22
2.336AsnLeu: 2.336 ± 0.628
0.0AsnMet: 0.0 ± 0.0
1.401AsnAsn: 1.401 ± 0.73
1.869AsnPro: 1.869 ± 0.476
1.168AsnGln: 1.168 ± 0.232
1.985AsnArg: 1.985 ± 0.256
1.752AsnSer: 1.752 ± 0.534
2.452AsnThr: 2.452 ± 0.436
2.686AsnVal: 2.686 ± 0.504
0.0AsnTrp: 0.0 ± 0.0
0.467AsnTyr: 0.467 ± 0.457
0.0AsnXaa: 0.0 ± 0.0
Pro
6.189ProAla: 6.189 ± 1.012
2.452ProCys: 2.452 ± 0.706
4.087ProAsp: 4.087 ± 0.81
3.854ProGlu: 3.854 ± 0.714
2.452ProPhe: 2.452 ± 0.386
4.788ProGly: 4.788 ± 0.658
2.102ProHis: 2.102 ± 0.414
2.336ProIle: 2.336 ± 0.588
1.985ProLys: 1.985 ± 0.404
5.372ProLeu: 5.372 ± 0.8
1.285ProMet: 1.285 ± 0.246
1.869ProAsn: 1.869 ± 0.337
3.971ProPro: 3.971 ± 0.894
2.686ProGln: 2.686 ± 0.719
3.737ProArg: 3.737 ± 0.785
6.657ProSer: 6.657 ± 0.737
3.854ProThr: 3.854 ± 0.46
4.554ProVal: 4.554 ± 0.809
0.584ProTrp: 0.584 ± 0.395
1.752ProTyr: 1.752 ± 0.445
0.0ProXaa: 0.0 ± 0.0
Gln
2.92GlnAla: 2.92 ± 0.51
0.584GlnCys: 0.584 ± 0.343
1.401GlnAsp: 1.401 ± 0.276
1.401GlnGlu: 1.401 ± 0.331
0.584GlnPhe: 0.584 ± 0.246
2.336GlnGly: 2.336 ± 0.669
1.285GlnHis: 1.285 ± 0.307
1.401GlnIle: 1.401 ± 0.391
0.934GlnLys: 0.934 ± 0.431
4.671GlnLeu: 4.671 ± 0.772
0.35GlnMet: 0.35 ± 0.12
0.467GlnAsn: 0.467 ± 0.55
2.219GlnPro: 2.219 ± 0.423
1.985GlnGln: 1.985 ± 0.436
2.569GlnArg: 2.569 ± 0.761
3.27GlnSer: 3.27 ± 0.831
0.934GlnThr: 0.934 ± 0.524
1.985GlnVal: 1.985 ± 0.614
0.584GlnTrp: 0.584 ± 0.168
1.752GlnTyr: 1.752 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
3.854ArgAla: 3.854 ± 0.48
1.285ArgCys: 1.285 ± 0.839
3.62ArgAsp: 3.62 ± 0.814
2.336ArgGlu: 2.336 ± 0.363
1.518ArgPhe: 1.518 ± 0.449
3.62ArgGly: 3.62 ± 0.413
1.752ArgHis: 1.752 ± 0.495
1.401ArgIle: 1.401 ± 0.211
2.102ArgLys: 2.102 ± 0.356
7.357ArgLeu: 7.357 ± 0.622
0.817ArgMet: 0.817 ± 0.304
1.869ArgAsn: 1.869 ± 0.651
4.204ArgPro: 4.204 ± 0.625
1.635ArgGln: 1.635 ± 0.465
3.971ArgArg: 3.971 ± 1.212
1.518ArgSer: 1.518 ± 0.241
2.92ArgThr: 2.92 ± 0.342
5.022ArgVal: 5.022 ± 0.663
0.584ArgTrp: 0.584 ± 0.293
2.803ArgTyr: 2.803 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
6.89SerAla: 6.89 ± 0.875
2.569SerCys: 2.569 ± 0.573
4.438SerAsp: 4.438 ± 0.546
1.518SerGlu: 1.518 ± 0.462
3.737SerPhe: 3.737 ± 1.103
6.189SerGly: 6.189 ± 0.813
2.452SerHis: 2.452 ± 0.471
3.62SerIle: 3.62 ± 0.973
3.854SerLys: 3.854 ± 0.478
8.875SerLeu: 8.875 ± 0.968
0.817SerMet: 0.817 ± 0.219
2.452SerAsn: 2.452 ± 0.511
6.306SerPro: 6.306 ± 0.772
1.752SerGln: 1.752 ± 0.362
3.387SerArg: 3.387 ± 0.432
9.693SerSer: 9.693 ± 2.652
4.788SerThr: 4.788 ± 0.878
4.788SerVal: 4.788 ± 0.802
1.635SerTrp: 1.635 ± 0.647
2.336SerTyr: 2.336 ± 0.933
0.0SerXaa: 0.0 ± 0.0
Thr
5.255ThrAla: 5.255 ± 1.201
0.817ThrCys: 0.817 ± 0.442
1.518ThrAsp: 1.518 ± 0.318
1.985ThrGlu: 1.985 ± 0.516
2.92ThrPhe: 2.92 ± 1.007
4.671ThrGly: 4.671 ± 0.97
1.285ThrHis: 1.285 ± 0.884
3.036ThrIle: 3.036 ± 0.458
2.452ThrLys: 2.452 ± 0.622
4.438ThrLeu: 4.438 ± 0.618
1.401ThrMet: 1.401 ± 0.784
2.686ThrAsn: 2.686 ± 0.791
5.255ThrPro: 5.255 ± 1.147
2.219ThrGln: 2.219 ± 0.374
2.219ThrArg: 2.219 ± 0.535
3.62ThrSer: 3.62 ± 0.829
4.438ThrThr: 4.438 ± 1.16
6.073ThrVal: 6.073 ± 1.075
0.817ThrTrp: 0.817 ± 0.373
2.102ThrTyr: 2.102 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
7.007ValAla: 7.007 ± 0.712
3.854ValCys: 3.854 ± 0.562
2.452ValAsp: 2.452 ± 0.413
4.087ValGlu: 4.087 ± 0.472
3.153ValPhe: 3.153 ± 0.497
4.905ValGly: 4.905 ± 0.955
1.401ValHis: 1.401 ± 0.312
3.503ValIle: 3.503 ± 0.48
1.869ValLys: 1.869 ± 0.822
8.642ValLeu: 8.642 ± 1.21
1.635ValMet: 1.635 ± 0.412
2.336ValAsn: 2.336 ± 1.163
5.138ValPro: 5.138 ± 0.729
1.518ValGln: 1.518 ± 0.31
4.087ValArg: 4.087 ± 0.939
5.489ValSer: 5.489 ± 0.469
6.657ValThr: 6.657 ± 0.717
6.54ValVal: 6.54 ± 0.996
0.817ValTrp: 0.817 ± 0.245
4.087ValTyr: 4.087 ± 0.528
0.0ValXaa: 0.0 ± 0.0
Trp
1.401TrpAla: 1.401 ± 0.743
0.117TrpCys: 0.117 ± 0.157
0.584TrpAsp: 0.584 ± 0.239
0.0TrpGlu: 0.0 ± 0.0
0.817TrpPhe: 0.817 ± 0.199
0.701TrpGly: 0.701 ± 0.241
0.117TrpHis: 0.117 ± 0.153
0.35TrpIle: 0.35 ± 0.12
0.117TrpLys: 0.117 ± 0.241
1.985TrpLeu: 1.985 ± 0.372
0.234TrpMet: 0.234 ± 0.454
0.35TrpAsn: 0.35 ± 0.175
1.168TrpPro: 1.168 ± 0.333
0.701TrpGln: 0.701 ± 0.271
0.701TrpArg: 0.701 ± 0.183
0.35TrpSer: 0.35 ± 0.413
1.635TrpThr: 1.635 ± 0.582
1.401TrpVal: 1.401 ± 0.311
0.0TrpTrp: 0.0 ± 0.0
1.051TrpTyr: 1.051 ± 0.361
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.168TyrAla: 1.168 ± 0.995
0.817TyrCys: 0.817 ± 0.279
3.27TyrAsp: 3.27 ± 0.608
1.635TyrGlu: 1.635 ± 0.376
2.219TyrPhe: 2.219 ± 0.586
1.635TyrGly: 1.635 ± 0.571
0.817TyrHis: 0.817 ± 0.379
2.102TyrIle: 2.102 ± 0.424
1.051TyrLys: 1.051 ± 0.267
5.839TyrLeu: 5.839 ± 1.127
0.117TyrMet: 0.117 ± 0.077
0.934TyrAsn: 0.934 ± 0.267
0.584TyrPro: 0.584 ± 0.337
2.92TyrGln: 2.92 ± 0.587
1.869TyrArg: 1.869 ± 0.5
2.569TyrSer: 2.569 ± 0.451
1.752TyrThr: 1.752 ± 0.312
3.27TyrVal: 3.27 ± 0.52
0.701TyrTrp: 0.701 ± 0.183
2.452TyrTyr: 2.452 ± 0.905
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (8564 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski