Amino acid dipepetide frequency for Avian orthoavulavirus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.024AlaAla: 8.024 ± 1.539
2.169AlaCys: 2.169 ± 0.473
5.205AlaAsp: 5.205 ± 0.744
4.121AlaGlu: 4.121 ± 0.818
1.952AlaPhe: 1.952 ± 0.553
3.687AlaGly: 3.687 ± 1.235
1.952AlaHis: 1.952 ± 0.414
6.506AlaIle: 6.506 ± 0.723
3.904AlaLys: 3.904 ± 0.99
9.326AlaLeu: 9.326 ± 1.433
1.952AlaMet: 1.952 ± 0.949
3.47AlaAsn: 3.47 ± 0.503
3.253AlaPro: 3.253 ± 0.592
3.47AlaGln: 3.47 ± 1.66
3.687AlaArg: 3.687 ± 0.518
5.856AlaSer: 5.856 ± 0.27
5.205AlaThr: 5.205 ± 1.03
4.771AlaVal: 4.771 ± 0.968
0.867AlaTrp: 0.867 ± 0.492
1.518AlaTyr: 1.518 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
1.735CysAla: 1.735 ± 0.509
0.434CysCys: 0.434 ± 0.262
0.867CysAsp: 0.867 ± 0.322
0.651CysGlu: 0.651 ± 0.293
0.651CysPhe: 0.651 ± 0.228
0.651CysGly: 0.651 ± 0.225
0.0CysHis: 0.0 ± 0.0
1.518CysIle: 1.518 ± 0.551
1.084CysLys: 1.084 ± 0.495
2.169CysLeu: 2.169 ± 0.445
0.651CysMet: 0.651 ± 0.276
1.518CysAsn: 1.518 ± 0.685
0.867CysPro: 0.867 ± 0.358
0.651CysGln: 0.651 ± 0.225
1.084CysArg: 1.084 ± 0.283
1.084CysSer: 1.084 ± 0.438
1.518CysThr: 1.518 ± 0.45
1.084CysVal: 1.084 ± 0.466
0.0CysTrp: 0.0 ± 0.0
1.084CysTyr: 1.084 ± 0.263
0.0CysXaa: 0.0 ± 0.0
Asp
4.121AspAla: 4.121 ± 0.692
0.217AspCys: 0.217 ± 0.131
4.337AspAsp: 4.337 ± 1.36
3.036AspGlu: 3.036 ± 0.498
2.169AspPhe: 2.169 ± 0.389
2.169AspGly: 2.169 ± 0.849
1.735AspHis: 1.735 ± 0.373
2.386AspIle: 2.386 ± 0.164
1.735AspLys: 1.735 ± 0.24
4.988AspLeu: 4.988 ± 0.813
1.518AspMet: 1.518 ± 0.567
3.687AspAsn: 3.687 ± 0.765
3.687AspPro: 3.687 ± 0.716
2.169AspGln: 2.169 ± 1.114
3.036AspArg: 3.036 ± 0.639
3.036AspSer: 3.036 ± 0.428
4.121AspThr: 4.121 ± 0.643
2.169AspVal: 2.169 ± 0.514
0.217AspTrp: 0.217 ± 0.218
1.084AspTyr: 1.084 ± 0.314
0.0AspXaa: 0.0 ± 0.0
Glu
4.121GluAla: 4.121 ± 0.501
0.651GluCys: 0.651 ± 0.393
1.735GluAsp: 1.735 ± 0.835
3.687GluGlu: 3.687 ± 0.518
2.819GluPhe: 2.819 ± 1.208
2.602GluGly: 2.602 ± 0.803
1.084GluHis: 1.084 ± 0.438
4.121GluIle: 4.121 ± 0.342
3.687GluLys: 3.687 ± 0.665
4.988GluLeu: 4.988 ± 0.544
1.301GluMet: 1.301 ± 0.367
3.253GluAsn: 3.253 ± 0.677
1.301GluPro: 1.301 ± 0.358
1.518GluGln: 1.518 ± 0.365
1.952GluArg: 1.952 ± 0.327
4.771GluSer: 4.771 ± 0.877
2.386GluThr: 2.386 ± 0.646
2.169GluVal: 2.169 ± 1.135
0.434GluTrp: 0.434 ± 0.262
1.301GluTyr: 1.301 ± 0.623
0.0GluXaa: 0.0 ± 0.0
Phe
1.952PheAla: 1.952 ± 0.403
0.651PheCys: 0.651 ± 0.21
1.952PheAsp: 1.952 ± 0.9
3.47PheGlu: 3.47 ± 1.03
2.386PhePhe: 2.386 ± 0.983
3.253PheGly: 3.253 ± 0.69
0.651PheHis: 0.651 ± 0.232
2.386PheIle: 2.386 ± 0.662
1.518PheLys: 1.518 ± 0.54
3.253PheLeu: 3.253 ± 0.817
0.434PheMet: 0.434 ± 0.188
1.301PheAsn: 1.301 ± 0.448
1.518PhePro: 1.518 ± 0.223
0.651PheGln: 0.651 ± 0.393
1.518PheArg: 1.518 ± 0.348
1.301PheSer: 1.301 ± 0.435
1.735PheThr: 1.735 ± 0.359
3.036PheVal: 3.036 ± 0.348
0.867PheTrp: 0.867 ± 0.245
0.217PheTyr: 0.217 ± 0.131
0.0PheXaa: 0.0 ± 0.0
Gly
3.036GlyAla: 3.036 ± 1.207
1.084GlyCys: 1.084 ± 0.907
4.554GlyAsp: 4.554 ± 1.047
1.301GlyGlu: 1.301 ± 0.221
1.301GlyPhe: 1.301 ± 0.834
3.687GlyGly: 3.687 ± 0.572
1.518GlyHis: 1.518 ± 0.358
3.47GlyIle: 3.47 ± 0.602
3.036GlyLys: 3.036 ± 0.38
5.205GlyLeu: 5.205 ± 0.94
1.735GlyMet: 1.735 ± 0.817
2.386GlyAsn: 2.386 ± 0.668
1.084GlyPro: 1.084 ± 0.537
3.036GlyGln: 3.036 ± 0.797
3.253GlyArg: 3.253 ± 0.29
4.554GlySer: 4.554 ± 1.283
2.602GlyThr: 2.602 ± 0.684
6.723GlyVal: 6.723 ± 1.793
0.0GlyTrp: 0.0 ± 0.0
1.952GlyTyr: 1.952 ± 0.769
0.0GlyXaa: 0.0 ± 0.0
His
1.735HisAla: 1.735 ± 0.242
0.434HisCys: 0.434 ± 0.262
0.434HisAsp: 0.434 ± 0.179
1.084HisGlu: 1.084 ± 0.385
0.434HisPhe: 0.434 ± 0.179
0.434HisGly: 0.434 ± 0.179
0.217HisHis: 0.217 ± 0.218
1.518HisIle: 1.518 ± 0.729
0.651HisLys: 0.651 ± 0.225
3.47HisLeu: 3.47 ± 0.777
0.434HisMet: 0.434 ± 0.262
1.084HisAsn: 1.084 ± 0.462
1.735HisPro: 1.735 ± 0.814
0.651HisGln: 0.651 ± 0.293
0.434HisArg: 0.434 ± 0.262
1.518HisSer: 1.518 ± 0.7
1.518HisThr: 1.518 ± 0.652
2.602HisVal: 2.602 ± 0.342
0.434HisTrp: 0.434 ± 0.353
0.217HisTyr: 0.217 ± 0.131
0.0HisXaa: 0.0 ± 0.0
Ile
5.856IleAla: 5.856 ± 1.106
0.0IleCys: 0.0 ± 0.0
3.687IleAsp: 3.687 ± 0.655
3.904IleGlu: 3.904 ± 1.229
2.602IlePhe: 2.602 ± 0.586
2.386IleGly: 2.386 ± 0.856
1.301IleHis: 1.301 ± 0.787
2.819IleIle: 2.819 ± 0.73
3.47IleLys: 3.47 ± 1.216
9.542IleLeu: 9.542 ± 1.256
1.735IleMet: 1.735 ± 0.613
2.819IleAsn: 2.819 ± 0.589
2.169IlePro: 2.169 ± 0.658
4.121IleGln: 4.121 ± 0.958
4.121IleArg: 4.121 ± 0.342
7.157IleSer: 7.157 ± 1.086
4.771IleThr: 4.771 ± 1.259
3.47IleVal: 3.47 ± 1.115
0.217IleTrp: 0.217 ± 0.131
2.169IleTyr: 2.169 ± 0.636
0.0IleXaa: 0.0 ± 0.0
Lys
2.602LysAla: 2.602 ± 0.57
1.952LysCys: 1.952 ± 0.524
2.386LysAsp: 2.386 ± 0.768
2.386LysGlu: 2.386 ± 0.424
0.867LysPhe: 0.867 ± 0.41
1.952LysGly: 1.952 ± 0.744
1.301LysHis: 1.301 ± 0.508
3.253LysIle: 3.253 ± 0.99
2.169LysLys: 2.169 ± 0.946
4.554LysLeu: 4.554 ± 0.804
1.518LysMet: 1.518 ± 0.701
2.169LysAsn: 2.169 ± 0.796
1.518LysPro: 1.518 ± 0.248
2.386LysGln: 2.386 ± 0.433
3.253LysArg: 3.253 ± 0.601
2.169LysSer: 2.169 ± 0.516
3.687LysThr: 3.687 ± 1.915
4.337LysVal: 4.337 ± 1.058
0.217LysTrp: 0.217 ± 0.131
1.301LysTyr: 1.301 ± 0.398
0.0LysXaa: 0.0 ± 0.0
Leu
9.759LeuAla: 9.759 ± 2.922
2.386LeuCys: 2.386 ± 0.341
7.807LeuAsp: 7.807 ± 0.956
5.856LeuGlu: 5.856 ± 1.272
3.47LeuPhe: 3.47 ± 1.049
6.94LeuGly: 6.94 ± 2.457
2.602LeuHis: 2.602 ± 0.781
6.723LeuIle: 6.723 ± 1.937
4.121LeuLys: 4.121 ± 1.116
10.193LeuLeu: 10.193 ± 1.997
3.036LeuMet: 3.036 ± 0.666
5.205LeuAsn: 5.205 ± 0.89
3.687LeuPro: 3.687 ± 0.787
6.072LeuGln: 6.072 ± 0.823
4.988LeuArg: 4.988 ± 0.761
9.542LeuSer: 9.542 ± 1.629
8.892LeuThr: 8.892 ± 1.007
4.337LeuVal: 4.337 ± 0.861
1.301LeuTrp: 1.301 ± 0.426
3.687LeuTyr: 3.687 ± 0.774
0.0LeuXaa: 0.0 ± 0.0
Met
1.952MetAla: 1.952 ± 0.509
0.434MetCys: 0.434 ± 0.262
1.301MetAsp: 1.301 ± 0.497
1.735MetGlu: 1.735 ± 0.866
1.084MetPhe: 1.084 ± 0.518
0.651MetGly: 0.651 ± 0.467
0.651MetHis: 0.651 ± 0.393
2.602MetIle: 2.602 ± 0.732
1.301MetLys: 1.301 ± 0.375
3.47MetLeu: 3.47 ± 0.894
0.434MetMet: 0.434 ± 0.334
0.434MetAsn: 0.434 ± 0.262
0.651MetPro: 0.651 ± 0.493
0.651MetGln: 0.651 ± 0.293
1.084MetArg: 1.084 ± 0.598
2.169MetSer: 2.169 ± 0.321
1.735MetThr: 1.735 ± 0.359
0.867MetVal: 0.867 ± 0.525
0.0MetTrp: 0.0 ± 0.0
0.867MetTyr: 0.867 ± 0.322
0.0MetXaa: 0.0 ± 0.0
Asn
2.819AsnAla: 2.819 ± 0.716
1.301AsnCys: 1.301 ± 0.463
3.036AsnAsp: 3.036 ± 0.572
2.169AsnGlu: 2.169 ± 0.595
1.518AsnPhe: 1.518 ± 0.695
4.337AsnGly: 4.337 ± 1.011
1.084AsnHis: 1.084 ± 0.407
3.904AsnIle: 3.904 ± 0.578
2.386AsnLys: 2.386 ± 1.09
5.639AsnLeu: 5.639 ± 1.913
0.867AsnMet: 0.867 ± 0.29
2.386AsnAsn: 2.386 ± 0.833
3.036AsnPro: 3.036 ± 0.484
1.735AsnGln: 1.735 ± 0.854
2.602AsnArg: 2.602 ± 0.489
3.036AsnSer: 3.036 ± 0.656
4.337AsnThr: 4.337 ± 0.994
2.169AsnVal: 2.169 ± 0.554
0.867AsnTrp: 0.867 ± 0.525
1.084AsnTyr: 1.084 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
2.819ProAla: 2.819 ± 0.542
0.867ProCys: 0.867 ± 0.25
0.867ProAsp: 0.867 ± 0.322
2.386ProGlu: 2.386 ± 0.761
1.518ProPhe: 1.518 ± 0.339
2.602ProGly: 2.602 ± 0.65
1.084ProHis: 1.084 ± 0.525
2.602ProIle: 2.602 ± 0.469
2.386ProLys: 2.386 ± 0.737
3.904ProLeu: 3.904 ± 0.494
0.867ProMet: 0.867 ± 0.525
1.952ProAsn: 1.952 ± 0.563
2.169ProPro: 2.169 ± 0.6
1.518ProGln: 1.518 ± 0.626
1.518ProArg: 1.518 ± 0.435
4.771ProSer: 4.771 ± 0.98
3.47ProThr: 3.47 ± 0.793
3.47ProVal: 3.47 ± 1.431
0.217ProTrp: 0.217 ± 0.131
1.735ProTyr: 1.735 ± 0.55
0.0ProXaa: 0.0 ± 0.0
Gln
5.422GlnAla: 5.422 ± 1.454
0.217GlnCys: 0.217 ± 0.224
1.952GlnAsp: 1.952 ± 0.983
2.169GlnGlu: 2.169 ± 0.743
1.735GlnPhe: 1.735 ± 0.763
2.602GlnGly: 2.602 ± 0.723
1.084GlnHis: 1.084 ± 0.319
3.036GlnIle: 3.036 ± 0.797
1.518GlnLys: 1.518 ± 0.577
6.289GlnLeu: 6.289 ± 1.462
1.084GlnMet: 1.084 ± 0.288
1.084GlnAsn: 1.084 ± 0.298
0.867GlnPro: 0.867 ± 0.293
3.253GlnGln: 3.253 ± 0.66
3.253GlnArg: 3.253 ± 0.61
4.337GlnSer: 4.337 ± 1.199
1.735GlnThr: 1.735 ± 0.71
3.036GlnVal: 3.036 ± 0.337
0.0GlnTrp: 0.0 ± 0.0
1.518GlnTyr: 1.518 ± 0.865
0.0GlnXaa: 0.0 ± 0.0
Arg
3.036ArgAla: 3.036 ± 1.13
0.867ArgCys: 0.867 ± 0.195
1.952ArgAsp: 1.952 ± 0.316
3.253ArgGlu: 3.253 ± 1.086
2.169ArgPhe: 2.169 ± 0.634
3.253ArgGly: 3.253 ± 1.145
1.084ArgHis: 1.084 ± 0.49
4.771ArgIle: 4.771 ± 0.604
2.386ArgLys: 2.386 ± 0.54
6.289ArgLeu: 6.289 ± 0.906
1.952ArgMet: 1.952 ± 0.522
2.819ArgAsn: 2.819 ± 0.357
3.036ArgPro: 3.036 ± 0.621
1.735ArgGln: 1.735 ± 0.44
3.687ArgArg: 3.687 ± 0.547
3.47ArgSer: 3.47 ± 1.202
1.301ArgThr: 1.301 ± 0.294
4.771ArgVal: 4.771 ± 0.787
0.434ArgTrp: 0.434 ± 0.188
1.952ArgTyr: 1.952 ± 0.287
0.0ArgXaa: 0.0 ± 0.0
Ser
5.856SerAla: 5.856 ± 1.705
1.735SerCys: 1.735 ± 0.531
4.554SerAsp: 4.554 ± 0.393
3.036SerGlu: 3.036 ± 0.93
2.169SerPhe: 2.169 ± 0.111
5.639SerGly: 5.639 ± 0.946
1.084SerHis: 1.084 ± 0.299
6.289SerIle: 6.289 ± 0.868
3.036SerLys: 3.036 ± 0.834
9.542SerLeu: 9.542 ± 0.954
1.952SerMet: 1.952 ± 0.529
4.988SerAsn: 4.988 ± 0.762
3.904SerPro: 3.904 ± 0.816
5.205SerGln: 5.205 ± 1.78
4.554SerArg: 4.554 ± 0.745
8.024SerSer: 8.024 ± 1.404
3.687SerThr: 3.687 ± 1.07
3.904SerVal: 3.904 ± 0.952
0.651SerTrp: 0.651 ± 0.21
3.036SerTyr: 3.036 ± 0.975
0.0SerXaa: 0.0 ± 0.0
Thr
6.723ThrAla: 6.723 ± 1.766
1.952ThrCys: 1.952 ± 0.355
2.386ThrAsp: 2.386 ± 1.029
2.169ThrGlu: 2.169 ± 0.63
1.735ThrPhe: 1.735 ± 0.365
3.904ThrGly: 3.904 ± 1.544
0.651ThrHis: 0.651 ± 0.377
4.554ThrIle: 4.554 ± 0.823
3.036ThrLys: 3.036 ± 0.626
6.072ThrLeu: 6.072 ± 0.556
0.651ThrMet: 0.651 ± 0.247
3.253ThrAsn: 3.253 ± 0.612
3.904ThrPro: 3.904 ± 1.146
2.819ThrGln: 2.819 ± 0.815
3.687ThrArg: 3.687 ± 1.153
4.554ThrSer: 4.554 ± 1.582
5.856ThrThr: 5.856 ± 0.974
2.819ThrVal: 2.819 ± 0.703
1.301ThrTrp: 1.301 ± 0.56
1.735ThrTyr: 1.735 ± 0.558
0.0ThrXaa: 0.0 ± 0.0
Val
6.94ValAla: 6.94 ± 1.51
0.651ValCys: 0.651 ± 0.21
1.518ValAsp: 1.518 ± 0.248
1.952ValGlu: 1.952 ± 1.159
1.952ValPhe: 1.952 ± 0.284
2.602ValGly: 2.602 ± 0.528
1.518ValHis: 1.518 ± 0.317
3.47ValIle: 3.47 ± 0.807
3.036ValLys: 3.036 ± 0.382
7.591ValLeu: 7.591 ± 0.467
0.867ValMet: 0.867 ± 0.279
3.47ValAsn: 3.47 ± 0.757
2.602ValPro: 2.602 ± 0.8
2.169ValGln: 2.169 ± 0.632
3.687ValArg: 3.687 ± 0.91
7.157ValSer: 7.157 ± 1.985
3.253ValThr: 3.253 ± 0.729
1.518ValVal: 1.518 ± 0.816
0.651ValTrp: 0.651 ± 0.29
2.819ValTyr: 2.819 ± 0.827
0.0ValXaa: 0.0 ± 0.0
Trp
0.434TrpAla: 0.434 ± 0.262
0.217TrpCys: 0.217 ± 0.218
0.217TrpAsp: 0.217 ± 0.131
0.217TrpGlu: 0.217 ± 0.264
0.651TrpPhe: 0.651 ± 0.225
0.434TrpGly: 0.434 ± 0.188
0.0TrpHis: 0.0 ± 0.0
0.651TrpIle: 0.651 ± 0.393
0.0TrpLys: 0.0 ± 0.0
1.084TrpLeu: 1.084 ± 0.441
0.0TrpMet: 0.0 ± 0.0
0.434TrpAsn: 0.434 ± 0.188
0.434TrpPro: 0.434 ± 0.262
0.651TrpGln: 0.651 ± 0.393
1.301TrpArg: 1.301 ± 0.34
0.867TrpSer: 0.867 ± 0.607
0.217TrpThr: 0.217 ± 0.131
0.651TrpVal: 0.651 ± 0.29
0.0TrpTrp: 0.0 ± 0.0
0.217TrpTyr: 0.217 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.952TyrAla: 1.952 ± 0.784
1.084TyrCys: 1.084 ± 0.242
0.867TyrAsp: 0.867 ± 0.283
1.301TyrGlu: 1.301 ± 0.421
0.867TyrPhe: 0.867 ± 0.406
1.735TyrGly: 1.735 ± 0.66
0.434TyrHis: 0.434 ± 0.334
1.735TyrIle: 1.735 ± 0.327
1.518TyrLys: 1.518 ± 0.339
3.036TyrLeu: 3.036 ± 1.376
0.867TyrMet: 0.867 ± 0.296
2.819TyrAsn: 2.819 ± 0.566
1.301TyrPro: 1.301 ± 0.955
1.735TyrGln: 1.735 ± 0.549
1.518TyrArg: 1.518 ± 0.734
3.47TyrSer: 3.47 ± 1.099
1.735TyrThr: 1.735 ± 0.812
1.518TyrVal: 1.518 ± 0.375
0.0TyrTrp: 0.0 ± 0.0
1.084TyrTyr: 1.084 ± 0.407
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4612 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski