Amino acid dipepetide frequency for Avian metaavulavirus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.905AlaAla: 3.905 ± 1.39
0.976AlaCys: 0.976 ± 0.302
2.734AlaAsp: 2.734 ± 0.66
3.124AlaGlu: 3.124 ± 0.959
0.976AlaPhe: 0.976 ± 0.29
1.953AlaGly: 1.953 ± 0.927
2.539AlaHis: 2.539 ± 0.442
5.272AlaIle: 5.272 ± 1.115
3.124AlaLys: 3.124 ± 0.672
5.858AlaLeu: 5.858 ± 0.695
2.343AlaMet: 2.343 ± 1.055
2.734AlaAsn: 2.734 ± 0.914
2.734AlaPro: 2.734 ± 1.379
3.905AlaGln: 3.905 ± 0.688
3.32AlaArg: 3.32 ± 1.138
5.663AlaSer: 5.663 ± 1.148
3.32AlaThr: 3.32 ± 0.783
4.101AlaVal: 4.101 ± 1.59
0.781AlaTrp: 0.781 ± 0.566
1.757AlaTyr: 1.757 ± 0.675
0.0AlaXaa: 0.0 ± 0.0
Cys
1.367CysAla: 1.367 ± 0.348
0.391CysCys: 0.391 ± 0.242
0.781CysAsp: 0.781 ± 0.341
0.195CysGlu: 0.195 ± 0.215
0.391CysPhe: 0.391 ± 0.292
0.586CysGly: 0.586 ± 0.302
0.586CysHis: 0.586 ± 0.255
0.976CysIle: 0.976 ± 0.443
1.562CysLys: 1.562 ± 0.47
2.734CysLeu: 2.734 ± 0.822
0.586CysMet: 0.586 ± 0.362
0.586CysAsn: 0.586 ± 0.314
1.367CysPro: 1.367 ± 0.329
1.172CysGln: 1.172 ± 0.423
1.562CysArg: 1.562 ± 0.304
2.929CysSer: 2.929 ± 0.784
0.781CysThr: 0.781 ± 0.5
1.757CysVal: 1.757 ± 0.527
0.391CysTrp: 0.391 ± 0.255
0.781CysTyr: 0.781 ± 0.429
0.0CysXaa: 0.0 ± 0.0
Asp
2.343AspAla: 2.343 ± 1.186
0.586AspCys: 0.586 ± 0.362
4.101AspAsp: 4.101 ± 1.972
0.781AspGlu: 0.781 ± 0.377
1.172AspPhe: 1.172 ± 0.331
2.343AspGly: 2.343 ± 0.731
1.367AspHis: 1.367 ± 0.793
4.101AspIle: 4.101 ± 0.821
0.976AspLys: 0.976 ± 0.275
6.835AspLeu: 6.835 ± 0.909
0.976AspMet: 0.976 ± 0.919
2.734AspAsn: 2.734 ± 1.21
4.101AspPro: 4.101 ± 0.732
3.515AspGln: 3.515 ± 0.794
2.343AspArg: 2.343 ± 0.569
5.077AspSer: 5.077 ± 1.988
3.32AspThr: 3.32 ± 0.812
1.757AspVal: 1.757 ± 0.948
0.195AspTrp: 0.195 ± 0.201
1.172AspTyr: 1.172 ± 0.458
0.0AspXaa: 0.0 ± 0.0
Glu
4.491GluAla: 4.491 ± 1.135
1.562GluCys: 1.562 ± 0.423
2.734GluAsp: 2.734 ± 0.421
3.124GluGlu: 3.124 ± 0.582
1.953GluPhe: 1.953 ± 1.017
3.32GluGly: 3.32 ± 0.608
0.586GluHis: 0.586 ± 0.373
1.953GluIle: 1.953 ± 0.69
1.367GluLys: 1.367 ± 0.589
4.296GluLeu: 4.296 ± 0.702
1.757GluMet: 1.757 ± 0.328
2.343GluAsn: 2.343 ± 0.986
1.172GluPro: 1.172 ± 0.384
1.172GluGln: 1.172 ± 0.217
1.757GluArg: 1.757 ± 0.402
6.054GluSer: 6.054 ± 1.481
3.124GluThr: 3.124 ± 0.359
1.953GluVal: 1.953 ± 0.745
0.391GluTrp: 0.391 ± 0.255
1.562GluTyr: 1.562 ± 0.637
0.0GluXaa: 0.0 ± 0.0
Phe
1.562PheAla: 1.562 ± 0.744
1.367PheCys: 1.367 ± 0.508
1.757PheAsp: 1.757 ± 0.612
1.562PheGlu: 1.562 ± 0.682
2.539PhePhe: 2.539 ± 0.561
0.976PheGly: 0.976 ± 0.275
0.195PheHis: 0.195 ± 0.201
2.539PheIle: 2.539 ± 0.683
1.953PheLys: 1.953 ± 0.262
4.687PheLeu: 4.687 ± 1.605
0.781PheMet: 0.781 ± 0.499
1.172PheAsn: 1.172 ± 0.407
1.172PhePro: 1.172 ± 0.37
0.391PheGln: 0.391 ± 0.191
1.172PheArg: 1.172 ± 0.394
2.929PheSer: 2.929 ± 0.798
2.929PheThr: 2.929 ± 0.672
1.172PheVal: 1.172 ± 0.423
0.195PheTrp: 0.195 ± 0.121
0.781PheTyr: 0.781 ± 0.438
0.0PheXaa: 0.0 ± 0.0
Gly
3.515GlyAla: 3.515 ± 0.549
1.367GlyCys: 1.367 ± 0.544
3.124GlyAsp: 3.124 ± 0.805
2.929GlyGlu: 2.929 ± 0.686
1.562GlyPhe: 1.562 ± 0.461
3.32GlyGly: 3.32 ± 0.751
1.367GlyHis: 1.367 ± 0.476
2.734GlyIle: 2.734 ± 0.743
2.148GlyLys: 2.148 ± 0.903
6.249GlyLeu: 6.249 ± 1.875
0.976GlyMet: 0.976 ± 0.424
1.953GlyAsn: 1.953 ± 0.789
1.367GlyPro: 1.367 ± 0.484
1.562GlyGln: 1.562 ± 0.24
3.905GlyArg: 3.905 ± 0.549
5.272GlySer: 5.272 ± 0.658
4.101GlyThr: 4.101 ± 0.632
2.734GlyVal: 2.734 ± 1.089
0.0GlyTrp: 0.0 ± 0.0
0.195GlyTyr: 0.195 ± 0.201
0.0GlyXaa: 0.0 ± 0.0
His
1.367HisAla: 1.367 ± 0.921
0.195HisCys: 0.195 ± 0.121
1.172HisAsp: 1.172 ± 0.489
0.781HisGlu: 0.781 ± 0.483
0.195HisPhe: 0.195 ± 0.219
1.172HisGly: 1.172 ± 0.458
1.367HisHis: 1.367 ± 1.197
2.539HisIle: 2.539 ± 0.662
0.195HisLys: 0.195 ± 0.215
1.953HisLeu: 1.953 ± 0.894
0.195HisMet: 0.195 ± 0.206
0.781HisAsn: 0.781 ± 0.382
2.343HisPro: 2.343 ± 0.454
2.148HisGln: 2.148 ± 1.425
1.172HisArg: 1.172 ± 0.573
1.367HisSer: 1.367 ± 0.658
2.734HisThr: 2.734 ± 1.589
0.976HisVal: 0.976 ± 0.604
0.195HisTrp: 0.195 ± 0.201
0.976HisTyr: 0.976 ± 0.604
0.0HisXaa: 0.0 ± 0.0
Ile
4.101IleAla: 4.101 ± 1.12
0.781IleCys: 0.781 ± 0.28
1.757IleAsp: 1.757 ± 0.748
4.687IleGlu: 4.687 ± 0.848
2.148IlePhe: 2.148 ± 0.814
4.296IleGly: 4.296 ± 1.053
2.148IleHis: 2.148 ± 0.773
6.054IleIle: 6.054 ± 0.942
4.882IleLys: 4.882 ± 1.572
7.42IleLeu: 7.42 ± 1.177
2.343IleMet: 2.343 ± 0.591
5.468IleAsn: 5.468 ± 0.978
4.491IlePro: 4.491 ± 0.729
4.491IleGln: 4.491 ± 1.655
2.734IleArg: 2.734 ± 0.875
8.787IleSer: 8.787 ± 1.478
5.272IleThr: 5.272 ± 1.073
4.296IleVal: 4.296 ± 1.065
0.976IleTrp: 0.976 ± 0.421
2.148IleTyr: 2.148 ± 0.818
0.0IleXaa: 0.0 ± 0.0
Lys
2.734LysAla: 2.734 ± 0.612
0.781LysCys: 0.781 ± 0.34
2.148LysAsp: 2.148 ± 0.788
3.124LysGlu: 3.124 ± 0.765
1.172LysPhe: 1.172 ± 0.311
3.71LysGly: 3.71 ± 0.922
0.976LysHis: 0.976 ± 0.313
4.101LysIle: 4.101 ± 0.522
3.905LysLys: 3.905 ± 0.677
4.687LysLeu: 4.687 ± 1.103
1.367LysMet: 1.367 ± 0.589
2.734LysAsn: 2.734 ± 0.974
1.562LysPro: 1.562 ± 0.452
0.976LysGln: 0.976 ± 0.256
2.734LysArg: 2.734 ± 0.884
4.296LysSer: 4.296 ± 1.211
3.124LysThr: 3.124 ± 0.581
2.343LysVal: 2.343 ± 0.813
0.195LysTrp: 0.195 ± 0.121
1.953LysTyr: 1.953 ± 0.82
0.0LysXaa: 0.0 ± 0.0
Leu
6.835LeuAla: 6.835 ± 1.929
2.734LeuCys: 2.734 ± 1.017
8.397LeuAsp: 8.397 ± 2.51
4.687LeuGlu: 4.687 ± 0.667
4.296LeuPhe: 4.296 ± 1.233
4.491LeuGly: 4.491 ± 1.109
1.757LeuHis: 1.757 ± 0.856
8.397LeuIle: 8.397 ± 1.924
5.272LeuLys: 5.272 ± 0.672
10.74LeuLeu: 10.74 ± 2.215
1.172LeuMet: 1.172 ± 0.433
5.077LeuAsn: 5.077 ± 0.881
3.515LeuPro: 3.515 ± 0.92
4.491LeuGln: 4.491 ± 0.863
3.905LeuArg: 3.905 ± 0.986
12.107LeuSer: 12.107 ± 1.953
8.006LeuThr: 8.006 ± 1.067
4.101LeuVal: 4.101 ± 0.337
1.562LeuTrp: 1.562 ± 0.566
2.539LeuTyr: 2.539 ± 0.875
0.0LeuXaa: 0.0 ± 0.0
Met
2.148MetAla: 2.148 ± 0.822
0.391MetCys: 0.391 ± 0.255
0.976MetAsp: 0.976 ± 0.559
1.367MetGlu: 1.367 ± 0.306
0.586MetPhe: 0.586 ± 0.255
1.757MetGly: 1.757 ± 0.74
0.195MetHis: 0.195 ± 0.225
2.734MetIle: 2.734 ± 0.736
1.562MetLys: 1.562 ± 0.532
2.343MetLeu: 2.343 ± 0.873
0.586MetMet: 0.586 ± 0.373
0.781MetAsn: 0.781 ± 0.386
0.195MetPro: 0.195 ± 0.121
0.195MetGln: 0.195 ± 0.219
0.781MetArg: 0.781 ± 0.259
1.953MetSer: 1.953 ± 0.749
1.757MetThr: 1.757 ± 0.666
0.976MetVal: 0.976 ± 0.481
0.391MetTrp: 0.391 ± 0.242
0.781MetTyr: 0.781 ± 0.382
0.0MetXaa: 0.0 ± 0.0
Asn
2.148AsnAla: 2.148 ± 0.421
1.172AsnCys: 1.172 ± 0.444
2.734AsnAsp: 2.734 ± 0.586
2.148AsnGlu: 2.148 ± 0.756
1.562AsnPhe: 1.562 ± 0.606
1.953AsnGly: 1.953 ± 0.718
0.976AsnHis: 0.976 ± 0.601
4.296AsnIle: 4.296 ± 0.695
1.757AsnLys: 1.757 ± 0.487
6.054AsnLeu: 6.054 ± 1.793
0.781AsnMet: 0.781 ± 0.377
2.539AsnAsn: 2.539 ± 0.987
4.687AsnPro: 4.687 ± 1.399
3.905AsnGln: 3.905 ± 1.333
3.71AsnArg: 3.71 ± 0.549
4.687AsnSer: 4.687 ± 1.282
2.734AsnThr: 2.734 ± 0.8
1.367AsnVal: 1.367 ± 0.591
0.976AsnTrp: 0.976 ± 0.604
1.562AsnTyr: 1.562 ± 0.634
0.0AsnXaa: 0.0 ± 0.0
Pro
3.124ProAla: 3.124 ± 1.306
0.586ProCys: 0.586 ± 0.286
1.562ProAsp: 1.562 ± 0.477
1.367ProGlu: 1.367 ± 0.536
1.367ProPhe: 1.367 ± 0.298
2.929ProGly: 2.929 ± 0.597
0.976ProHis: 0.976 ± 0.604
3.32ProIle: 3.32 ± 0.734
2.929ProLys: 2.929 ± 0.605
4.101ProLeu: 4.101 ± 1.207
0.586ProMet: 0.586 ± 0.244
3.32ProAsn: 3.32 ± 1.653
2.929ProPro: 2.929 ± 1.178
2.148ProGln: 2.148 ± 0.562
1.757ProArg: 1.757 ± 0.489
6.444ProSer: 6.444 ± 3.319
5.272ProThr: 5.272 ± 1.739
1.757ProVal: 1.757 ± 0.433
0.586ProTrp: 0.586 ± 0.373
1.562ProTyr: 1.562 ± 0.47
0.0ProXaa: 0.0 ± 0.0
Gln
2.929GlnAla: 2.929 ± 1.238
0.586GlnCys: 0.586 ± 0.255
2.539GlnAsp: 2.539 ± 1.365
2.148GlnGlu: 2.148 ± 0.876
1.757GlnPhe: 1.757 ± 0.428
2.539GlnGly: 2.539 ± 0.705
0.976GlnHis: 0.976 ± 0.559
4.296GlnIle: 4.296 ± 0.846
2.929GlnLys: 2.929 ± 0.368
6.249GlnLeu: 6.249 ± 0.526
0.586GlnMet: 0.586 ± 0.279
1.757GlnAsn: 1.757 ± 0.624
3.71GlnPro: 3.71 ± 2.51
3.515GlnGln: 3.515 ± 1.398
3.515GlnArg: 3.515 ± 0.689
5.077GlnSer: 5.077 ± 0.973
2.539GlnThr: 2.539 ± 0.589
3.515GlnVal: 3.515 ± 0.509
0.391GlnTrp: 0.391 ± 0.242
1.172GlnTyr: 1.172 ± 0.559
0.0GlnXaa: 0.0 ± 0.0
Arg
1.757ArgAla: 1.757 ± 0.631
0.976ArgCys: 0.976 ± 0.81
2.343ArgAsp: 2.343 ± 0.668
2.539ArgGlu: 2.539 ± 0.606
1.757ArgPhe: 1.757 ± 0.359
2.539ArgGly: 2.539 ± 0.439
1.367ArgHis: 1.367 ± 0.388
3.515ArgIle: 3.515 ± 0.621
2.343ArgLys: 2.343 ± 0.535
5.077ArgLeu: 5.077 ± 1.087
0.391ArgMet: 0.391 ± 0.391
3.124ArgAsn: 3.124 ± 0.81
1.562ArgPro: 1.562 ± 0.452
1.562ArgGln: 1.562 ± 0.631
2.539ArgArg: 2.539 ± 0.706
4.687ArgSer: 4.687 ± 0.732
2.734ArgThr: 2.734 ± 0.573
3.515ArgVal: 3.515 ± 0.908
0.586ArgTrp: 0.586 ± 0.427
1.172ArgTyr: 1.172 ± 0.558
0.0ArgXaa: 0.0 ± 0.0
Ser
5.468SerAla: 5.468 ± 1.001
2.343SerCys: 2.343 ± 0.59
3.71SerAsp: 3.71 ± 1.048
4.101SerGlu: 4.101 ± 0.729
2.734SerPhe: 2.734 ± 0.989
5.272SerGly: 5.272 ± 1.175
3.124SerHis: 3.124 ± 0.99
8.983SerIle: 8.983 ± 1.139
4.491SerLys: 4.491 ± 1.561
9.764SerLeu: 9.764 ± 1.225
2.929SerMet: 2.929 ± 0.817
7.03SerAsn: 7.03 ± 1.446
5.077SerPro: 5.077 ± 1.447
6.639SerGln: 6.639 ± 1.693
2.734SerArg: 2.734 ± 0.501
6.444SerSer: 6.444 ± 1.175
6.835SerThr: 6.835 ± 0.861
5.858SerVal: 5.858 ± 1.229
1.367SerTrp: 1.367 ± 0.584
2.734SerTyr: 2.734 ± 1.147
0.0SerXaa: 0.0 ± 0.0
Thr
4.491ThrAla: 4.491 ± 1.322
1.757ThrCys: 1.757 ± 0.532
3.515ThrAsp: 3.515 ± 0.828
3.905ThrGlu: 3.905 ± 0.83
1.953ThrPhe: 1.953 ± 0.647
2.148ThrGly: 2.148 ± 0.487
2.343ThrHis: 2.343 ± 1.153
5.468ThrIle: 5.468 ± 0.805
2.734ThrLys: 2.734 ± 0.742
8.592ThrLeu: 8.592 ± 1.674
1.953ThrMet: 1.953 ± 0.517
2.148ThrAsn: 2.148 ± 0.679
2.929ThrPro: 2.929 ± 1.214
5.077ThrGln: 5.077 ± 1.001
3.32ThrArg: 3.32 ± 0.861
5.272ThrSer: 5.272 ± 1.555
7.225ThrThr: 7.225 ± 1.694
4.101ThrVal: 4.101 ± 1.007
1.757ThrTrp: 1.757 ± 0.591
2.539ThrTyr: 2.539 ± 0.645
0.0ThrXaa: 0.0 ± 0.0
Val
2.539ValAla: 2.539 ± 0.541
1.562ValCys: 1.562 ± 0.769
2.343ValAsp: 2.343 ± 0.468
2.343ValGlu: 2.343 ± 0.748
2.148ValPhe: 2.148 ± 0.3
3.515ValGly: 3.515 ± 0.928
0.391ValHis: 0.391 ± 0.242
4.296ValIle: 4.296 ± 1.332
3.124ValLys: 3.124 ± 0.788
3.32ValLeu: 3.32 ± 0.56
1.562ValMet: 1.562 ± 0.517
3.515ValAsn: 3.515 ± 1.283
1.562ValPro: 1.562 ± 0.238
3.32ValGln: 3.32 ± 0.804
1.172ValArg: 1.172 ± 0.462
4.296ValSer: 4.296 ± 0.64
4.687ValThr: 4.687 ± 1.088
2.343ValVal: 2.343 ± 1.088
0.195ValTrp: 0.195 ± 0.215
1.953ValTyr: 1.953 ± 0.462
0.0ValXaa: 0.0 ± 0.0
Trp
1.562TrpAla: 1.562 ± 0.568
0.586TrpCys: 0.586 ± 0.381
0.391TrpAsp: 0.391 ± 0.242
1.172TrpGlu: 1.172 ± 1.271
0.586TrpPhe: 0.586 ± 0.373
0.586TrpGly: 0.586 ± 0.255
0.0TrpHis: 0.0 ± 0.0
1.562TrpIle: 1.562 ± 0.661
0.391TrpLys: 0.391 ± 0.242
0.586TrpLeu: 0.586 ± 0.248
0.195TrpMet: 0.195 ± 0.225
0.391TrpAsn: 0.391 ± 0.255
0.586TrpPro: 0.586 ± 0.248
0.195TrpGln: 0.195 ± 0.121
0.976TrpArg: 0.976 ± 0.426
0.391TrpSer: 0.391 ± 0.242
0.391TrpThr: 0.391 ± 0.191
0.195TrpVal: 0.195 ± 0.219
0.195TrpTrp: 0.195 ± 0.201
0.391TrpTyr: 0.391 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.734TyrAla: 2.734 ± 0.695
0.781TyrCys: 0.781 ± 0.483
0.976TyrAsp: 0.976 ± 0.279
0.586TyrGlu: 0.586 ± 0.231
0.976TyrPhe: 0.976 ± 0.481
0.976TyrGly: 0.976 ± 0.694
0.586TyrHis: 0.586 ± 0.362
2.148TyrIle: 2.148 ± 0.95
0.976TyrLys: 0.976 ± 0.407
2.539TyrLeu: 2.539 ± 0.617
0.195TyrMet: 0.195 ± 0.219
1.562TyrAsn: 1.562 ± 0.687
1.562TyrPro: 1.562 ± 0.612
2.343TyrGln: 2.343 ± 0.73
1.172TyrArg: 1.172 ± 0.415
3.905TyrSer: 3.905 ± 1.02
2.343TyrThr: 2.343 ± 0.787
1.367TyrVal: 1.367 ± 0.504
0.0TyrTrp: 0.0 ± 0.0
1.172TyrTyr: 1.172 ± 0.458
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (5122 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski