Amino acid dipeptide frequency for Xingshan nematode virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
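
As a minimal sketch of how such frequencies could be counted, the Python snippet below tallies overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the one-letter input sequences, and the choice to count overlapping pairs per protein are assumptions made for illustration; the exact procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: pairs are overlapping and never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along the protein.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the proteome).
freqs = dipeptide_frequencies(["MKLLA", "LLKA"])
```

Here the toy input contains seven dipeptides in total, two of which are LL, so `freqs["LL"]` is 2/7 expressed in per mille.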

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
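
The expected value and the red/blue marking can be illustrated with a short Python sketch. The function names and the simple greater-than/less-than rule are assumptions for illustration; the page does not state the exact threshold used for colouring.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected per mille frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked blue
    return "expected"


expected = expected_uniform_per_mille(20)   # 400 dipeptides
leu_leu = classify(12.141, expected)        # LeuLeu value from the table
ala_met = classify(0.276, expected)         # AlaMet value from the table
```

With 20 amino acids the expectation is 2.5‰, so LeuLeu (12.141‰) falls in the overrepresented group and AlaMet (0.276‰) in the underrepresented one.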

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
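
A small Python sketch of the unit conversion, with hypothetical helper names, may help when comparing the table with values given as fractions or percentages:

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


leu_leu_fraction = per_mille_to_fraction(12.141)  # LeuLeu value from the table
uniform_percent = per_mille_to_percent(2.5)       # uniform expectation for 400 dipeptides
```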

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.759 ± 1.16
AlaCys: 0.828 ± 0.834
AlaAsp: 1.932 ± 0.849
AlaGlu: 2.483 ± 0.422
AlaPhe: 2.759 ± 0.8
AlaGly: 1.38 ± 0.713
AlaHis: 2.208 ± 1.185
AlaIle: 3.863 ± 1.313
AlaLys: 1.656 ± 1.257
AlaLeu: 3.311 ± 0.515
AlaMet: 0.276 ± 0.163
AlaAsn: 0.828 ± 0.443
AlaPro: 0.276 ± 0.337
AlaGln: 1.38 ± 0.814
AlaArg: 2.483 ± 1.524
AlaSer: 2.759 ± 1.293
AlaThr: 1.932 ± 0.865
AlaVal: 1.656 ± 0.709
AlaTrp: 0.276 ± 0.163
AlaTyr: 1.104 ± 0.524
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.828 ± 0.405
CysCys: 0.276 ± 0.163
CysAsp: 0.276 ± 0.163
CysGlu: 1.104 ± 1.347
CysPhe: 0.552 ± 0.326
CysGly: 1.104 ± 0.408
CysHis: 0.552 ± 0.326
CysIle: 0.828 ± 0.488
CysLys: 1.656 ± 1.015
CysLeu: 2.208 ± 0.76
CysMet: 0.276 ± 0.154
CysAsn: 0.552 ± 0.326
CysPro: 1.104 ± 0.511
CysGln: 0.552 ± 0.397
CysArg: 1.104 ± 0.433
CysSer: 3.863 ± 1.127
CysThr: 0.828 ± 0.385
CysVal: 1.656 ± 0.709
CysTrp: 0.552 ± 0.326
CysTyr: 2.208 ± 0.523
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.552 ± 0.5
AspCys: 1.932 ± 0.466
AspAsp: 2.483 ± 1.923
AspGlu: 2.759 ± 1.412
AspPhe: 3.035 ± 0.258
AspGly: 1.38 ± 0.785
AspHis: 1.932 ± 0.524
AspIle: 2.208 ± 0.621
AspLys: 1.656 ± 0.709
AspLeu: 8.278 ± 2.443
AspMet: 0.828 ± 0.405
AspAsn: 3.311 ± 1.483
AspPro: 3.311 ± 0.907
AspGln: 1.38 ± 0.573
AspArg: 1.38 ± 0.967
AspSer: 1.104 ± 0.793
AspThr: 2.483 ± 0.523
AspVal: 2.759 ± 1.232
AspTrp: 0.552 ± 0.397
AspTyr: 2.483 ± 1.033
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.759 ± 0.848
GluCys: 1.104 ± 0.605
GluAsp: 2.483 ± 0.4
GluGlu: 3.863 ± 1.84
GluPhe: 3.311 ± 0.879
GluGly: 2.759 ± 0.668
GluHis: 1.104 ± 0.462
GluIle: 5.519 ± 1.485
GluLys: 5.519 ± 1.721
GluLeu: 7.45 ± 2.105
GluMet: 2.208 ± 1.641
GluAsn: 3.311 ± 1.024
GluPro: 3.035 ± 0.882
GluGln: 1.932 ± 0.827
GluArg: 2.759 ± 0.945
GluSer: 4.139 ± 1.379
GluThr: 3.863 ± 0.593
GluVal: 2.483 ± 0.692
GluTrp: 1.656 ± 0.446
GluTyr: 2.208 ± 0.634
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.104 ± 0.473
PheCys: 0.828 ± 0.332
PheAsp: 1.656 ± 0.606
PheGlu: 3.311 ± 1.015
PhePhe: 2.759 ± 0.404
PheGly: 1.932 ± 0.958
PheHis: 2.208 ± 0.867
PheIle: 2.759 ± 1.57
PheLys: 2.483 ± 1.562
PheLeu: 7.174 ± 0.886
PheMet: 0.828 ± 0.488
PheAsn: 3.311 ± 0.935
PhePro: 2.759 ± 1.332
PheGln: 3.311 ± 0.917
PheArg: 2.483 ± 1.033
PheSer: 3.035 ± 0.702
PheThr: 4.139 ± 2.087
PheVal: 2.759 ± 0.872
PheTrp: 1.656 ± 0.876
PheTyr: 1.932 ± 0.86
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.656 ± 0.507
GlyCys: 0.552 ± 0.326
GlyAsp: 3.035 ± 0.488
GlyGlu: 3.311 ± 1.474
GlyPhe: 3.035 ± 0.922
GlyGly: 3.035 ± 0.725
GlyHis: 0.828 ± 0.381
GlyIle: 5.519 ± 1.578
GlyLys: 3.587 ± 0.702
GlyLeu: 8.83 ± 2.7
GlyMet: 1.104 ± 0.472
GlyAsn: 2.483 ± 0.474
GlyPro: 1.104 ± 0.462
GlyGln: 1.38 ± 0.565
GlyArg: 1.38 ± 0.448
GlySer: 3.863 ± 1.242
GlyThr: 3.035 ± 0.725
GlyVal: 1.932 ± 0.785
GlyTrp: 0.276 ± 0.163
GlyTyr: 1.104 ± 0.605
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.104 ± 0.317
HisCys: 0.552 ± 0.326
HisAsp: 1.932 ± 0.309
HisGlu: 1.38 ± 0.482
HisPhe: 0.828 ± 0.488
HisGly: 0.828 ± 0.332
HisHis: 1.656 ± 0.709
HisIle: 1.932 ± 0.528
HisLys: 1.38 ± 0.603
HisLeu: 4.415 ± 1.086
HisMet: 0.552 ± 0.733
HisAsn: 1.38 ± 0.424
HisPro: 1.38 ± 0.814
HisGln: 1.656 ± 1.261
HisArg: 2.483 ± 0.41
HisSer: 2.759 ± 0.404
HisThr: 0.828 ± 0.332
HisVal: 1.104 ± 0.584
HisTrp: 0.828 ± 0.616
HisTyr: 1.932 ± 0.752
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.759 ± 1.52
IleCys: 1.656 ± 0.663
IleAsp: 4.415 ± 1.321
IleGlu: 3.863 ± 0.846
IlePhe: 2.208 ± 0.634
IleGly: 4.691 ± 1.612
IleHis: 2.208 ± 0.68
IleIle: 5.519 ± 1.633
IleLys: 5.795 ± 1.031
IleLeu: 6.071 ± 0.834
IleMet: 0.276 ± 0.163
IleAsn: 4.415 ± 1.206
IlePro: 5.519 ± 1.709
IleGln: 2.483 ± 0.887
IleArg: 3.311 ± 1.275
IleSer: 7.174 ± 1.615
IleThr: 3.863 ± 1.5
IleVal: 4.967 ± 0.727
IleTrp: 1.38 ± 0.565
IleTyr: 2.759 ± 1.195
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.932 ± 0.68
LysCys: 1.656 ± 0.463
LysAsp: 3.311 ± 1.326
LysGlu: 4.691 ± 1.29
LysPhe: 3.035 ± 0.989
LysGly: 3.035 ± 1.732
LysHis: 1.656 ± 0.863
LysIle: 3.587 ± 0.838
LysLys: 4.415 ± 0.984
LysLeu: 7.45 ± 1.242
LysMet: 1.38 ± 0.448
LysAsn: 4.139 ± 2.121
LysPro: 1.656 ± 0.986
LysGln: 1.656 ± 0.876
LysArg: 3.863 ± 0.534
LysSer: 4.691 ± 0.922
LysThr: 3.311 ± 1.757
LysVal: 2.759 ± 0.881
LysTrp: 1.656 ± 0.977
LysTyr: 1.932 ± 0.466
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.519 ± 2.039
LeuCys: 3.863 ± 0.748
LeuAsp: 4.415 ± 0.627
LeuGlu: 9.106 ± 1.112
LeuPhe: 6.347 ± 1.446
LeuGly: 8.278 ± 0.411
LeuHis: 3.035 ± 1.584
LeuIle: 9.106 ± 1.56
LeuLys: 5.795 ± 1.785
LeuLeu: 12.141 ± 1.992
LeuMet: 2.208 ± 0.617
LeuAsn: 6.071 ± 0.91
LeuPro: 5.519 ± 1.546
LeuGln: 4.139 ± 0.72
LeuArg: 9.106 ± 1.403
LeuSer: 8.83 ± 0.572
LeuThr: 6.071 ± 0.728
LeuVal: 4.139 ± 0.705
LeuTrp: 0.552 ± 0.397
LeuTyr: 3.311 ± 1.196
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.828 ± 0.589
MetCys: 1.104 ± 0.651
MetAsp: 1.38 ± 0.494
MetGlu: 1.932 ± 0.797
MetPhe: 1.38 ± 0.989
MetGly: 0.828 ± 0.488
MetHis: 0.0 ± 0.0
MetIle: 1.932 ± 0.517
MetLys: 0.828 ± 0.385
MetLeu: 1.932 ± 0.521
MetMet: 0.276 ± 0.337
MetAsn: 0.828 ± 0.616
MetPro: 0.0 ± 0.0
MetGln: 0.552 ± 0.397
MetArg: 0.552 ± 0.397
MetSer: 1.38 ± 0.58
MetThr: 1.104 ± 0.497
MetVal: 0.828 ± 0.405
MetTrp: 0.0 ± 0.0
MetTyr: 0.828 ± 0.405
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.828 ± 0.547
AsnCys: 1.104 ± 0.462
AsnAsp: 2.208 ± 0.63
AsnGlu: 1.656 ± 0.474
AsnPhe: 3.311 ± 1.924
AsnGly: 2.208 ± 0.751
AsnHis: 1.932 ± 0.621
AsnIle: 2.483 ± 1.172
AsnLys: 2.208 ± 0.483
AsnLeu: 7.726 ± 1.642
AsnMet: 1.104 ± 0.476
AsnAsn: 1.656 ± 0.709
AsnPro: 4.967 ± 1.784
AsnGln: 0.828 ± 0.589
AsnArg: 3.035 ± 1.091
AsnSer: 4.967 ± 1.617
AsnThr: 3.035 ± 0.731
AsnVal: 1.38 ± 0.733
AsnTrp: 1.38 ± 1.008
AsnTyr: 3.035 ± 0.896
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.759 ± 0.738
ProCys: 0.276 ± 0.55
ProAsp: 1.656 ± 0.404
ProGlu: 3.035 ± 1.572
ProPhe: 2.759 ± 0.807
ProGly: 1.104 ± 0.511
ProHis: 1.38 ± 0.58
ProIle: 3.035 ± 1.314
ProLys: 1.932 ± 1.201
ProLeu: 6.071 ± 2.059
ProMet: 0.276 ± 0.163
ProAsn: 2.208 ± 0.86
ProPro: 2.759 ± 0.695
ProGln: 1.656 ± 1.098
ProArg: 1.932 ± 0.517
ProSer: 6.347 ± 1.134
ProThr: 2.759 ± 0.809
ProVal: 2.483 ± 1.5
ProTrp: 0.552 ± 0.326
ProTyr: 1.932 ± 1.333
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.104 ± 0.631
GlnCys: 0.552 ± 0.326
GlnAsp: 1.932 ± 0.621
GlnGlu: 3.035 ± 0.882
GlnPhe: 1.656 ± 0.709
GlnGly: 1.932 ± 1.14
GlnHis: 0.552 ± 0.326
GlnIle: 2.208 ± 0.339
GlnLys: 3.311 ± 1.275
GlnLeu: 3.311 ± 1.723
GlnMet: 0.828 ± 0.438
GlnAsn: 1.38 ± 0.325
GlnPro: 1.104 ± 0.472
GlnGln: 0.828 ± 0.99
GlnArg: 1.932 ± 0.901
GlnSer: 3.311 ± 1.176
GlnThr: 1.656 ± 1.161
GlnVal: 1.38 ± 0.603
GlnTrp: 0.828 ± 0.488
GlnTyr: 1.656 ± 1.325
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.932 ± 0.849
ArgCys: 0.552 ± 0.292
ArgAsp: 3.863 ± 1.048
ArgGlu: 4.415 ± 1.153
ArgPhe: 2.759 ± 0.613
ArgGly: 2.208 ± 0.662
ArgHis: 1.932 ± 0.849
ArgIle: 4.139 ± 1.184
ArgLys: 2.759 ± 1.746
ArgLeu: 6.347 ± 0.918
ArgMet: 0.828 ± 0.381
ArgAsn: 1.932 ± 0.772
ArgPro: 1.38 ± 0.814
ArgGln: 1.656 ± 0.709
ArgArg: 1.932 ± 1.388
ArgSer: 4.415 ± 0.635
ArgThr: 2.483 ± 0.795
ArgVal: 2.759 ± 0.42
ArgTrp: 0.552 ± 0.326
ArgTyr: 1.104 ± 0.651
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.311 ± 1.397
SerCys: 2.208 ± 0.634
SerAsp: 3.311 ± 1.954
SerGlu: 4.967 ± 1.005
SerPhe: 4.139 ± 1.217
SerGly: 3.587 ± 1.15
SerHis: 2.483 ± 0.904
SerIle: 4.691 ± 1.055
SerLys: 7.174 ± 1.367
SerLeu: 9.382 ± 3.04
SerMet: 1.38 ± 0.325
SerAsn: 3.035 ± 0.789
SerPro: 5.243 ± 2.177
SerGln: 1.932 ± 0.579
SerArg: 4.691 ± 0.771
SerSer: 6.623 ± 1.522
SerThr: 4.415 ± 1.281
SerVal: 4.967 ± 0.674
SerTrp: 1.932 ± 0.86
SerTyr: 3.587 ± 0.537
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.38 ± 0.603
ThrCys: 1.104 ± 0.651
ThrAsp: 3.587 ± 2.107
ThrGlu: 3.587 ± 0.702
ThrPhe: 2.483 ± 0.758
ThrGly: 4.415 ± 1.139
ThrHis: 1.656 ± 0.507
ThrIle: 6.347 ± 2.099
ThrLys: 3.035 ± 1.769
ThrLeu: 4.967 ± 1.667
ThrMet: 1.932 ± 0.785
ThrAsn: 2.759 ± 1.404
ThrPro: 1.656 ± 0.797
ThrGln: 1.932 ± 0.74
ThrArg: 1.104 ± 0.643
ThrSer: 6.071 ± 1.09
ThrThr: 4.967 ± 0.375
ThrVal: 2.483 ± 1.935
ThrTrp: 0.828 ± 0.332
ThrTyr: 0.828 ± 0.743
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.932 ± 0.34
ValCys: 0.828 ± 0.381
ValAsp: 0.828 ± 0.443
ValGlu: 3.035 ± 0.789
ValPhe: 2.483 ± 0.739
ValGly: 2.759 ± 0.588
ValHis: 1.656 ± 1.092
ValIle: 3.863 ± 0.869
ValLys: 3.311 ± 1.032
ValLeu: 3.587 ± 0.544
ValMet: 0.828 ± 0.385
ValAsn: 3.587 ± 0.903
ValPro: 1.656 ± 0.766
ValGln: 2.483 ± 2.1
ValArg: 2.759 ± 0.83
ValSer: 1.932 ± 0.309
ValThr: 3.587 ± 0.281
ValVal: 2.759 ± 1.427
ValTrp: 0.276 ± 0.163
ValTyr: 2.483 ± 0.523
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.276 ± 0.163
TrpCys: 0.552 ± 0.397
TrpAsp: 0.552 ± 0.326
TrpGlu: 1.104 ± 0.651
TrpPhe: 1.104 ± 0.433
TrpGly: 1.38 ± 0.565
TrpHis: 1.104 ± 0.584
TrpIle: 3.035 ± 0.955
TrpLys: 1.104 ± 0.631
TrpLeu: 1.104 ± 0.473
TrpMet: 0.276 ± 0.511
TrpAsn: 1.656 ± 0.977
TrpPro: 0.276 ± 0.163
TrpGln: 0.276 ± 0.163
TrpArg: 0.0 ± 0.0
TrpSer: 1.38 ± 0.565
TrpThr: 0.552 ± 0.42
TrpVal: 0.276 ± 0.163
TrpTrp: 0.276 ± 0.163
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.828 ± 0.405
TyrCys: 0.552 ± 0.326
TyrAsp: 0.552 ± 0.5
TyrGlu: 0.828 ± 0.405
TyrPhe: 2.483 ± 1.203
TyrGly: 2.208 ± 0.996
TyrHis: 0.828 ± 0.381
TyrIle: 2.759 ± 0.613
TyrLys: 2.208 ± 0.476
TyrLeu: 5.795 ± 0.613
TyrMet: 0.552 ± 0.397
TyrAsn: 2.208 ± 0.929
TyrPro: 2.208 ± 0.893
TyrGln: 2.759 ± 0.773
TyrArg: 1.656 ± 0.446
TyrSer: 4.415 ± 0.58
TyrThr: 2.208 ± 0.819
TyrVal: 1.104 ± 0.793
TyrTrp: 0.276 ± 0.337
TyrTyr: 3.035 ± 1.063
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3625 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
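
A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the error is taken as the sample standard deviation across replicates, and the function names, the fixed seed, and the toy sequences are all hypothetical. The database's actual implementation may differ.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Per mille frequencies of overlapping dipeptides, counted within each protein."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation over replicates.
    return statistics.stdev(estimates)


# Toy proteins (hypothetical, not taken from the proteome).
proteins = ["MKLLA", "LLKALL", "AKKM"]
error_ll = bootstrap_error(proteins, "LL")
error_ww = bootstrap_error(proteins, "WW")
```

A dipeptide that never occurs in any protein, such as WW in the toy set, gets an error of zero in every replicate, which matches the pattern of the 0.0 ± 0.0 entries in the table.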

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski