Amino acid dipeptide frequency for Soybean leaf-associated negative-stranded RNA virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
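As an illustration of what a dipeptide frequency is, the following minimal Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_frequencies`, the use of one-letter codes (with `X` standing for Xaa), and the choice not to count pairs that span two proteins are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_frequencies(["MKLLA", "LLSB"])
print(len(freqs))            # number of possible dipeptides, Xaa included
print(round(freqs["LL"], 3)) # LL occurs in 2 of the 7 counted pairs
```

Every possible pair is present in the result, so dipeptides that never occur simply get a frequency of 0.0, as in the table.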

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
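The arithmetic behind the expected value and the units can be sketched in a few lines of Python. The example assumes that "expected" means the uniform baseline of 1/400 described above and that any value above or below it counts as over- or underrepresented; the exact rule used for the colouring is not stated on this page, and the names `EXPECTED_20`, `per_mille_to_fraction` and `classify` are hypothetical.

```python
# Uniform expectation: every dipeptide equally likely, expressed in per mille.
EXPECTED_20 = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / (21 * 21)  # about 2.27 per mille when Xaa is included


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_20):
    """Label a per-mille value relative to the assumed uniform baseline."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Three values copied from the table (per mille).
observed = {"LeuSer": 9.916, "GlyGly": 2.67, "CysCys": 0.0}
labels = {name: classify(value) for name, value in observed.items()}
print(labels)
```

Under this assumption LeuSer and GlyGly fall above the baseline and CysCys below it.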

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.577 ± 1.669
AlaCys: 0.381 ± 0.643
AlaAsp: 2.67 ± 1.657
AlaGlu: 5.339 ± 1.743
AlaPhe: 2.67 ± 1.645
AlaGly: 4.958 ± 0.798
AlaHis: 1.907 ± 0.946
AlaIle: 4.958 ± 0.425
AlaLys: 4.195 ± 2.286
AlaLeu: 6.102 ± 0.613
AlaMet: 1.526 ± 0.554
AlaAsn: 4.195 ± 1.026
AlaPro: 0.763 ± 0.378
AlaGln: 1.144 ± 0.437
AlaArg: 4.958 ± 0.425
AlaSer: 6.484 ± 1.277
AlaThr: 1.907 ± 1.499
AlaVal: 3.432 ± 1.963
AlaTrp: 0.381 ± 0.189
AlaTyr: 1.526 ± 1.033
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.763 ± 0.378
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.763 ± 0.378
CysPhe: 0.381 ± 0.189
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.381 ± 0.189
CysLeu: 1.907 ± 0.504
CysMet: 0.381 ± 0.189
CysAsn: 0.381 ± 0.189
CysPro: 1.526 ± 0.757
CysGln: 0.0 ± 0.0
CysArg: 0.763 ± 0.378
CysSer: 1.144 ± 0.437
CysThr: 0.763 ± 0.378
CysVal: 1.907 ± 0.545
CysTrp: 0.381 ± 0.189
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.195 ± 3.712
AspCys: 0.763 ± 0.701
AspAsp: 2.288 ± 0.365
AspGlu: 3.051 ± 2.002
AspPhe: 2.288 ± 0.652
AspGly: 2.288 ± 0.652
AspHis: 0.763 ± 0.378
AspIle: 6.484 ± 1.271
AspLys: 2.67 ± 1.645
AspLeu: 8.391 ± 1.66
AspMet: 1.144 ± 0.437
AspAsn: 3.814 ± 1.278
AspPro: 2.288 ± 1.135
AspGln: 1.144 ± 1.151
AspArg: 3.051 ± 2.282
AspSer: 3.432 ± 1.062
AspThr: 3.432 ± 2.207
AspVal: 8.772 ± 0.413
AspTrp: 0.763 ± 0.378
AspTyr: 1.144 ± 0.567
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.814 ± 0.793
GluCys: 1.144 ± 0.567
GluAsp: 4.195 ± 2.831
GluGlu: 3.432 ± 1.274
GluPhe: 3.814 ± 1.278
GluGly: 3.432 ± 1.31
GluHis: 1.144 ± 0.437
GluIle: 5.721 ± 0.748
GluLys: 4.958 ± 0.425
GluLeu: 9.153 ± 1.616
GluMet: 0.381 ± 0.189
GluAsn: 1.907 ± 2.221
GluPro: 1.526 ± 0.561
GluGln: 2.288 ± 0.652
GluArg: 6.102 ± 1.025
GluSer: 5.339 ± 0.79
GluThr: 2.67 ± 0.848
GluVal: 6.102 ± 2.002
GluTrp: 0.0 ± 0.0
GluTyr: 1.144 ± 0.916
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.577 ± 1.296
PheCys: 0.381 ± 0.189
PheAsp: 2.288 ± 1.135
PheGlu: 2.67 ± 0.777
PhePhe: 1.907 ± 0.946
PheGly: 2.67 ± 0.777
PheHis: 0.0 ± 0.0
PheIle: 1.526 ± 0.432
PheLys: 2.288 ± 1.135
PheLeu: 2.67 ± 0.202
PheMet: 1.144 ± 0.567
PheAsn: 1.907 ± 1.499
PhePro: 2.288 ± 2.302
PheGln: 1.526 ± 0.432
PheArg: 1.144 ± 0.437
PheSer: 5.339 ± 0.984
PheThr: 1.144 ± 0.567
PheVal: 1.144 ± 0.567
PheTrp: 0.381 ± 0.189
PheTyr: 1.907 ± 0.937
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.051 ± 2.065
GlyCys: 0.763 ± 0.378
GlyAsp: 2.67 ± 0.777
GlyGlu: 2.67 ± 1.324
GlyPhe: 1.526 ± 0.757
GlyGly: 2.67 ± 0.777
GlyHis: 0.763 ± 0.378
GlyIle: 2.67 ± 0.202
GlyLys: 2.288 ± 0.365
GlyLeu: 3.051 ± 0.141
GlyMet: 1.526 ± 0.561
GlyAsn: 2.288 ± 0.627
GlyPro: 1.907 ± 0.504
GlyGln: 1.144 ± 0.916
GlyArg: 1.144 ± 0.567
GlySer: 4.958 ± 0.425
GlyThr: 2.67 ± 1.324
GlyVal: 4.958 ± 2.305
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.67 ± 1.324
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.144 ± 0.437
HisCys: 0.0 ± 0.0
HisAsp: 0.763 ± 0.701
HisGlu: 0.763 ± 0.516
HisPhe: 0.381 ± 0.189
HisGly: 1.144 ± 0.567
HisHis: 0.381 ± 0.189
HisIle: 1.144 ± 0.567
HisLys: 1.144 ± 0.606
HisLeu: 4.195 ± 0.721
HisMet: 0.0 ± 0.0
HisAsn: 0.381 ± 0.189
HisPro: 1.907 ± 0.504
HisGln: 0.0 ± 0.0
HisArg: 2.67 ± 1.449
HisSer: 1.907 ± 0.578
HisThr: 1.144 ± 0.606
HisVal: 3.051 ± 0.141
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.814 ± 0.385
IleCys: 0.381 ± 0.189
IleAsp: 3.814 ± 2.997
IleGlu: 4.195 ± 0.907
IlePhe: 3.814 ± 1.875
IleGly: 2.288 ± 0.627
IleHis: 0.0 ± 0.0
IleIle: 2.67 ± 0.202
IleLys: 5.721 ± 3.888
IleLeu: 5.339 ± 0.578
IleMet: 1.526 ± 0.432
IleAsn: 1.144 ± 0.567
IlePro: 4.577 ± 1.253
IleGln: 3.051 ± 0.141
IleArg: 5.339 ± 1.183
IleSer: 4.958 ± 1.398
IleThr: 1.907 ± 0.545
IleVal: 4.958 ± 0.719
IleTrp: 0.381 ± 0.189
IleTyr: 2.67 ± 1.324
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.67 ± 1.177
LysCys: 0.381 ± 0.189
LysAsp: 3.814 ± 0.793
LysGlu: 7.628 ± 2.805
LysPhe: 3.432 ± 0.529
LysGly: 2.67 ± 0.777
LysHis: 2.288 ± 0.365
LysIle: 3.051 ± 0.141
LysLys: 3.432 ± 1.858
LysLeu: 5.339 ± 1.377
LysMet: 0.381 ± 0.304
LysAsn: 2.288 ± 0.652
LysPro: 3.432 ± 1.702
LysGln: 2.288 ± 1.832
LysArg: 4.958 ± 0.425
LysSer: 3.814 ± 0.343
LysThr: 4.958 ± 1.397
LysVal: 3.432 ± 0.529
LysTrp: 1.144 ± 0.567
LysTyr: 0.763 ± 0.378
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.628 ± 2.179
LeuCys: 0.763 ± 0.378
LeuAsp: 5.721 ± 0.748
LeuGlu: 6.865 ± 1.689
LeuPhe: 3.051 ± 1.513
LeuGly: 3.814 ± 1.089
LeuHis: 1.526 ± 0.561
LeuIle: 7.628 ± 1.198
LeuLys: 7.246 ± 1.492
LeuLeu: 9.153 ± 2.422
LeuMet: 0.381 ± 0.189
LeuAsn: 3.814 ± 1.17
LeuPro: 4.577 ± 0.729
LeuGln: 3.432 ± 1.817
LeuArg: 6.102 ± 1.559
LeuSer: 9.916 ± 2.786
LeuThr: 7.246 ± 1.052
LeuVal: 6.102 ± 0.613
LeuTrp: 1.526 ± 0.757
LeuTyr: 4.958 ± 1.345
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.144 ± 0.567
MetCys: 0.763 ± 0.516
MetAsp: 1.144 ± 0.606
MetGlu: 1.907 ± 0.946
MetPhe: 1.526 ± 0.432
MetGly: 0.381 ± 0.189
MetHis: 0.763 ± 0.378
MetIle: 0.763 ± 0.378
MetLys: 0.381 ± 0.189
MetLeu: 1.144 ± 0.606
MetMet: 1.907 ± 1.235
MetAsn: 0.381 ± 0.189
MetPro: 1.144 ± 0.567
MetGln: 1.526 ± 0.729
MetArg: 1.526 ± 0.757
MetSer: 1.526 ± 0.757
MetThr: 1.144 ± 0.606
MetVal: 1.526 ± 2.183
MetTrp: 0.381 ± 0.189
MetTyr: 0.381 ± 0.189
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.288 ± 1.829
AsnCys: 0.763 ± 0.378
AsnAsp: 3.432 ± 1.963
AsnGlu: 3.051 ± 2.383
AsnPhe: 0.381 ± 0.189
AsnGly: 1.526 ± 0.757
AsnHis: 1.907 ± 0.946
AsnIle: 1.526 ± 1.668
AsnLys: 1.526 ± 0.432
AsnLeu: 4.958 ± 1.171
AsnMet: 1.144 ± 0.606
AsnAsn: 1.144 ± 0.606
AsnPro: 1.526 ± 0.757
AsnGln: 1.526 ± 0.561
AsnArg: 2.288 ± 0.627
AsnSer: 5.339 ± 0.732
AsnThr: 1.526 ± 0.729
AsnVal: 1.526 ± 0.561
AsnTrp: 0.763 ± 0.516
AsnTyr: 1.907 ± 0.504
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.907 ± 0.946
ProCys: 0.0 ± 0.0
ProAsp: 4.195 ± 1.026
ProGlu: 6.102 ± 1.025
ProPhe: 1.907 ± 0.946
ProGly: 3.051 ± 0.941
ProHis: 1.144 ± 0.916
ProIle: 3.051 ± 1.368
ProLys: 3.051 ± 0.141
ProLeu: 2.67 ± 1.324
ProMet: 1.144 ± 0.567
ProAsn: 2.67 ± 0.848
ProPro: 3.051 ± 1.513
ProGln: 1.526 ± 0.729
ProArg: 2.288 ± 1.135
ProSer: 3.051 ± 0.864
ProThr: 2.288 ± 0.652
ProVal: 2.288 ± 0.627
ProTrp: 1.144 ± 0.567
ProTyr: 2.288 ± 1.135
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.051 ± 0.906
GlnCys: 0.0 ± 0.0
GlnAsp: 1.526 ± 0.432
GlnGlu: 2.67 ± 1.645
GlnPhe: 0.0 ± 0.0
GlnGly: 1.526 ± 0.729
GlnHis: 1.144 ± 0.916
GlnIle: 1.907 ± 0.504
GlnLys: 2.67 ± 1.152
GlnLeu: 3.432 ± 0.529
GlnMet: 1.144 ± 0.567
GlnAsn: 0.0 ± 0.0
GlnPro: 2.288 ± 0.365
GlnGln: 1.526 ± 1.401
GlnArg: 0.763 ± 0.701
GlnSer: 1.526 ± 2.635
GlnThr: 1.907 ± 0.946
GlnVal: 2.288 ± 1.212
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.763 ± 0.378
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.051 ± 1.368
ArgCys: 1.526 ± 0.432
ArgAsp: 3.814 ± 1.724
ArgGlu: 4.195 ± 1.47
ArgPhe: 3.051 ± 0.864
ArgGly: 2.288 ± 0.627
ArgHis: 3.051 ± 0.906
ArgIle: 6.102 ± 1.791
ArgLys: 2.288 ± 1.135
ArgLeu: 8.391 ± 1.318
ArgMet: 1.144 ± 0.552
ArgAsn: 3.051 ± 1.459
ArgPro: 2.288 ± 0.365
ArgGln: 0.381 ± 0.189
ArgArg: 6.102 ± 1.622
ArgSer: 4.577 ± 1.558
ArgThr: 3.814 ± 1.724
ArgVal: 3.814 ± 1.008
ArgTrp: 0.763 ± 0.378
ArgTyr: 3.432 ± 1.702
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.628 ± 3.133
SerCys: 1.907 ± 0.946
SerAsp: 6.484 ± 0.432
SerGlu: 4.195 ± 2.081
SerPhe: 3.051 ± 0.941
SerGly: 3.051 ± 0.141
SerHis: 2.67 ± 1.177
SerIle: 3.432 ± 1.062
SerLys: 6.865 ± 0.921
SerLeu: 6.865 ± 0.921
SerMet: 1.907 ± 2.019
SerAsn: 3.432 ± 0.529
SerPro: 3.814 ± 1.008
SerGln: 2.67 ± 1.152
SerArg: 8.391 ± 3.213
SerSer: 7.246 ± 1.492
SerThr: 4.577 ± 1.683
SerVal: 3.814 ± 1.29
SerTrp: 0.763 ± 0.516
SerTyr: 3.051 ± 1.513
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.907 ± 0.504
ThrCys: 0.381 ± 0.189
ThrAsp: 3.432 ± 0.9
ThrGlu: 2.67 ± 1.645
ThrPhe: 1.526 ± 0.757
ThrGly: 0.763 ± 0.378
ThrHis: 0.381 ± 0.189
ThrIle: 6.865 ± 2.752
ThrLys: 3.814 ± 0.44
ThrLeu: 4.577 ± 0.9
ThrMet: 1.526 ± 0.561
ThrAsn: 1.907 ± 1.296
ThrPro: 1.907 ± 0.578
ThrGln: 1.907 ± 0.578
ThrArg: 3.432 ± 1.062
ThrSer: 2.288 ± 1.334
ThrThr: 2.288 ± 0.652
ThrVal: 5.721 ± 1.276
ThrTrp: 1.144 ± 0.567
ThrTyr: 2.67 ± 0.777
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.577 ± 1.253
ValCys: 0.381 ± 0.189
ValAsp: 4.577 ± 0.729
ValGlu: 4.577 ± 1.652
ValPhe: 3.051 ± 1.488
ValGly: 4.195 ± 1.121
ValHis: 1.144 ± 0.437
ValIle: 2.288 ± 0.874
ValLys: 4.958 ± 2.3
ValLeu: 9.153 ± 0.424
ValMet: 1.526 ± 0.757
ValAsn: 3.051 ± 2.065
ValPro: 4.195 ± 0.296
ValGln: 1.526 ± 0.561
ValArg: 3.432 ± 1.274
ValSer: 8.391 ± 2.408
ValThr: 3.051 ± 1.122
ValVal: 3.814 ± 0.385
ValTrp: 1.144 ± 0.567
ValTyr: 1.907 ± 0.578
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.381 ± 0.643
TrpCys: 0.381 ± 0.189
TrpAsp: 1.144 ± 0.567
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.381 ± 0.643
TrpGly: 0.763 ± 0.378
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.526 ± 0.757
TrpLeu: 0.381 ± 0.189
TrpMet: 0.0 ± 0.0
TrpAsn: 1.526 ± 0.757
TrpPro: 1.144 ± 0.567
TrpGln: 0.381 ± 0.189
TrpArg: 0.763 ± 0.378
TrpSer: 0.763 ± 0.378
TrpThr: 0.763 ± 0.378
TrpVal: 0.763 ± 0.378
TrpTrp: 0.381 ± 0.189
TrpTyr: 0.381 ± 0.189
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.907 ± 0.937
TyrCys: 0.381 ± 0.189
TyrAsp: 4.195 ± 0.622
TyrGlu: 1.526 ± 0.757
TyrPhe: 0.763 ± 0.378
TyrGly: 1.907 ± 0.504
TyrHis: 1.144 ± 1.151
TyrIle: 0.763 ± 0.378
TyrLys: 0.763 ± 0.378
TyrLeu: 4.577 ± 1.652
TyrMet: 1.144 ± 0.567
TyrAsn: 0.763 ± 0.378
TyrPro: 2.67 ± 1.324
TyrGln: 1.144 ± 0.567
TyrArg: 2.288 ± 0.627
TyrSer: 3.814 ± 1.226
TyrThr: 1.907 ± 0.504
TyrVal: 1.526 ± 0.432
TyrTrp: 0.381 ± 0.189
TyrTyr: 0.763 ± 0.378
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2623 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
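The page gives no further detail on the bootstrap procedure. One plausible reading, sketched below in Python, is that whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those replicates is reported as the error. The resampling scheme, the use of the standard deviation, and the names `pair_frequency` and `bootstrap_error` are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    return statistics.stdev(replicates)


# Toy sequences (hypothetical, not taken from this proteome).
proteins = ["MKLLSA", "MLLLLK", "MSAKGG"]
estimate = pair_frequency(proteins, "LL")
error = bootstrap_error(proteins, "LL")
print(f"LeuLeu: {estimate:.3f} ± {error:.3f}")
```

With only a few proteins, as here, each resample can differ strongly from the original set, which would be consistent with the comparatively large errors seen for some dipeptides in the table.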

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski