Amino acid dipeptide frequency for Soybean vein necrosis virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
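
As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be counted from a set of protein sequences. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, one-letter codes are used for brevity, and the choice to count pairs only within each protein (not across protein boundaries) is an assumption; the database's own procedure may differ.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs inside one protein.
        # Assumption: pairs spanning two different proteins are not counted.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real viral proteins).
toy = ["MKKLS", "AKKS"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The toy input contains seven dipeptides in total, two of which are KK, so KK accounts for two sevenths of all pairs.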

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
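
The arithmetic behind the expected value and the unit conversion can be sketched as follows. The three example values are taken from the table on this page; the helper names `classify` and `to_fraction` are hypothetical, and using the uniform expectation as the exact threshold for over- and underrepresentation is an assumption about how the colouring is decided.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / N_STANDARD ** 2


def to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page.
examples = {"LysLys": 10.469, "AlaTrp": 0.194, "ValVal": 2.52}
labels = {pair: classify(v) for pair, v in examples.items()}
print(expected_per_mille)
print(labels)
```

With this rule, LysLys and ValVal fall above the 2.5‰ expectation and AlaTrp falls below it.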

For more information, see the sequence space article on Wikipedia.

Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.357 ± 1.699
AlaCys: 1.551 ± 0.709
AlaAsp: 1.939 ± 0.585
AlaGlu: 2.908 ± 1.037
AlaPhe: 1.551 ± 0.709
AlaGly: 2.326 ± 0.261
AlaHis: 0.388 ± 0.174
AlaIle: 3.296 ± 1.301
AlaLys: 5.816 ± 2.109
AlaLeu: 3.102 ± 0.281
AlaMet: 1.357 ± 0.268
AlaAsn: 1.745 ± 0.385
AlaPro: 2.133 ± 1.164
AlaGln: 0.582 ± 0.333
AlaArg: 0.969 ± 0.429
AlaSer: 5.041 ± 0.969
AlaThr: 2.326 ± 0.362
AlaVal: 1.745 ± 0.466
AlaTrp: 0.194 ± 0.284
AlaTyr: 0.582 ± 0.74
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.969 ± 0.622
CysCys: 0.194 ± 0.235
CysAsp: 0.582 ± 0.178
CysGlu: 1.551 ± 0.439
CysPhe: 1.939 ± 0.587
CysGly: 1.163 ± 1.202
CysHis: 0.388 ± 0.174
CysIle: 1.745 ± 0.359
CysLys: 1.939 ± 0.712
CysLeu: 3.102 ± 1.009
CysMet: 0.388 ± 0.23
CysAsn: 1.551 ± 1.086
CysPro: 1.551 ± 0.54
CysGln: 0.582 ± 0.178
CysArg: 1.357 ± 0.739
CysSer: 1.551 ± 0.426
CysThr: 1.163 ± 0.356
CysVal: 0.969 ± 0.622
CysTrp: 0.0 ± 0.0
CysTyr: 1.163 ± 0.269
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.745 ± 0.637
AspCys: 0.194 ± 0.359
AspAsp: 4.653 ± 1.871
AspGlu: 4.459 ± 0.764
AspPhe: 3.102 ± 0.937
AspGly: 3.296 ± 0.836
AspHis: 1.163 ± 0.508
AspIle: 3.49 ± 1.005
AspLys: 2.908 ± 1.282
AspLeu: 9.5 ± 0.602
AspMet: 1.357 ± 0.548
AspAsn: 1.939 ± 0.253
AspPro: 1.357 ± 0.258
AspGln: 1.939 ± 0.479
AspArg: 3.296 ± 1.43
AspSer: 6.398 ± 1.704
AspThr: 3.49 ± 0.472
AspVal: 3.102 ± 0.666
AspTrp: 0.582 ± 0.345
AspTyr: 1.939 ± 0.585
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.102 ± 0.668
GluCys: 0.969 ± 0.337
GluAsp: 4.459 ± 0.9
GluGlu: 5.622 ± 1.029
GluPhe: 4.459 ± 0.988
GluGly: 4.265 ± 0.572
GluHis: 1.357 ± 0.571
GluIle: 6.01 ± 1.01
GluLys: 5.041 ± 1.387
GluLeu: 6.592 ± 0.857
GluMet: 1.939 ± 0.696
GluAsn: 3.296 ± 0.647
GluPro: 1.357 ± 0.464
GluGln: 1.745 ± 0.803
GluArg: 1.357 ± 0.641
GluSer: 6.979 ± 1.349
GluThr: 4.653 ± 1.308
GluVal: 3.49 ± 0.558
GluTrp: 0.194 ± 0.115
GluTyr: 2.908 ± 0.915
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.551 ± 0.962
PheCys: 1.357 ± 0.675
PheAsp: 3.877 ± 1.313
PheGlu: 3.49 ± 0.764
PhePhe: 2.133 ± 0.809
PheGly: 1.939 ± 0.81
PheHis: 1.357 ± 0.445
PheIle: 1.939 ± 0.447
PheLys: 4.847 ± 0.715
PheLeu: 4.459 ± 0.665
PheMet: 0.969 ± 0.429
PheAsn: 4.265 ± 0.45
PhePro: 1.745 ± 0.211
PheGln: 1.939 ± 0.472
PheArg: 1.551 ± 0.347
PheSer: 4.265 ± 0.518
PheThr: 2.908 ± 0.513
PheVal: 3.296 ± 1.239
PheTrp: 0.0 ± 0.0
PheTyr: 2.326 ± 0.311
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.551 ± 0.426
GlyCys: 2.133 ± 1.114
GlyAsp: 3.49 ± 0.561
GlyGlu: 2.908 ± 0.529
GlyPhe: 2.908 ± 0.487
GlyGly: 1.357 ± 0.578
GlyHis: 0.775 ± 0.204
GlyIle: 3.877 ± 0.911
GlyLys: 6.01 ± 1.573
GlyLeu: 4.847 ± 1.198
GlyMet: 2.133 ± 0.496
GlyAsn: 3.49 ± 0.783
GlyPro: 0.582 ± 0.352
GlyGln: 0.969 ± 0.46
GlyArg: 1.357 ± 0.598
GlySer: 4.071 ± 1.682
GlyThr: 2.52 ± 1.211
GlyVal: 2.908 ± 0.479
GlyTrp: 0.582 ± 0.536
GlyTyr: 2.714 ± 0.578
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.163 ± 0.439
HisCys: 0.194 ± 0.235
HisAsp: 0.582 ± 0.292
HisGlu: 0.969 ± 0.622
HisPhe: 1.551 ± 0.474
HisGly: 0.775 ± 0.244
HisHis: 0.194 ± 0.115
HisIle: 0.388 ± 0.321
HisLys: 1.163 ± 0.645
HisLeu: 1.745 ± 0.715
HisMet: 0.388 ± 0.498
HisAsn: 1.939 ± 0.253
HisPro: 0.775 ± 0.413
HisGln: 0.194 ± 0.235
HisArg: 0.582 ± 0.345
HisSer: 0.969 ± 0.333
HisThr: 0.969 ± 0.337
HisVal: 1.551 ± 0.683
HisTrp: 0.194 ± 0.115
HisTyr: 0.388 ± 0.23
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.071 ± 0.846
IleCys: 1.745 ± 0.581
IleAsp: 5.235 ± 1.323
IleGlu: 4.265 ± 1.027
IlePhe: 2.52 ± 0.472
IleGly: 4.071 ± 1.133
IleHis: 1.551 ± 0.713
IleIle: 5.428 ± 0.678
IleLys: 6.786 ± 1.749
IleLeu: 4.071 ± 0.821
IleMet: 1.745 ± 0.618
IleAsn: 3.877 ± 0.562
IlePro: 2.714 ± 0.917
IleGln: 1.357 ± 0.464
IleArg: 2.326 ± 0.613
IleSer: 7.949 ± 1.222
IleThr: 4.459 ± 0.883
IleVal: 3.877 ± 1.244
IleTrp: 0.194 ± 0.115
IleTyr: 4.459 ± 1.688
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.847 ± 0.849
LysCys: 1.357 ± 0.464
LysAsp: 3.684 ± 0.793
LysGlu: 7.561 ± 1.216
LysPhe: 5.622 ± 1.458
LysGly: 5.235 ± 1.497
LysHis: 1.163 ± 0.275
LysIle: 7.561 ± 0.645
LysLys: 10.469 ± 1.016
LysLeu: 6.786 ± 1.293
LysMet: 3.102 ± 1.02
LysAsn: 3.877 ± 1.591
LysPro: 3.49 ± 0.532
LysGln: 1.939 ± 0.656
LysArg: 2.908 ± 0.571
LysSer: 7.949 ± 0.454
LysThr: 6.398 ± 1.318
LysVal: 5.622 ± 1.608
LysTrp: 0.775 ± 0.362
LysTyr: 2.714 ± 0.576
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.459 ± 1.686
LeuCys: 1.357 ± 0.322
LeuAsp: 4.071 ± 1.387
LeuGlu: 6.979 ± 1.122
LeuPhe: 3.296 ± 0.852
LeuGly: 5.428 ± 0.925
LeuHis: 1.163 ± 0.308
LeuIle: 5.428 ± 0.499
LeuLys: 9.112 ± 1.027
LeuLeu: 6.979 ± 1.017
LeuMet: 3.296 ± 1.198
LeuAsn: 6.398 ± 1.714
LeuPro: 2.714 ± 0.653
LeuGln: 2.714 ± 0.666
LeuArg: 3.49 ± 0.67
LeuSer: 9.306 ± 0.8
LeuThr: 5.428 ± 0.908
LeuVal: 5.041 ± 0.956
LeuTrp: 0.388 ± 0.471
LeuTyr: 3.296 ± 1.216
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.551 ± 0.439
MetCys: 0.582 ± 0.352
MetAsp: 1.939 ± 0.883
MetGlu: 1.357 ± 0.909
MetPhe: 1.551 ± 0.683
MetGly: 1.939 ± 0.799
MetHis: 0.194 ± 0.235
MetIle: 2.714 ± 1.198
MetLys: 3.877 ± 0.657
MetLeu: 2.326 ± 0.418
MetMet: 1.551 ± 0.6
MetAsn: 2.714 ± 0.865
MetPro: 1.357 ± 0.645
MetGln: 0.969 ± 0.442
MetArg: 1.745 ± 0.502
MetSer: 1.939 ± 0.714
MetThr: 0.969 ± 0.575
MetVal: 2.133 ± 0.776
MetTrp: 0.0 ± 0.0
MetTyr: 0.582 ± 0.322
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.551 ± 0.403
AsnCys: 2.133 ± 1.686
AsnAsp: 3.877 ± 0.746
AsnGlu: 5.428 ± 0.947
AsnPhe: 3.296 ± 0.921
AsnGly: 2.326 ± 0.471
AsnHis: 1.163 ± 0.312
AsnIle: 4.459 ± 1.01
AsnLys: 3.877 ± 1.997
AsnLeu: 5.428 ± 0.821
AsnMet: 1.745 ± 0.658
AsnAsn: 2.714 ± 1.282
AsnPro: 2.326 ± 0.428
AsnGln: 1.357 ± 0.309
AsnArg: 1.357 ± 0.876
AsnSer: 2.714 ± 0.603
AsnThr: 3.296 ± 1.226
AsnVal: 3.296 ± 0.808
AsnTrp: 1.163 ± 0.867
AsnTyr: 2.326 ± 0.918
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.357 ± 0.598
ProCys: 0.194 ± 0.235
ProAsp: 1.357 ± 0.527
ProGlu: 1.745 ± 1.194
ProPhe: 1.357 ± 1.028
ProGly: 1.939 ± 0.401
ProHis: 0.0 ± 0.0
ProIle: 3.684 ± 1.69
ProLys: 3.102 ± 1.124
ProLeu: 2.326 ± 0.944
ProMet: 0.194 ± 0.235
ProAsn: 1.357 ± 0.297
ProPro: 0.775 ± 0.63
ProGln: 1.551 ± 0.699
ProArg: 1.163 ± 0.308
ProSer: 3.296 ± 0.147
ProThr: 1.163 ± 0.441
ProVal: 2.326 ± 0.382
ProTrp: 0.388 ± 0.23
ProTyr: 0.969 ± 0.575
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.357 ± 0.81
GlnCys: 0.582 ± 0.322
GlnAsp: 2.133 ± 0.798
GlnGlu: 1.745 ± 0.385
GlnPhe: 0.582 ± 0.322
GlnGly: 1.163 ± 0.652
GlnHis: 0.194 ± 0.115
GlnIle: 2.52 ± 0.425
GlnLys: 1.163 ± 0.439
GlnLeu: 2.326 ± 1.011
GlnMet: 1.745 ± 0.225
GlnAsn: 2.133 ± 0.493
GlnPro: 0.582 ± 0.345
GlnGln: 0.582 ± 0.326
GlnArg: 0.582 ± 0.345
GlnSer: 3.296 ± 0.724
GlnThr: 3.296 ± 0.836
GlnVal: 0.582 ± 0.398
GlnTrp: 0.194 ± 0.115
GlnTyr: 1.357 ± 0.268
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.194 ± 0.115
ArgCys: 0.582 ± 0.254
ArgAsp: 2.908 ± 0.879
ArgGlu: 2.52 ± 1.012
ArgPhe: 1.939 ± 1.409
ArgGly: 2.133 ± 0.28
ArgHis: 0.582 ± 0.322
ArgIle: 2.908 ± 0.614
ArgLys: 2.326 ± 0.553
ArgLeu: 3.102 ± 0.535
ArgMet: 0.969 ± 0.575
ArgAsn: 2.52 ± 0.553
ArgPro: 0.388 ± 0.174
ArgGln: 0.969 ± 0.333
ArgArg: 0.582 ± 0.508
ArgSer: 1.939 ± 0.966
ArgThr: 2.133 ± 0.765
ArgVal: 2.714 ± 0.615
ArgTrp: 0.194 ± 0.115
ArgTyr: 1.939 ± 0.946
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.49 ± 0.635
SerCys: 3.102 ± 0.853
SerAsp: 5.622 ± 0.739
SerGlu: 5.816 ± 0.637
SerPhe: 4.071 ± 0.696
SerGly: 5.235 ± 0.973
SerHis: 1.163 ± 0.275
SerIle: 6.01 ± 1.077
SerLys: 10.081 ± 1.888
SerLeu: 10.081 ± 1.356
SerMet: 3.49 ± 0.649
SerAsn: 4.265 ± 1.568
SerPro: 1.357 ± 0.909
SerGln: 3.102 ± 0.519
SerArg: 4.265 ± 0.682
SerSer: 9.112 ± 1.02
SerThr: 4.459 ± 0.78
SerVal: 4.653 ± 1.114
SerTrp: 0.775 ± 0.469
SerTyr: 3.296 ± 0.779
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.133 ± 0.259
ThrCys: 1.745 ± 0.912
ThrAsp: 1.745 ± 0.387
ThrGlu: 4.459 ± 1.143
ThrPhe: 2.908 ± 0.862
ThrGly: 3.684 ± 1.066
ThrHis: 2.133 ± 0.259
ThrIle: 4.071 ± 0.708
ThrLys: 5.428 ± 0.789
ThrLeu: 4.265 ± 1.296
ThrMet: 1.939 ± 0.585
ThrAsn: 2.908 ± 0.725
ThrPro: 1.745 ± 0.387
ThrGln: 1.551 ± 0.505
ThrArg: 1.745 ± 0.577
ThrSer: 6.01 ± 0.923
ThrThr: 2.326 ± 0.471
ThrVal: 2.714 ± 0.617
ThrTrp: 0.388 ± 0.174
ThrTyr: 2.908 ± 0.824
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.714 ± 0.965
ValCys: 1.357 ± 0.501
ValAsp: 4.653 ± 0.965
ValGlu: 3.684 ± 0.8
ValPhe: 2.52 ± 0.733
ValGly: 1.745 ± 0.211
ValHis: 1.551 ± 0.474
ValIle: 2.52 ± 0.843
ValLys: 4.265 ± 0.756
ValLeu: 4.459 ± 1.426
ValMet: 2.326 ± 0.569
ValAsn: 2.908 ± 0.479
ValPro: 1.939 ± 0.666
ValGln: 2.133 ± 0.387
ValArg: 1.939 ± 0.544
ValSer: 4.653 ± 0.869
ValThr: 2.908 ± 0.696
ValVal: 2.52 ± 0.854
ValTrp: 0.388 ± 0.413
ValTyr: 2.714 ± 0.347
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.194 ± 0.284
TrpCys: 0.388 ± 0.321
TrpAsp: 0.582 ± 0.326
TrpGlu: 0.388 ± 0.174
TrpPhe: 0.194 ± 0.235
TrpGly: 0.194 ± 0.235
TrpHis: 0.0 ± 0.0
TrpIle: 0.775 ± 0.436
TrpLys: 0.388 ± 0.321
TrpLeu: 1.357 ± 0.363
TrpMet: 0.388 ± 0.264
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.194 ± 0.235
TrpSer: 0.969 ± 0.239
TrpThr: 0.388 ± 0.23
TrpVal: 0.194 ± 0.235
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.194 ± 0.235
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.551 ± 0.749
TyrCys: 2.133 ± 0.436
TyrAsp: 2.52 ± 0.836
TyrGlu: 1.939 ± 0.613
TyrPhe: 2.714 ± 0.509
TyrGly: 0.969 ± 0.469
TyrHis: 0.194 ± 0.115
TyrIle: 3.877 ± 0.772
TyrLys: 4.459 ± 0.863
TyrLeu: 3.49 ± 0.773
TyrMet: 0.775 ± 0.362
TyrAsn: 1.939 ± 0.674
TyrPro: 1.357 ± 0.338
TyrGln: 1.939 ± 0.574
TyrArg: 0.775 ± 0.244
TyrSer: 5.041 ± 0.996
TyrThr: 1.551 ± 0.291
TyrVal: 1.357 ± 0.338
TyrTrp: 0.194 ± 0.284
TyrTyr: 1.939 ± 0.857
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (5159 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
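
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. This is only an illustration under stated assumptions: the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all hypothetical choices, not the database's documented method.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the error is summarised as a standard deviation.
    return statistics.pstdev(estimates)


# Toy input (not real viral proteins).
toy = ["MKKLS", "AKKS", "MSLLKK"]
kk_error = bootstrap_error(toy, "KK")
print(f"KK error: {kk_error:.3f} per mille")
```

With only a handful of proteins, as in this proteome, each replicate can differ considerably from the others, which is one reason the errors can be large relative to the values.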

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski