Amino acid dipeptide frequency for Sclerotinia sclerotiorum negative-stranded RNA virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
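The counting described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: the function names, the toy sequences, the use of one-letter codes (with X standing for Xaa), and the choice to count pairs only within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X plays the role of Xaa (assumption)

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X.
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_EXPECTATION:
        return "over"    # would be marked red
    if freq < UNIFORM_EXPECTATION:
        return "under"   # would be marked blue
    return "expected"


# Hypothetical toy proteome, not taken from the virus.
toy = ["MALLSA", "LLSX"]
freqs = dipeptide_per_mille(toy)
print(freqs["LL"], classify(freqs["LL"]))
```

With only eight dipeptides in the toy input, every observed pair lands far above 2.5‰; on a real proteome the values spread around the expectation as in the table.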

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
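As a small worked conversion, the per-mille values can be turned into fractions or percentages as follows. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


ala_ala = 5.972  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```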

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.972 ± 1.178 | 0.995 ± 0.555 | 3.65 ± 1.427 | 5.309 ± 3.884 | 3.65 ± 1.7 | 2.986 ± 0.824 | 0.995 ± 0.555 | 4.313 ± 1.653 | 5.64 ± 3.665 | 6.967 ± 1.768 | 1.659 ± 0.777 | 1.327 ± 1.169 | 6.304 ± 1.914 | 1.991 ± 1.451 | 4.977 ± 1.182 | 7.631 ± 2.792 | 5.972 ± 1.549 | 4.977 ± 1.528 | 1.991 ± 1.11 | 1.659 ± 2.028 | 0.0 ± 0.0 |
| Cys | 0.332 ± 0.185 | 0.0 ± 0.0 | 0.332 ± 0.185 | 0.995 ± 1.547 | 0.995 ± 0.429 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.327 ± 0.74 | 0.332 ± 0.185 | 1.327 ± 0.794 | 0.664 ± 0.37 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.664 ± 0.37 | 1.327 ± 0.74 | 0.664 ± 0.37 | 0.664 ± 0.37 | 1.327 ± 0.74 | 0.332 ± 0.185 | 0.664 ± 0.37 | 0.0 ± 0.0 |
| Asp | 5.64 ± 2.406 | 0.664 ± 0.849 | 1.991 ± 0.74 | 3.65 ± 1.081 | 1.327 ± 0.74 | 2.654 ± 0.973 | 0.664 ± 0.739 | 3.65 ± 1.625 | 1.659 ± 0.753 | 4.977 ± 1.487 | 0.995 ± 0.429 | 1.327 ± 1.746 | 3.65 ± 2.035 | 2.322 ± 0.741 | 2.322 ± 0.899 | 5.309 ± 2.077 | 2.322 ± 1.757 | 3.65 ± 1.027 | 1.327 ± 0.74 | 3.65 ± 1.588 | 0.0 ± 0.0 |
| Glu | 5.64 ± 1.38 | 0.664 ± 0.37 | 2.986 ± 1.349 | 3.318 ± 2.165 | 3.981 ± 1.189 | 2.986 ± 0.983 | 1.659 ± 0.77 | 3.981 ± 1.3 | 2.654 ± 2.438 | 6.636 ± 1.59 | 2.654 ± 1.428 | 1.659 ± 1.409 | 0.995 ± 0.429 | 2.654 ± 1.056 | 2.986 ± 0.893 | 4.645 ± 2.644 | 3.318 ± 1.554 | 3.981 ± 1.657 | 0.664 ± 0.849 | 0.664 ± 0.37 | 0.0 ± 0.0 |
| Phe | 1.659 ± 0.857 | 1.659 ± 0.77 | 1.659 ± 0.925 | 3.65 ± 0.874 | 0.332 ± 0.185 | 2.986 ± 1.235 | 0.0 ± 0.0 | 1.659 ± 1.409 | 2.654 ± 0.79 | 2.986 ± 1.235 | 0.995 ± 0.429 | 2.322 ± 0.772 | 1.991 ± 0.858 | 0.995 ± 0.514 | 1.659 ± 2.028 | 4.977 ± 1.887 | 0.664 ± 0.37 | 1.659 ± 1.181 | 0.0 ± 0.0 | 0.995 ± 0.555 | 0.0 ± 0.0 |
| Gly | 3.65 ± 1.59 | 1.327 ± 0.74 | 2.986 ± 1.265 | 1.659 ± 0.73 | 1.327 ± 0.895 | 3.318 ± 0.991 | 1.659 ± 0.925 | 3.65 ± 2.387 | 0.995 ± 0.429 | 6.304 ± 1.914 | 0.332 ± 0.185 | 1.991 ± 0.476 | 3.65 ± 0.682 | 0.995 ± 0.429 | 4.645 ± 1.287 | 4.977 ± 1.279 | 3.65 ± 0.89 | 8.295 ± 1.906 | 0.332 ± 0.185 | 1.991 ± 0.74 | 0.0 ± 0.0 |
| His | 1.659 ± 1.64 | 0.332 ± 0.185 | 0.664 ± 0.37 | 0.995 ± 0.702 | 0.332 ± 0.185 | 1.327 ± 0.74 | 0.995 ± 0.555 | 0.995 ± 0.702 | 0.0 ± 0.0 | 2.322 ± 0.893 | 0.0 ± 0.0 | 0.664 ± 0.447 | 0.995 ± 0.555 | 0.332 ± 0.185 | 1.991 ± 0.74 | 2.322 ± 0.741 | 0.0 ± 0.0 | 0.995 ± 0.555 | 0.0 ± 0.0 | 0.332 ± 0.185 | 0.0 ± 0.0 |
| Ile | 3.981 ± 1.563 | 0.995 ± 0.555 | 4.977 ± 1.0 | 4.313 ± 2.828 | 2.986 ± 1.074 | 4.313 ± 1.211 | 1.327 ± 0.794 | 0.995 ± 0.429 | 3.981 ± 1.573 | 8.626 ± 3.03 | 0.664 ± 0.37 | 2.322 ± 1.149 | 2.654 ± 1.12 | 2.654 ± 0.904 | 3.318 ± 1.022 | 4.977 ± 2.08 | 2.322 ± 0.921 | 3.318 ± 0.596 | 0.995 ± 0.555 | 1.327 ± 0.74 | 0.0 ± 0.0 |
| Lys | 5.309 ± 1.48 | 0.0 ± 0.0 | 1.991 ± 0.736 | 2.322 ± 0.871 | 0.664 ± 0.37 | 4.313 ± 1.0 | 0.995 ± 0.429 | 3.65 ± 4.325 | 3.318 ± 0.675 | 6.304 ± 2.843 | 0.332 ± 0.613 | 1.659 ± 0.857 | 1.327 ± 0.74 | 2.986 ± 1.93 | 3.318 ± 1.428 | 2.986 ± 1.014 | 2.986 ± 1.613 | 1.991 ± 1.629 | 0.0 ± 0.0 | 0.664 ± 0.37 | 0.0 ± 0.0 |
| Leu | 10.285 ± 2.227 | 0.995 ± 0.555 | 7.299 ± 1.893 | 6.967 ± 1.635 | 2.322 ± 0.899 | 5.972 ± 1.302 | 1.327 ± 0.794 | 5.972 ± 1.672 | 4.977 ± 1.077 | 9.622 ± 0.886 | 3.65 ± 1.311 | 4.977 ± 1.369 | 6.967 ± 2.321 | 4.977 ± 1.516 | 5.972 ± 2.585 | 13.603 ± 1.961 | 6.636 ± 0.878 | 3.981 ± 1.301 | 0.664 ± 0.447 | 1.991 ± 1.11 | 0.0 ± 0.0 |
| Met | 2.322 ± 0.54 | 0.0 ± 0.0 | 0.664 ± 0.37 | 1.327 ± 0.486 | 1.327 ± 0.895 | 1.327 ± 1.169 | 0.0 ± 0.0 | 2.654 ± 0.79 | 1.327 ± 0.549 | 1.659 ± 0.972 | 1.659 ± 0.653 | 0.332 ± 0.185 | 0.664 ± 0.37 | 0.995 ± 0.555 | 1.659 ± 0.925 | 3.981 ± 0.94 | 1.659 ± 1.59 | 0.995 ± 0.429 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 2.654 ± 1.399 | 0.332 ± 0.185 | 0.332 ± 0.932 | 1.327 ± 0.798 | 0.664 ± 0.447 | 0.995 ± 0.555 | 0.664 ± 0.37 | 2.322 ± 0.899 | 0.995 ± 0.429 | 2.322 ± 0.54 | 0.664 ± 0.37 | 0.995 ± 0.514 | 3.981 ± 1.672 | 2.322 ± 0.921 | 3.981 ± 0.673 | 3.318 ± 0.675 | 0.0 ± 0.0 | 0.995 ± 1.33 | 0.332 ± 0.185 | 0.995 ± 0.429 | 0.0 ± 0.0 |
| Pro | 1.991 ± 0.476 | 0.0 ± 0.0 | 4.645 ± 0.755 | 3.65 ± 1.192 | 1.991 ± 1.007 | 5.309 ± 1.439 | 0.995 ± 0.555 | 4.313 ± 0.793 | 2.654 ± 1.49 | 4.645 ± 2.59 | 0.664 ± 0.447 | 0.664 ± 0.37 | 3.318 ± 2.382 | 1.659 ± 1.219 | 2.654 ± 0.925 | 5.64 ± 2.409 | 2.654 ± 1.087 | 4.645 ± 0.755 | 0.664 ± 0.37 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gln | 4.645 ± 1.512 | 0.664 ± 0.37 | 2.654 ± 1.056 | 2.322 ± 0.921 | 0.664 ± 0.447 | 2.654 ± 0.973 | 0.995 ± 0.429 | 2.654 ± 1.127 | 1.327 ± 0.557 | 5.309 ± 2.346 | 1.327 ± 0.838 | 0.995 ± 0.514 | 0.995 ± 0.555 | 1.659 ± 0.753 | 1.991 ± 0.988 | 4.313 ± 0.968 | 1.659 ± 1.14 | 2.654 ± 1.099 | 0.664 ± 0.37 | 0.332 ± 0.185 | 0.0 ± 0.0 |
| Arg | 3.981 ± 1.205 | 0.664 ± 0.37 | 4.977 ± 1.27 | 4.313 ± 0.725 | 1.327 ± 0.713 | 2.986 ± 0.746 | 0.995 ± 0.429 | 2.654 ± 0.955 | 1.659 ± 0.598 | 10.617 ± 2.481 | 1.659 ± 0.777 | 2.654 ± 1.48 | 1.327 ± 1.286 | 2.654 ± 1.063 | 3.65 ± 1.364 | 5.64 ± 1.036 | 3.65 ± 1.162 | 5.972 ± 1.346 | 2.986 ± 1.074 | 0.995 ± 0.429 | 0.0 ± 0.0 |
| Ser | 6.636 ± 1.113 | 1.327 ± 0.713 | 3.981 ± 1.563 | 3.981 ± 2.501 | 4.977 ± 1.0 | 4.313 ± 0.481 | 1.659 ± 0.829 | 4.313 ± 1.186 | 4.645 ± 1.23 | 11.281 ± 3.24 | 3.65 ± 1.162 | 3.981 ± 1.269 | 6.636 ± 1.977 | 3.981 ± 0.927 | 8.295 ± 2.064 | 8.958 ± 0.689 | 6.967 ± 1.678 | 6.967 ± 1.272 | 0.664 ± 0.535 | 2.654 ± 0.79 | 0.0 ± 0.0 |
| Thr | 4.645 ± 1.931 | 0.332 ± 0.185 | 1.659 ± 0.479 | 4.313 ± 1.298 | 2.322 ± 0.893 | 2.654 ± 1.063 | 0.332 ± 0.185 | 3.981 ± 0.927 | 2.322 ± 1.682 | 4.645 ± 1.951 | 1.327 ± 0.862 | 0.332 ± 0.533 | 1.991 ± 0.902 | 1.991 ± 0.871 | 4.313 ± 1.663 | 5.309 ± 1.063 | 3.318 ± 1.906 | 5.309 ± 1.45 | 0.664 ± 0.37 | 0.995 ± 0.702 | 0.0 ± 0.0 |
| Val | 6.304 ± 1.915 | 0.995 ± 0.429 | 4.313 ± 1.557 | 3.65 ± 1.426 | 3.318 ± 1.41 | 3.981 ± 1.006 | 1.991 ± 1.007 | 5.64 ± 2.066 | 3.981 ± 0.805 | 7.963 ± 1.145 | 0.664 ± 0.37 | 0.664 ± 0.447 | 2.654 ± 1.087 | 1.991 ± 0.736 | 3.65 ± 0.811 | 6.967 ± 3.444 | 3.318 ± 1.022 | 5.64 ± 1.314 | 0.664 ± 0.739 | 0.332 ± 0.185 | 0.0 ± 0.0 |
| Trp | 0.995 ± 0.665 | 0.0 ± 0.0 | 0.995 ± 0.555 | 0.664 ± 0.37 | 0.664 ± 0.37 | 1.327 ± 0.895 | 0.0 ± 0.0 | 0.332 ± 0.185 | 0.664 ± 0.37 | 0.995 ± 0.555 | 0.332 ± 0.932 | 0.332 ± 0.185 | 0.995 ± 0.555 | 1.327 ± 0.74 | 0.995 ± 0.555 | 1.327 ± 0.74 | 0.664 ± 0.739 | 0.332 ± 0.185 | 0.0 ± 0.0 | 0.332 ± 0.185 | 0.0 ± 0.0 |
| Tyr | 0.664 ± 0.37 | 0.0 ± 0.0 | 0.995 ± 0.555 | 0.332 ± 0.185 | 0.332 ± 0.185 | 1.327 ± 0.895 | 0.0 ± 0.0 | 1.991 ± 0.706 | 1.327 ± 0.856 | 3.65 ± 1.588 | 0.332 ± 0.185 | 0.995 ± 0.555 | 1.659 ± 0.857 | 1.327 ± 0.486 | 1.659 ± 0.925 | 2.322 ± 1.295 | 0.332 ± 0.185 | 0.995 ± 0.429 | 0.332 ± 0.185 | 0.332 ± 0.185 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 5 proteins (3015 amino acids)

Note: The error was estimated by bootstrapping (100×) at the protein level.
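A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, and the reported error is taken to be the standard deviation of the resampled frequencies. The source does not say which spread measure is used, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, counting pairs within each protein."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        seq[i:i + 2] == dipeptide
        for seq in proteins
        for i in range(len(seq) - 1)
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(resample, dipeptide))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of five short proteins.
toy = ["MALLSA", "LLSLLK", "MKTAYS", "GVLSRL", "MSSLAL"]
print(frequency_per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

With only five proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with errors in the table that are large relative to the values themselves.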

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski