Amino acid dipeptide frequency for Subterranean clover stunt virus (strain F) (SCSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
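The calculation described above can be sketched in Python as follows. This is a minimal illustration and not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be in one-letter code, and dipeptides are assumed to be counted as overlapping pairs within each protein (the exact counting convention used for the table is not stated on this page).

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"
```

With this sketch, a table value such as 3.137 would be labelled "over" and 0.627 "under", which mirrors the red and blue marking described above under the same uniform assumption.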

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.255 ± 0.762
AlaCys: 1.255 ± 0.74
AlaAsp: 3.137 ± 0.877
AlaGlu: 3.764 ± 1.028
AlaPhe: 1.882 ± 1.037
AlaGly: 3.764 ± 0.945
AlaHis: 1.255 ± 0.671
AlaIle: 1.882 ± 1.197
AlaLys: 3.764 ± 1.353
AlaLeu: 2.509 ± 1.349
AlaMet: 2.509 ± 0.862
AlaAsn: 2.509 ± 0.66
AlaPro: 1.882 ± 0.805
AlaGln: 1.255 ± 0.678
AlaArg: 3.764 ± 1.287
AlaSer: 1.882 ± 0.946
AlaThr: 3.764 ± 1.203
AlaVal: 3.137 ± 1.61
AlaTrp: 1.255 ± 0.686
AlaTyr: 2.509 ± 1.27
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.255 ± 0.747
CysCys: 3.137 ± 0.866
CysAsp: 1.882 ± 1.954
CysGlu: 0.0 ± 0.0
CysPhe: 1.882 ± 0.822
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.882 ± 0.878
CysLys: 3.137 ± 1.561
CysLeu: 1.882 ± 0.975
CysMet: 1.255 ± 0.613
CysAsn: 1.255 ± 1.028
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.627 ± 0.514
CysSer: 2.509 ± 1.148
CysThr: 0.627 ± 0.651
CysVal: 1.255 ± 0.815
CysTrp: 1.255 ± 0.658
CysTyr: 1.882 ± 0.874
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.509 ± 1.275
AspCys: 1.255 ± 1.028
AspAsp: 5.646 ± 2.421
AspGlu: 3.764 ± 1.374
AspPhe: 3.137 ± 1.917
AspGly: 4.391 ± 1.046
AspHis: 1.255 ± 0.658
AspIle: 3.137 ± 1.327
AspLys: 1.882 ± 1.087
AspLeu: 3.137 ± 1.38
AspMet: 2.509 ± 0.954
AspAsn: 1.882 ± 0.737
AspPro: 3.137 ± 2.267
AspGln: 0.0 ± 0.0
AspArg: 4.391 ± 0.905
AspSer: 3.764 ± 0.691
AspThr: 1.255 ± 0.786
AspVal: 6.274 ± 1.92
AspTrp: 0.627 ± 0.651
AspTyr: 2.509 ± 1.383
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.137 ± 1.285
GluCys: 0.0 ± 0.0
GluAsp: 9.41 ± 2.095
GluGlu: 11.292 ± 2.744
GluPhe: 3.137 ± 1.766
GluGly: 5.019 ± 1.246
GluHis: 0.627 ± 0.587
GluIle: 5.019 ± 2.094
GluLys: 3.137 ± 0.956
GluLeu: 7.528 ± 2.426
GluMet: 4.391 ± 1.427
GluAsn: 0.627 ± 0.587
GluPro: 3.137 ± 1.64
GluGln: 1.882 ± 1.105
GluArg: 4.391 ± 1.421
GluSer: 4.391 ± 1.907
GluThr: 3.764 ± 1.728
GluVal: 5.019 ± 1.538
GluTrp: 1.255 ± 0.658
GluTyr: 3.137 ± 1.158
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.646 ± 1.229
PheCys: 0.627 ± 0.651
PheAsp: 2.509 ± 0.655
PheGlu: 3.137 ± 1.108
PhePhe: 0.627 ± 0.651
PheGly: 1.255 ± 0.671
PheHis: 0.627 ± 0.651
PheIle: 1.882 ± 1.105
PheLys: 1.882 ± 0.644
PheLeu: 1.882 ± 1.224
PheMet: 0.627 ± 0.587
PheAsn: 1.882 ± 1.232
PhePro: 1.882 ± 0.958
PheGln: 0.627 ± 0.594
PheArg: 1.882 ± 1.0
PheSer: 5.019 ± 1.246
PheThr: 3.137 ± 1.253
PheVal: 3.137 ± 1.228
PheTrp: 0.627 ± 0.651
PheTyr: 1.882 ± 0.822
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.509 ± 0.999
GlyCys: 0.627 ± 0.649
GlyAsp: 2.509 ± 1.167
GlyGlu: 6.274 ± 1.501
GlyPhe: 3.137 ± 1.701
GlyGly: 5.646 ± 1.556
GlyHis: 0.627 ± 0.557
GlyIle: 5.019 ± 1.613
GlyLys: 8.156 ± 1.52
GlyLeu: 4.391 ± 1.299
GlyMet: 1.255 ± 0.734
GlyAsn: 1.255 ± 0.678
GlyPro: 3.137 ± 1.306
GlyGln: 1.255 ± 0.854
GlyArg: 0.627 ± 0.684
GlySer: 6.274 ± 1.674
GlyThr: 1.882 ± 0.949
GlyVal: 3.137 ± 1.317
GlyTrp: 1.255 ± 0.678
GlyTyr: 3.764 ± 1.856
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.627 ± 0.594
HisCys: 0.0 ± 0.0
HisAsp: 1.882 ± 0.933
HisGlu: 1.255 ± 0.658
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.627 ± 0.557
HisLeu: 7.528 ± 2.082
HisMet: 0.627 ± 0.649
HisAsn: 0.0 ± 0.0
HisPro: 0.627 ± 0.557
HisGln: 0.627 ± 0.514
HisArg: 0.627 ± 0.494
HisSer: 0.627 ± 0.651
HisThr: 0.627 ± 0.651
HisVal: 1.882 ± 0.933
HisTrp: 0.627 ± 0.594
HisTyr: 1.255 ± 0.709
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.019 ± 1.794
IleCys: 1.882 ± 1.356
IleAsp: 1.882 ± 1.068
IleGlu: 6.274 ± 1.324
IlePhe: 0.627 ± 0.557
IleGly: 3.137 ± 1.366
IleHis: 0.627 ± 0.514
IleIle: 4.391 ± 1.308
IleLys: 4.391 ± 2.223
IleLeu: 1.882 ± 0.815
IleMet: 1.882 ± 1.121
IleAsn: 3.137 ± 1.392
IlePro: 3.137 ± 1.039
IleGln: 0.627 ± 0.557
IleArg: 2.509 ± 1.695
IleSer: 1.882 ± 1.214
IleThr: 2.509 ± 1.514
IleVal: 8.783 ± 2.23
IleTrp: 1.882 ± 0.644
IleTyr: 1.882 ± 0.89
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.509 ± 0.925
LysCys: 1.255 ± 0.861
LysAsp: 2.509 ± 0.66
LysGlu: 8.156 ± 2.074
LysPhe: 1.255 ± 0.786
LysGly: 1.882 ± 0.955
LysHis: 1.255 ± 0.671
LysIle: 4.391 ± 1.244
LysLys: 9.41 ± 3.47
LysLeu: 8.156 ± 0.865
LysMet: 1.882 ± 0.881
LysAsn: 3.137 ± 1.218
LysPro: 1.882 ± 1.459
LysGln: 1.255 ± 0.709
LysArg: 5.646 ± 1.23
LysSer: 5.646 ± 1.566
LysThr: 5.646 ± 1.92
LysVal: 4.391 ± 2.413
LysTrp: 1.255 ± 0.902
LysTyr: 5.019 ± 1.34
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.627 ± 0.514
LeuCys: 2.509 ± 1.214
LeuAsp: 1.255 ± 1.303
LeuGlu: 5.646 ± 1.947
LeuPhe: 1.882 ± 0.644
LeuGly: 4.391 ± 1.207
LeuHis: 1.255 ± 0.671
LeuIle: 6.274 ± 2.232
LeuLys: 8.783 ± 2.217
LeuLeu: 3.137 ± 1.317
LeuMet: 3.137 ± 0.596
LeuAsn: 4.391 ± 0.945
LeuPro: 3.137 ± 1.596
LeuGln: 2.509 ± 0.705
LeuArg: 6.274 ± 0.938
LeuSer: 5.019 ± 1.338
LeuThr: 1.255 ± 0.838
LeuVal: 6.901 ± 1.347
LeuTrp: 1.255 ± 0.794
LeuTyr: 5.646 ± 1.248
LeuXaa: 0.0 ± 0.0
Met
MetAla: 5.019 ± 0.86
MetCys: 0.0 ± 0.0
MetAsp: 2.509 ± 1.208
MetGlu: 5.019 ± 1.263
MetPhe: 1.882 ± 1.356
MetGly: 0.0 ± 0.0
MetHis: 1.882 ± 1.171
MetIle: 0.0 ± 0.0
MetLys: 6.901 ± 2.357
MetLeu: 1.882 ± 0.947
MetMet: 1.255 ± 0.678
MetAsn: 1.255 ± 0.762
MetPro: 0.627 ± 0.557
MetGln: 1.255 ± 0.854
MetArg: 1.882 ± 1.364
MetSer: 0.627 ± 0.587
MetThr: 0.0 ± 0.0
MetVal: 2.509 ± 1.455
MetTrp: 0.0 ± 0.0
MetTyr: 1.255 ± 0.786
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 1.255 ± 0.762
AsnAsp: 1.255 ± 0.765
AsnGlu: 1.882 ± 1.171
AsnPhe: 1.255 ± 0.658
AsnGly: 2.509 ± 1.152
AsnHis: 0.627 ± 0.557
AsnIle: 1.255 ± 0.854
AsnLys: 1.882 ± 1.214
AsnLeu: 1.255 ± 0.861
AsnMet: 0.0 ± 0.0
AsnAsn: 2.509 ± 1.068
AsnPro: 2.509 ± 0.961
AsnGln: 0.627 ± 0.494
AsnArg: 4.391 ± 1.631
AsnSer: 0.627 ± 0.684
AsnThr: 2.509 ± 1.111
AsnVal: 4.391 ± 1.302
AsnTrp: 1.255 ± 0.775
AsnTyr: 6.274 ± 1.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.627 ± 0.557
ProCys: 0.627 ± 0.557
ProAsp: 3.764 ± 1.824
ProGlu: 2.509 ± 1.248
ProPhe: 1.882 ± 0.882
ProGly: 1.882 ± 0.81
ProHis: 1.255 ± 0.676
ProIle: 3.137 ± 1.403
ProLys: 0.627 ± 0.557
ProLeu: 1.255 ± 1.028
ProMet: 0.0 ± 0.0
ProAsn: 0.627 ± 0.587
ProPro: 0.627 ± 0.684
ProGln: 0.627 ± 0.514
ProArg: 3.764 ± 0.947
ProSer: 1.882 ± 1.352
ProThr: 1.255 ± 0.678
ProVal: 2.509 ± 1.05
ProTrp: 3.137 ± 1.158
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.882 ± 0.644
GlnCys: 0.0 ± 0.0
GlnAsp: 4.391 ± 1.912
GlnGlu: 1.255 ± 0.658
GlnPhe: 0.627 ± 0.557
GlnGly: 4.391 ± 1.252
GlnHis: 1.255 ± 0.775
GlnIle: 0.627 ± 0.594
GlnLys: 1.255 ± 1.188
GlnLeu: 3.764 ± 0.884
GlnMet: 0.627 ± 0.514
GlnAsn: 1.255 ± 0.776
GlnPro: 0.627 ± 0.514
GlnGln: 1.255 ± 1.303
GlnArg: 2.509 ± 0.961
GlnSer: 2.509 ± 1.152
GlnThr: 1.255 ± 0.815
GlnVal: 2.509 ± 0.874
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.627 ± 0.557
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.255 ± 0.671
ArgCys: 4.391 ± 1.645
ArgAsp: 3.137 ± 0.919
ArgGlu: 2.509 ± 0.881
ArgPhe: 2.509 ± 0.951
ArgGly: 4.391 ± 1.267
ArgHis: 0.627 ± 0.557
ArgIle: 4.391 ± 0.894
ArgLys: 6.274 ± 1.825
ArgLeu: 3.137 ± 1.432
ArgMet: 0.627 ± 0.684
ArgAsn: 1.882 ± 0.873
ArgPro: 3.137 ± 1.099
ArgGln: 2.509 ± 1.315
ArgArg: 5.646 ± 1.5
ArgSer: 3.764 ± 1.517
ArgThr: 1.882 ± 0.737
ArgVal: 5.019 ± 1.91
ArgTrp: 1.882 ± 0.972
ArgTyr: 3.764 ± 1.303
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.882 ± 0.866
SerCys: 1.255 ± 0.854
SerAsp: 4.391 ± 1.7
SerGlu: 5.646 ± 1.708
SerPhe: 5.646 ± 1.745
SerGly: 4.391 ± 1.74
SerHis: 1.882 ± 0.854
SerIle: 3.137 ± 1.238
SerLys: 0.627 ± 0.684
SerLeu: 5.019 ± 1.349
SerMet: 2.509 ± 1.01
SerAsn: 3.764 ± 1.678
SerPro: 0.627 ± 0.684
SerGln: 3.764 ± 1.511
SerArg: 5.019 ± 1.829
SerSer: 8.783 ± 3.247
SerThr: 3.764 ± 1.15
SerVal: 5.646 ± 2.419
SerTrp: 0.627 ± 0.557
SerTyr: 1.255 ± 0.686
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.764 ± 1.453
ThrCys: 0.627 ± 0.651
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.882 ± 0.813
ThrPhe: 1.882 ± 0.873
ThrGly: 4.391 ± 1.39
ThrHis: 0.627 ± 0.651
ThrIle: 2.509 ± 0.881
ThrLys: 3.764 ± 0.967
ThrLeu: 5.646 ± 1.194
ThrMet: 2.509 ± 1.052
ThrAsn: 0.627 ± 0.494
ThrPro: 0.627 ± 0.494
ThrGln: 1.882 ± 0.74
ThrArg: 2.509 ± 0.66
ThrSer: 3.764 ± 1.34
ThrThr: 1.255 ± 0.678
ThrVal: 0.627 ± 0.684
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.255 ± 0.686
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.764 ± 1.209
ValCys: 0.627 ± 0.651
ValAsp: 1.255 ± 0.747
ValGlu: 6.274 ± 2.606
ValPhe: 5.019 ± 1.594
ValGly: 3.137 ± 0.896
ValHis: 1.882 ± 0.873
ValIle: 6.274 ± 1.85
ValLys: 6.901 ± 3.068
ValLeu: 6.901 ± 1.61
ValMet: 6.274 ± 1.272
ValAsn: 5.019 ± 1.083
ValPro: 1.255 ± 0.776
ValGln: 5.019 ± 1.966
ValArg: 2.509 ± 1.476
ValSer: 6.274 ± 3.017
ValThr: 2.509 ± 1.997
ValVal: 3.764 ± 1.482
ValTrp: 0.0 ± 0.0
ValTyr: 3.137 ± 1.382
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.627 ± 0.494
TrpCys: 0.627 ± 0.514
TrpAsp: 0.627 ± 0.514
TrpGlu: 1.882 ± 0.644
TrpPhe: 1.255 ± 0.854
TrpGly: 1.255 ± 0.775
TrpHis: 0.0 ± 0.0
TrpIle: 0.627 ± 0.649
TrpLys: 1.255 ± 0.861
TrpLeu: 0.627 ± 0.494
TrpMet: 0.627 ± 0.514
TrpAsn: 0.627 ± 0.557
TrpPro: 0.0 ± 0.0
TrpGln: 2.509 ± 0.999
TrpArg: 0.627 ± 0.587
TrpSer: 0.627 ± 0.684
TrpThr: 0.627 ± 0.649
TrpVal: 3.764 ± 1.458
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.646 ± 1.363
TyrCys: 3.764 ± 1.609
TyrAsp: 2.509 ± 0.705
TyrGlu: 1.882 ± 0.782
TyrPhe: 1.255 ± 0.762
TyrGly: 6.901 ± 1.648
TyrHis: 1.882 ± 1.02
TyrIle: 2.509 ± 0.881
TyrLys: 1.882 ± 0.972
TyrLeu: 3.764 ± 1.654
TyrMet: 0.627 ± 0.594
TyrAsn: 0.627 ± 0.494
TyrPro: 0.0 ± 0.0
TyrGln: 3.137 ± 1.469
TyrArg: 3.137 ± 1.473
TyrSer: 3.764 ± 2.192
TyrThr: 0.627 ± 0.594
TyrVal: 3.137 ± 1.472
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.255 ± 0.861
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1595 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
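One way to read the note above is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is reported as the error. This is an assumption-laden illustration, not the database's actual procedure. The helper names `pair_counts` and `bootstrap_error`, the use of the population standard deviation as the spread measure, and the fixed random seed are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumptions: the resampling unit is the whole protein, each round draws
    as many proteins as the proteome contains, and the error is taken as the
    standard deviation of the per-round frequencies.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw proteins with replacement; some repeat, some are left out.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the pairs of one protein together, which is why a small proteome such as this one (8 proteins) can show errors that are large relative to the values.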

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski