Amino acid dipepetide frequency for Shuangao Insect Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.414AlaAla: 2.414 ± 4.033
0.724AlaCys: 0.724 ± 0.314
2.172AlaAsp: 2.172 ± 0.796
1.69AlaGlu: 1.69 ± 1.432
2.414AlaPhe: 2.414 ± 1.295
3.379AlaGly: 3.379 ± 2.864
1.69AlaHis: 1.69 ± 1.456
1.931AlaIle: 1.931 ± 0.453
3.379AlaLys: 3.379 ± 2.87
4.827AlaLeu: 4.827 ± 2.59
1.448AlaMet: 1.448 ± 0.59
1.207AlaAsn: 1.207 ± 0.44
1.207AlaPro: 1.207 ± 0.791
0.965AlaGln: 0.965 ± 0.314
2.655AlaArg: 2.655 ± 1.279
3.138AlaSer: 3.138 ± 1.072
2.414AlaThr: 2.414 ± 0.68
3.621AlaVal: 3.621 ± 0.949
0.241AlaTrp: 0.241 ± 0.145
1.931AlaTyr: 1.931 ± 0.544
0.0AlaXaa: 0.0 ± 0.0
Cys
0.724CysAla: 0.724 ± 0.314
0.0CysCys: 0.0 ± 0.0
1.448CysAsp: 1.448 ± 0.629
1.931CysGlu: 1.931 ± 0.488
0.483CysPhe: 0.483 ± 0.136
2.896CysGly: 2.896 ± 2.445
0.724CysHis: 0.724 ± 0.314
2.414CysIle: 2.414 ± 0.357
2.655CysLys: 2.655 ± 1.066
1.207CysLeu: 1.207 ± 0.44
0.724CysMet: 0.724 ± 0.314
1.69CysAsn: 1.69 ± 0.825
1.207CysPro: 1.207 ± 0.302
0.0CysGln: 0.0 ± 0.0
0.241CysArg: 0.241 ± 0.145
2.172CysSer: 2.172 ± 0.386
0.965CysThr: 0.965 ± 0.815
1.931CysVal: 1.931 ± 1.394
0.965CysTrp: 0.965 ± 0.512
1.207CysTyr: 1.207 ± 0.44
0.0CysXaa: 0.0 ± 0.0
Asp
1.207AspAla: 1.207 ± 0.645
1.69AspCys: 1.69 ± 0.57
1.931AspAsp: 1.931 ± 0.875
4.345AspGlu: 4.345 ± 1.111
2.172AspPhe: 2.172 ± 0.581
0.724AspGly: 0.724 ± 0.194
0.965AspHis: 0.965 ± 0.314
6.517AspIle: 6.517 ± 1.655
5.552AspLys: 5.552 ± 1.569
6.517AspLeu: 6.517 ± 2.07
1.207AspMet: 1.207 ± 0.447
3.621AspAsn: 3.621 ± 0.949
1.931AspPro: 1.931 ± 0.488
1.448AspGln: 1.448 ± 0.64
1.931AspArg: 1.931 ± 0.504
2.655AspSer: 2.655 ± 0.722
3.621AspThr: 3.621 ± 0.798
3.379AspVal: 3.379 ± 1.211
1.207AspTrp: 1.207 ± 0.647
2.172AspTyr: 2.172 ± 0.72
0.0AspXaa: 0.0 ± 0.0
Glu
3.862GluAla: 3.862 ± 1.437
1.69GluCys: 1.69 ± 0.744
3.862GluAsp: 3.862 ± 0.341
5.552GluGlu: 5.552 ± 1.663
2.414GluPhe: 2.414 ± 0.357
2.172GluGly: 2.172 ± 1.326
1.207GluHis: 1.207 ± 0.302
6.276GluIle: 6.276 ± 0.914
4.586GluLys: 4.586 ± 0.741
5.069GluLeu: 5.069 ± 1.357
1.69GluMet: 1.69 ± 0.194
3.138GluAsn: 3.138 ± 0.589
2.172GluPro: 2.172 ± 0.762
2.655GluGln: 2.655 ± 0.264
3.379GluArg: 3.379 ± 1.101
5.793GluSer: 5.793 ± 0.651
5.31GluThr: 5.31 ± 1.327
2.655GluVal: 2.655 ± 0.722
0.483GluTrp: 0.483 ± 0.29
3.138GluTyr: 3.138 ± 0.284
0.0GluXaa: 0.0 ± 0.0
Phe
0.965PheAla: 0.965 ± 0.686
0.965PheCys: 0.965 ± 0.272
3.379PheAsp: 3.379 ± 1.139
1.931PheGlu: 1.931 ± 1.373
1.931PhePhe: 1.931 ± 1.024
2.414PheGly: 2.414 ± 2.233
0.965PheHis: 0.965 ± 0.776
2.414PheIle: 2.414 ± 0.603
2.655PheLys: 2.655 ± 0.687
6.758PheLeu: 6.758 ± 1.194
1.207PheMet: 1.207 ± 0.45
1.931PheAsn: 1.931 ± 0.488
0.483PhePro: 0.483 ± 0.136
0.965PheGln: 0.965 ± 0.272
0.965PheArg: 0.965 ± 0.272
5.069PheSer: 5.069 ± 1.276
2.414PheThr: 2.414 ± 1.451
2.172PheVal: 2.172 ± 0.72
0.0PheTrp: 0.0 ± 0.0
0.965PheTyr: 0.965 ± 0.314
0.0PheXaa: 0.0 ± 0.0
Gly
3.862GlyAla: 3.862 ± 2.747
2.414GlyCys: 2.414 ± 0.68
2.172GlyAsp: 2.172 ± 0.386
3.138GlyGlu: 3.138 ± 1.997
2.896GlyPhe: 2.896 ± 0.725
1.448GlyGly: 1.448 ± 0.565
1.207GlyHis: 1.207 ± 0.645
3.379GlyIle: 3.379 ± 1.101
4.586GlyLys: 4.586 ± 2.704
3.862GlyLeu: 3.862 ± 1.815
0.483GlyMet: 0.483 ± 0.29
2.172GlyAsn: 2.172 ± 0.702
0.965GlyPro: 0.965 ± 0.314
1.448GlyGln: 1.448 ± 0.629
2.655GlyArg: 2.655 ± 0.688
3.621GlySer: 3.621 ± 0.51
3.379GlyThr: 3.379 ± 1.651
3.379GlyVal: 3.379 ± 0.852
0.965GlyTrp: 0.965 ± 0.512
3.379GlyTyr: 3.379 ± 1.101
0.0GlyXaa: 0.0 ± 0.0
His
0.483HisAla: 0.483 ± 0.29
1.448HisCys: 1.448 ± 0.629
2.655HisAsp: 2.655 ± 0.688
1.448HisGlu: 1.448 ± 1.498
1.448HisPhe: 1.448 ± 0.388
2.172HisGly: 2.172 ± 0.702
0.241HisHis: 0.241 ± 0.204
1.931HisIle: 1.931 ± 0.629
1.69HisLys: 1.69 ± 0.665
2.655HisLeu: 2.655 ± 0.688
0.241HisMet: 0.241 ± 0.145
1.207HisAsn: 1.207 ± 0.726
0.241HisPro: 0.241 ± 0.867
0.483HisGln: 0.483 ± 0.136
0.724HisArg: 0.724 ± 0.435
1.69HisSer: 1.69 ± 0.732
1.207HisThr: 1.207 ± 0.791
0.483HisVal: 0.483 ± 0.812
0.0HisTrp: 0.0 ± 0.0
1.207HisTyr: 1.207 ± 0.791
0.0HisXaa: 0.0 ± 0.0
Ile
4.345IleAla: 4.345 ± 2.769
1.207IleCys: 1.207 ± 0.713
5.069IleAsp: 5.069 ± 1.011
6.276IleGlu: 6.276 ± 0.917
2.655IlePhe: 2.655 ± 0.722
4.103IleGly: 4.103 ± 1.516
1.931IleHis: 1.931 ± 0.752
3.621IleIle: 3.621 ± 0.955
5.069IleLys: 5.069 ± 0.34
7.241IleLeu: 7.241 ± 0.616
1.69IleMet: 1.69 ± 0.57
4.586IleAsn: 4.586 ± 1.656
2.896IlePro: 2.896 ± 1.008
3.621IleGln: 3.621 ± 1.009
3.621IleArg: 3.621 ± 1.009
6.276IleSer: 6.276 ± 1.642
5.31IleThr: 5.31 ± 0.605
3.138IleVal: 3.138 ± 1.076
0.241IleTrp: 0.241 ± 0.145
3.379IleTyr: 3.379 ± 0.288
0.0IleXaa: 0.0 ± 0.0
Lys
2.896LysAla: 2.896 ± 3.915
1.207LysCys: 1.207 ± 0.302
3.862LysAsp: 3.862 ± 1.295
5.793LysGlu: 5.793 ± 0.651
2.896LysPhe: 2.896 ± 0.793
3.379LysGly: 3.379 ± 0.097
1.931LysHis: 1.931 ± 0.875
6.758LysIle: 6.758 ± 3.894
6.517LysLys: 6.517 ± 2.07
6.758LysLeu: 6.758 ± 0.458
2.172LysMet: 2.172 ± 0.556
6.276LysAsn: 6.276 ± 0.642
3.621LysPro: 3.621 ± 0.51
1.931LysGln: 1.931 ± 0.488
4.586LysArg: 4.586 ± 0.347
6.758LysSer: 6.758 ± 0.772
7.724LysThr: 7.724 ± 0.359
5.793LysVal: 5.793 ± 0.856
0.241LysTrp: 0.241 ± 0.145
1.69LysTyr: 1.69 ± 1.118
0.0LysXaa: 0.0 ± 0.0
Leu
2.414LeuAla: 2.414 ± 0.598
1.931LeuCys: 1.931 ± 0.544
6.517LeuAsp: 6.517 ± 0.728
6.276LeuGlu: 6.276 ± 1.173
3.379LeuPhe: 3.379 ± 0.852
6.276LeuGly: 6.276 ± 1.376
3.138LeuHis: 3.138 ± 1.213
5.793LeuIle: 5.793 ± 1.966
8.689LeuLys: 8.689 ± 0.364
8.689LeuLeu: 8.689 ± 2.222
1.69LeuMet: 1.69 ± 0.426
6.034LeuAsn: 6.034 ± 1.817
1.931LeuPro: 1.931 ± 0.488
2.896LeuGln: 2.896 ± 1.112
3.379LeuArg: 3.379 ± 0.871
7.724LeuSer: 7.724 ± 1.961
4.586LeuThr: 4.586 ± 1.591
5.552LeuVal: 5.552 ± 0.881
0.965LeuTrp: 0.965 ± 0.581
3.138LeuTyr: 3.138 ± 1.233
0.0LeuXaa: 0.0 ± 0.0
Met
1.207MetAla: 1.207 ± 0.647
0.724MetCys: 0.724 ± 0.314
2.172MetAsp: 2.172 ± 0.581
2.172MetGlu: 2.172 ± 0.762
0.724MetPhe: 0.724 ± 0.194
1.207MetGly: 1.207 ± 0.45
0.241MetHis: 0.241 ± 0.145
0.965MetIle: 0.965 ± 0.314
2.414MetLys: 2.414 ± 0.598
1.69MetLeu: 1.69 ± 0.426
1.207MetMet: 1.207 ± 0.44
1.207MetAsn: 1.207 ± 0.44
0.241MetPro: 0.241 ± 0.145
0.483MetGln: 0.483 ± 0.29
1.207MetArg: 1.207 ± 0.647
2.655MetSer: 2.655 ± 0.405
1.69MetThr: 1.69 ± 0.732
1.207MetVal: 1.207 ± 0.302
0.0MetTrp: 0.0 ± 0.0
1.448MetTyr: 1.448 ± 0.388
0.0MetXaa: 0.0 ± 0.0
Asn
2.896AsnAla: 2.896 ± 0.232
1.448AsnCys: 1.448 ± 0.629
5.069AsnAsp: 5.069 ± 0.468
4.586AsnGlu: 4.586 ± 0.715
1.448AsnPhe: 1.448 ± 0.388
2.172AsnGly: 2.172 ± 0.702
0.724AsnHis: 0.724 ± 0.435
4.827AsnIle: 4.827 ± 0.542
5.31AsnLys: 5.31 ± 1.467
4.586AsnLeu: 4.586 ± 0.347
1.69AsnMet: 1.69 ± 0.515
3.138AsnAsn: 3.138 ± 0.788
0.724AsnPro: 0.724 ± 0.435
1.448AsnGln: 1.448 ± 0.59
2.172AsnArg: 2.172 ± 0.556
4.103AsnSer: 4.103 ± 1.026
4.345AsnThr: 4.345 ± 1.524
2.896AsnVal: 2.896 ± 2.059
0.483AsnTrp: 0.483 ± 0.29
3.621AsnTyr: 3.621 ± 0.51
0.0AsnXaa: 0.0 ± 0.0
Pro
1.448ProAla: 1.448 ± 0.59
0.241ProCys: 0.241 ± 0.204
2.655ProAsp: 2.655 ± 0.679
1.448ProGlu: 1.448 ± 0.565
1.207ProPhe: 1.207 ± 0.726
2.414ProGly: 2.414 ± 0.598
0.724ProHis: 0.724 ± 0.194
2.414ProIle: 2.414 ± 1.405
2.414ProLys: 2.414 ± 0.867
2.655ProLeu: 2.655 ± 0.679
1.207ProMet: 1.207 ± 0.302
2.655ProAsn: 2.655 ± 0.814
0.724ProPro: 0.724 ± 0.314
0.483ProGln: 0.483 ± 0.408
1.207ProArg: 1.207 ± 0.302
1.69ProSer: 1.69 ± 1.456
1.207ProThr: 1.207 ± 0.44
2.172ProVal: 2.172 ± 0.581
0.241ProTrp: 0.241 ± 0.204
1.207ProTyr: 1.207 ± 0.798
0.0ProXaa: 0.0 ± 0.0
Gln
0.965GlnAla: 0.965 ± 1.624
0.483GlnCys: 0.483 ± 0.136
0.483GlnAsp: 0.483 ± 0.29
2.172GlnGlu: 2.172 ± 0.581
0.965GlnPhe: 0.965 ± 0.272
1.448GlnGly: 1.448 ± 0.565
0.483GlnHis: 0.483 ± 0.136
2.172GlnIle: 2.172 ± 0.386
1.448GlnLys: 1.448 ± 0.59
3.138GlnLeu: 3.138 ± 1.076
0.483GlnMet: 0.483 ± 0.29
0.724GlnAsn: 0.724 ± 0.435
1.931GlnPro: 1.931 ± 0.719
1.931GlnGln: 1.931 ± 0.752
1.448GlnArg: 1.448 ± 0.388
2.172GlnSer: 2.172 ± 0.762
2.172GlnThr: 2.172 ± 0.581
3.138GlnVal: 3.138 ± 0.969
0.0GlnTrp: 0.0 ± 0.0
1.931GlnTyr: 1.931 ± 0.544
0.0GlnXaa: 0.0 ± 0.0
Arg
1.931ArgAla: 1.931 ± 1.394
1.448ArgCys: 1.448 ± 0.883
0.724ArgAsp: 0.724 ± 0.194
4.345ArgGlu: 4.345 ± 1.111
2.414ArgPhe: 2.414 ± 0.368
1.931ArgGly: 1.931 ± 0.488
0.724ArgHis: 0.724 ± 0.194
2.172ArgIle: 2.172 ± 1.316
3.379ArgLys: 3.379 ± 0.871
5.069ArgLeu: 5.069 ± 0.475
0.724ArgMet: 0.724 ± 0.752
2.655ArgAsn: 2.655 ± 0.434
1.69ArgPro: 1.69 ± 0.55
0.483ArgGln: 0.483 ± 0.136
1.448ArgArg: 1.448 ± 0.883
3.621ArgSer: 3.621 ± 1.128
3.138ArgThr: 3.138 ± 0.788
1.207ArgVal: 1.207 ± 0.726
0.483ArgTrp: 0.483 ± 0.29
2.896ArgTyr: 2.896 ± 1.28
0.0ArgXaa: 0.0 ± 0.0
Ser
3.379SerAla: 3.379 ± 1.034
1.69SerCys: 1.69 ± 0.57
3.379SerAsp: 3.379 ± 0.871
3.621SerGlu: 3.621 ± 0.955
3.379SerPhe: 3.379 ± 0.709
3.862SerGly: 3.862 ± 0.901
1.69SerHis: 1.69 ± 0.665
6.758SerIle: 6.758 ± 1.743
8.689SerLys: 8.689 ± 2.334
7.483SerLeu: 7.483 ± 0.995
1.931SerMet: 1.931 ± 0.875
3.862SerAsn: 3.862 ± 0.906
2.896SerPro: 2.896 ± 0.232
2.896SerGln: 2.896 ± 0.943
2.655SerArg: 2.655 ± 0.264
6.758SerSer: 6.758 ± 0.965
5.31SerThr: 5.31 ± 2.095
5.069SerVal: 5.069 ± 1.359
1.448SerTrp: 1.448 ± 0.408
3.138SerTyr: 3.138 ± 0.885
0.0SerXaa: 0.0 ± 0.0
Thr
3.621ThrAla: 3.621 ± 0.51
2.896ThrCys: 2.896 ± 1.537
1.931ThrAsp: 1.931 ± 0.544
3.621ThrGlu: 3.621 ± 0.471
2.414ThrPhe: 2.414 ± 0.89
4.586ThrGly: 4.586 ± 1.167
1.207ThrHis: 1.207 ± 0.44
6.276ThrIle: 6.276 ± 0.567
5.552ThrLys: 5.552 ± 0.528
3.862ThrLeu: 3.862 ± 1.087
1.448ThrMet: 1.448 ± 0.59
4.345ThrAsn: 4.345 ± 1.769
2.172ThrPro: 2.172 ± 0.943
1.931ThrGln: 1.931 ± 0.544
2.172ThrArg: 2.172 ± 0.72
4.586ThrSer: 4.586 ± 0.347
5.31ThrThr: 5.31 ± 0.447
5.069ThrVal: 5.069 ± 2.201
0.483ThrTrp: 0.483 ± 0.408
2.655ThrTyr: 2.655 ± 0.434
0.0ThrXaa: 0.0 ± 0.0
Val
1.69ValAla: 1.69 ± 0.502
2.414ValCys: 2.414 ± 1.405
3.379ValAsp: 3.379 ± 1.936
5.069ValGlu: 5.069 ± 0.741
2.896ValPhe: 2.896 ± 0.232
2.655ValGly: 2.655 ± 1.198
2.414ValHis: 2.414 ± 0.68
5.31ValIle: 5.31 ± 2.132
3.621ValLys: 3.621 ± 0.955
3.621ValLeu: 3.621 ± 0.905
1.207ValMet: 1.207 ± 0.302
3.862ValAsn: 3.862 ± 1.815
1.931ValPro: 1.931 ± 0.453
1.69ValGln: 1.69 ± 1.016
2.896ValArg: 2.896 ± 0.725
4.345ValSer: 4.345 ± 1.316
3.379ValThr: 3.379 ± 0.871
3.138ValVal: 3.138 ± 1.453
0.724ValTrp: 0.724 ± 0.194
2.414ValTyr: 2.414 ± 1.176
0.0ValXaa: 0.0 ± 0.0
Trp
0.724TrpAla: 0.724 ± 0.314
0.0TrpCys: 0.0 ± 0.0
0.724TrpAsp: 0.724 ± 0.435
0.483TrpGlu: 0.483 ± 0.29
0.724TrpPhe: 0.724 ± 0.194
0.965TrpGly: 0.965 ± 0.512
0.483TrpHis: 0.483 ± 0.29
0.241TrpIle: 0.241 ± 0.145
0.241TrpLys: 0.241 ± 0.204
0.483TrpLeu: 0.483 ± 0.29
0.483TrpMet: 0.483 ± 0.29
0.483TrpAsn: 0.483 ± 0.29
0.0TrpPro: 0.0 ± 0.0
0.241TrpGln: 0.241 ± 0.145
0.965TrpArg: 0.965 ± 0.272
1.448TrpSer: 1.448 ± 1.503
0.483TrpThr: 0.483 ± 0.408
0.241TrpVal: 0.241 ± 0.204
0.241TrpTrp: 0.241 ± 0.204
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.414TyrAla: 2.414 ± 0.867
0.965TyrCys: 0.965 ± 0.272
1.207TyrAsp: 1.207 ± 0.645
1.448TyrGlu: 1.448 ± 0.565
1.69TyrPhe: 1.69 ± 0.55
1.207TyrGly: 1.207 ± 0.791
0.965TyrHis: 0.965 ± 0.314
4.103TyrIle: 4.103 ± 0.542
4.345TyrLys: 4.345 ± 0.176
4.827TyrLeu: 4.827 ± 0.312
1.69TyrMet: 1.69 ± 0.912
2.896TyrAsn: 2.896 ± 0.577
1.448TyrPro: 1.448 ± 0.844
1.69TyrGln: 1.69 ± 0.426
2.172TyrArg: 2.172 ± 0.533
3.621TyrSer: 3.621 ± 0.969
2.172TyrThr: 2.172 ± 0.556
2.414TyrVal: 2.414 ± 0.68
0.241TyrTrp: 0.241 ± 0.867
2.414TyrTyr: 2.414 ± 0.357
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4144 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski