Amino acid dipeptide frequency for Shayang Fly Virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
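The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the one-letter input alphabet, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues (one-letter codes); anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs, counted inside each protein only (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(ALPHABET, repeat=2):
        freqs[a + b] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


# Toy input (hypothetical sequences, not the Shayang Fly Virus 1 proteome).
toy = ["MKLLA", "LLSX"]
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / 400  # 2.5 per mille if all 400 standard dipeptides were equally likely
for pair, value in sorted(freqs.items()):
    if value > 0:
        label = "over" if value > uniform else "under"
        print(f"{pair}: {value:.3f} per mille ({label}represented)")
```

With real proteomes, the same comparison against 2.5‰ is what separates the overrepresented dipeptides from the underrepresented ones in the table.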

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
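As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names `per_mille_to_fraction` and `fold_over_uniform` are hypothetical; the LeuLeu and CysCys values are taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation (2.5 per mille for 400)."""
    expected = 1000.0 / n_dipeptides
    return value_per_mille / expected


leu_leu = 10.339  # LeuLeu, per mille, from the table
cys_cys = 0.24    # CysCys, per mille, from the table

print(round(per_mille_to_fraction(leu_leu), 6))
print(round(fold_over_uniform(leu_leu), 2))
print(round(fold_over_uniform(cys_cys), 3))
```

Under the uniform expectation, LeuLeu is about four times more frequent than expected, while CysCys occurs at roughly a tenth of the expected frequency.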

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.366AlaAla: 3.366 ± 0.737
0.721AlaCys: 0.721 ± 0.44
1.202AlaAsp: 1.202 ± 0.433
1.924AlaGlu: 1.924 ± 0.841
3.126AlaPhe: 3.126 ± 0.739
2.645AlaGly: 2.645 ± 0.9
1.924AlaHis: 1.924 ± 0.823
4.088AlaIle: 4.088 ± 1.028
2.164AlaLys: 2.164 ± 0.421
6.492AlaLeu: 6.492 ± 1.249
1.924AlaMet: 1.924 ± 0.822
2.885AlaAsn: 2.885 ± 1.45
2.164AlaPro: 2.164 ± 0.479
2.404AlaGln: 2.404 ± 0.651
3.366AlaArg: 3.366 ± 1.16
3.607AlaSer: 3.607 ± 0.348
4.328AlaThr: 4.328 ± 0.638
4.088AlaVal: 4.088 ± 0.338
1.443AlaTrp: 1.443 ± 0.204
2.164AlaTyr: 2.164 ± 0.629
0.0AlaXaa: 0.0 ± 0.0
Cys
0.481CysAla: 0.481 ± 0.177
0.24CysCys: 0.24 ± 0.133
0.721CysAsp: 0.721 ± 0.155
0.24CysGlu: 0.24 ± 0.292
0.962CysPhe: 0.962 ± 0.354
0.721CysGly: 0.721 ± 0.59
0.481CysHis: 0.481 ± 0.177
1.683CysIle: 1.683 ± 1.149
1.443CysLys: 1.443 ± 0.8
1.202CysLeu: 1.202 ± 0.305
0.0CysMet: 0.0 ± 0.0
0.481CysAsn: 0.481 ± 0.267
1.683CysPro: 1.683 ± 0.934
2.404CysGln: 2.404 ± 1.225
0.962CysArg: 0.962 ± 0.229
1.924CysSer: 1.924 ± 0.458
0.481CysThr: 0.481 ± 0.259
1.202CysVal: 1.202 ± 0.305
0.0CysTrp: 0.0 ± 0.0
0.24CysTyr: 0.24 ± 0.292
0.0CysXaa: 0.0 ± 0.0
Asp
2.404AspAla: 2.404 ± 1.006
0.962AspCys: 0.962 ± 0.229
2.885AspAsp: 2.885 ± 0.385
5.049AspGlu: 5.049 ± 0.295
1.443AspPhe: 1.443 ± 0.8
3.126AspGly: 3.126 ± 0.36
1.443AspHis: 1.443 ± 0.531
3.847AspIle: 3.847 ± 1.145
1.924AspLys: 1.924 ± 0.407
3.366AspLeu: 3.366 ± 0.136
1.683AspMet: 1.683 ± 0.475
2.885AspAsn: 2.885 ± 0.667
2.885AspPro: 2.885 ± 1.116
3.847AspGln: 3.847 ± 1.421
1.202AspArg: 1.202 ± 0.324
4.809AspSer: 4.809 ± 1.06
3.366AspThr: 3.366 ± 0.377
3.366AspVal: 3.366 ± 0.524
1.202AspTrp: 1.202 ± 0.981
2.404AspTyr: 2.404 ± 0.683
0.0AspXaa: 0.0 ± 0.0
Glu
3.607GluAla: 3.607 ± 1.187
0.24GluCys: 0.24 ± 0.133
2.164GluAsp: 2.164 ± 0.216
2.164GluGlu: 2.164 ± 0.583
1.202GluPhe: 1.202 ± 0.667
1.924GluGly: 1.924 ± 0.458
0.721GluHis: 0.721 ± 0.155
3.607GluIle: 3.607 ± 0.497
2.164GluLys: 2.164 ± 0.583
5.29GluLeu: 5.29 ± 1.102
1.683GluMet: 1.683 ± 0.389
2.404GluAsn: 2.404 ± 0.517
0.721GluPro: 0.721 ± 0.44
2.164GluGln: 2.164 ± 0.875
2.885GluArg: 2.885 ± 0.316
4.809GluSer: 4.809 ± 0.069
2.885GluThr: 2.885 ± 0.68
3.366GluVal: 3.366 ± 1.015
1.683GluTrp: 1.683 ± 0.371
0.481GluTyr: 0.481 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
1.202PheAla: 1.202 ± 0.305
0.721PheCys: 0.721 ± 0.292
2.164PheAsp: 2.164 ± 0.216
1.202PheGlu: 1.202 ± 0.305
1.683PhePhe: 1.683 ± 0.325
2.164PheGly: 2.164 ± 0.479
1.924PheHis: 1.924 ± 1.067
2.164PheIle: 2.164 ± 0.853
1.202PheLys: 1.202 ± 0.305
3.847PheLeu: 3.847 ± 0.413
0.962PheMet: 0.962 ± 0.534
1.443PheAsn: 1.443 ± 0.204
2.885PhePro: 2.885 ± 0.316
1.924PheGln: 1.924 ± 0.089
1.683PheArg: 1.683 ± 0.593
4.809PheSer: 4.809 ± 0.685
1.202PheThr: 1.202 ± 0.433
1.202PheVal: 1.202 ± 0.341
0.24PheTrp: 0.24 ± 0.133
0.721PheTyr: 0.721 ± 0.536
0.0PheXaa: 0.0 ± 0.0
Gly
1.924GlyAla: 1.924 ± 0.823
0.721GlyCys: 0.721 ± 0.4
1.924GlyAsp: 1.924 ± 0.744
3.126GlyGlu: 3.126 ± 0.72
1.924GlyPhe: 1.924 ± 0.089
3.126GlyGly: 3.126 ± 0.747
0.962GlyHis: 0.962 ± 0.229
1.683GlyIle: 1.683 ± 0.325
0.721GlyLys: 0.721 ± 0.252
6.011GlyLeu: 6.011 ± 0.58
1.202GlyMet: 1.202 ± 0.324
0.962GlyAsn: 0.962 ± 0.457
1.924GlyPro: 1.924 ± 0.879
1.202GlyGln: 1.202 ± 0.433
2.164GlyArg: 2.164 ± 0.242
3.607GlySer: 3.607 ± 0.391
1.443GlyThr: 1.443 ± 0.531
3.847GlyVal: 3.847 ± 0.308
2.404GlyTrp: 2.404 ± 0.232
1.443GlyTyr: 1.443 ± 0.311
0.0GlyXaa: 0.0 ± 0.0
His
1.443HisAla: 1.443 ± 0.192
0.481HisCys: 0.481 ± 0.545
0.962HisAsp: 0.962 ± 0.519
1.443HisGlu: 1.443 ± 0.311
0.962HisPhe: 0.962 ± 0.534
1.924HisGly: 1.924 ± 0.458
3.607HisHis: 3.607 ± 1.488
2.164HisIle: 2.164 ± 0.421
1.683HisLys: 1.683 ± 0.787
3.847HisLeu: 3.847 ± 0.906
1.683HisMet: 1.683 ± 0.389
1.683HisAsn: 1.683 ± 0.325
1.924HisPro: 1.924 ± 0.089
0.962HisGln: 0.962 ± 0.754
1.202HisArg: 1.202 ± 0.324
2.885HisSer: 2.885 ± 0.316
1.443HisThr: 1.443 ± 0.192
0.962HisVal: 0.962 ± 0.144
1.202HisTrp: 1.202 ± 0.305
1.202HisTyr: 1.202 ± 0.341
0.0HisXaa: 0.0 ± 0.0
Ile
4.568IleAla: 4.568 ± 1.694
1.202IleCys: 1.202 ± 0.613
3.847IleAsp: 3.847 ± 0.917
4.568IleGlu: 4.568 ± 0.702
1.683IlePhe: 1.683 ± 0.593
2.645IleGly: 2.645 ± 0.481
1.443IleHis: 1.443 ± 0.204
6.732IleIle: 6.732 ± 1.705
3.366IleLys: 3.366 ± 0.524
6.973IleLeu: 6.973 ± 2.118
2.164IleMet: 2.164 ± 0.853
2.404IleAsn: 2.404 ± 0.29
3.847IlePro: 3.847 ± 1.455
3.126IleGln: 3.126 ± 1.321
2.885IleArg: 2.885 ± 0.409
6.732IleSer: 6.732 ± 1.502
3.126IleThr: 3.126 ± 0.539
6.252IleVal: 6.252 ± 0.638
0.962IleTrp: 0.962 ± 0.144
4.088IleTyr: 4.088 ± 0.505
0.0IleXaa: 0.0 ± 0.0
Lys
3.126LysAla: 3.126 ± 1.504
1.443LysCys: 1.443 ± 0.311
4.088LysAsp: 4.088 ± 0.179
1.683LysGlu: 1.683 ± 0.368
1.683LysPhe: 1.683 ± 0.457
1.443LysGly: 1.443 ± 0.593
0.24LysHis: 0.24 ± 0.133
3.607LysIle: 3.607 ± 0.449
1.683LysLys: 1.683 ± 0.593
4.568LysLeu: 4.568 ± 1.201
0.962LysMet: 0.962 ± 0.229
1.683LysAsn: 1.683 ± 0.325
0.962LysPro: 0.962 ± 0.516
0.721LysGln: 0.721 ± 0.4
1.683LysArg: 1.683 ± 0.368
5.29LysSer: 5.29 ± 1.251
2.164LysThr: 2.164 ± 0.613
3.847LysVal: 3.847 ± 2.234
0.962LysTrp: 0.962 ± 0.144
1.924LysTyr: 1.924 ± 0.723
0.0LysXaa: 0.0 ± 0.0
Leu
5.53LeuAla: 5.53 ± 0.708
2.164LeuCys: 2.164 ± 0.225
6.973LeuAsp: 6.973 ± 2.012
3.126LeuGlu: 3.126 ± 1.382
3.126LeuPhe: 3.126 ± 0.999
4.809LeuGly: 4.809 ± 0.783
1.924LeuHis: 1.924 ± 0.635
7.694LeuIle: 7.694 ± 0.826
5.29LeuLys: 5.29 ± 0.45
10.339LeuLeu: 10.339 ± 2.427
3.607LeuMet: 3.607 ± 0.086
6.252LeuAsn: 6.252 ± 1.186
6.011LeuPro: 6.011 ± 0.55
5.53LeuGln: 5.53 ± 0.376
6.732LeuArg: 6.732 ± 0.91
10.099LeuSer: 10.099 ± 1.489
8.656LeuThr: 8.656 ± 1.98
6.011LeuVal: 6.011 ± 1.661
1.443LeuTrp: 1.443 ± 0.593
2.645LeuTyr: 2.645 ± 1.117
0.0LeuXaa: 0.0 ± 0.0
Met
1.202MetAla: 1.202 ± 0.324
0.721MetCys: 0.721 ± 0.44
2.404MetAsp: 2.404 ± 0.683
0.481MetGlu: 0.481 ± 0.259
1.683MetPhe: 1.683 ± 0.389
0.481MetGly: 0.481 ± 0.177
0.481MetHis: 0.481 ± 0.267
3.607MetIle: 3.607 ± 1.119
1.202MetLys: 1.202 ± 0.708
2.645MetLeu: 2.645 ± 0.593
1.202MetMet: 1.202 ± 0.951
0.962MetAsn: 0.962 ± 0.229
1.443MetPro: 1.443 ± 0.192
0.962MetGln: 0.962 ± 0.229
1.924MetArg: 1.924 ± 0.823
2.645MetSer: 2.645 ± 0.304
2.645MetThr: 2.645 ± 0.481
1.202MetVal: 1.202 ± 0.628
0.0MetTrp: 0.0 ± 0.0
0.962MetTyr: 0.962 ± 0.372
0.0MetXaa: 0.0 ± 0.0
Asn
2.885AsnAla: 2.885 ± 0.409
0.481AsnCys: 0.481 ± 0.545
2.645AsnAsp: 2.645 ± 0.529
2.404AsnGlu: 2.404 ± 0.514
0.962AsnPhe: 0.962 ± 0.534
1.443AsnGly: 1.443 ± 0.381
2.164AsnHis: 2.164 ± 0.648
3.126AsnIle: 3.126 ± 1.148
1.683AsnLys: 1.683 ± 0.389
4.328AsnLeu: 4.328 ± 1.166
2.404AsnMet: 2.404 ± 0.508
2.404AsnAsn: 2.404 ± 0.348
4.088AsnPro: 4.088 ± 0.708
2.404AsnGln: 2.404 ± 0.348
1.924AsnArg: 1.924 ± 0.744
4.088AsnSer: 4.088 ± 0.303
0.962AsnThr: 0.962 ± 0.229
3.607AsnVal: 3.607 ± 0.348
0.481AsnTrp: 0.481 ± 0.177
1.443AsnTyr: 1.443 ± 0.381
0.0AsnXaa: 0.0 ± 0.0
Pro
2.404ProAla: 2.404 ± 0.517
1.443ProCys: 1.443 ± 0.465
2.404ProAsp: 2.404 ± 0.517
2.404ProGlu: 2.404 ± 0.867
2.164ProPhe: 2.164 ± 0.576
2.404ProGly: 2.404 ± 0.232
1.683ProHis: 1.683 ± 0.371
3.847ProIle: 3.847 ± 1.143
2.164ProLys: 2.164 ± 0.78
5.049ProLeu: 5.049 ± 0.452
0.962ProMet: 0.962 ± 0.229
3.607ProAsn: 3.607 ± 1.378
2.404ProPro: 2.404 ± 1.6
2.164ProGln: 2.164 ± 0.216
1.443ProArg: 1.443 ± 0.583
3.607ProSer: 3.607 ± 0.972
3.126ProThr: 3.126 ± 0.343
3.126ProVal: 3.126 ± 0.747
0.721ProTrp: 0.721 ± 0.44
2.885ProTyr: 2.885 ± 0.622
0.0ProXaa: 0.0 ± 0.0
Gln
2.885GlnAla: 2.885 ± 0.484
0.481GlnCys: 0.481 ± 0.177
1.683GlnAsp: 1.683 ± 0.848
1.443GlnGlu: 1.443 ± 0.465
2.164GlnPhe: 2.164 ± 0.466
1.683GlnGly: 1.683 ± 0.325
2.164GlnHis: 2.164 ± 0.613
5.53GlnIle: 5.53 ± 0.9
1.443GlnLys: 1.443 ± 0.581
4.328GlnLeu: 4.328 ± 0.551
1.683GlnMet: 1.683 ± 0.562
2.404GlnAsn: 2.404 ± 0.232
1.443GlnPro: 1.443 ± 0.381
2.164GlnGln: 2.164 ± 0.576
3.126GlnArg: 3.126 ± 0.43
3.607GlnSer: 3.607 ± 0.604
3.366GlnThr: 3.366 ± 0.136
3.607GlnVal: 3.607 ± 1.352
0.24GlnTrp: 0.24 ± 0.272
0.962GlnTyr: 0.962 ± 0.534
0.0GlnXaa: 0.0 ± 0.0
Arg
3.366ArgAla: 3.366 ± 1.402
0.721ArgCys: 0.721 ± 0.252
3.126ArgAsp: 3.126 ± 1.622
1.202ArgGlu: 1.202 ± 0.477
1.924ArgPhe: 1.924 ± 1.067
2.164ArgGly: 2.164 ± 0.875
1.924ArgHis: 1.924 ± 0.089
2.164ArgIle: 2.164 ± 0.583
2.164ArgLys: 2.164 ± 0.853
6.492ArgLeu: 6.492 ± 1.987
0.721ArgMet: 0.721 ± 0.536
3.366ArgAsn: 3.366 ± 1.04
2.645ArgPro: 2.645 ± 0.481
1.443ArgGln: 1.443 ± 0.465
2.645ArgArg: 2.645 ± 0.602
5.29ArgSer: 5.29 ± 0.806
1.443ArgThr: 1.443 ± 0.465
2.645ArgVal: 2.645 ± 0.823
1.202ArgTrp: 1.202 ± 1.097
2.164ArgTyr: 2.164 ± 0.629
0.0ArgXaa: 0.0 ± 0.0
Ser
6.732SerAla: 6.732 ± 1.048
1.924SerCys: 1.924 ± 0.458
6.011SerAsp: 6.011 ± 1.028
5.53SerGlu: 5.53 ± 0.84
2.645SerPhe: 2.645 ± 0.847
3.126SerGly: 3.126 ± 0.891
3.847SerHis: 3.847 ± 1.455
5.771SerIle: 5.771 ± 1.004
2.885SerLys: 2.885 ± 0.688
11.06SerLeu: 11.06 ± 0.443
0.962SerMet: 0.962 ± 0.534
2.164SerAsn: 2.164 ± 0.566
2.404SerPro: 2.404 ± 0.514
4.328SerGln: 4.328 ± 0.883
4.568SerArg: 4.568 ± 1.116
6.492SerSer: 6.492 ± 0.726
8.656SerThr: 8.656 ± 1.767
4.328SerVal: 4.328 ± 0.593
1.202SerTrp: 1.202 ± 0.324
3.607SerTyr: 3.607 ± 0.301
0.0SerXaa: 0.0 ± 0.0
Thr
2.885ThrAla: 2.885 ± 0.761
0.721ThrCys: 0.721 ± 0.44
2.645ThrAsp: 2.645 ± 0.529
3.126ThrGlu: 3.126 ± 1.037
1.683ThrPhe: 1.683 ± 0.593
2.164ThrGly: 2.164 ± 0.613
2.164ThrHis: 2.164 ± 0.648
4.568ThrIle: 4.568 ± 0.677
2.164ThrLys: 2.164 ± 0.78
8.415ThrLeu: 8.415 ± 0.475
1.683ThrMet: 1.683 ± 0.368
2.645ThrAsn: 2.645 ± 0.481
2.885ThrPro: 2.885 ± 0.93
2.885ThrGln: 2.885 ± 0.804
2.404ThrArg: 2.404 ± 0.953
4.088ThrSer: 4.088 ± 1.339
6.011ThrThr: 6.011 ± 1.355
4.809ThrVal: 4.809 ± 0.58
1.924ThrTrp: 1.924 ± 0.723
2.645ThrTyr: 2.645 ± 0.304
0.0ThrXaa: 0.0 ± 0.0
Val
3.366ValAla: 3.366 ± 1.41
0.721ValCys: 0.721 ± 0.155
3.607ValAsp: 3.607 ± 0.086
3.366ValGlu: 3.366 ± 0.65
1.683ValPhe: 1.683 ± 0.068
1.683ValGly: 1.683 ± 0.457
2.885ValHis: 2.885 ± 0.409
4.088ValIle: 4.088 ± 1.021
5.29ValLys: 5.29 ± 0.477
6.492ValLeu: 6.492 ± 1.97
1.924ValMet: 1.924 ± 0.709
2.645ValAsn: 2.645 ± 0.419
4.088ValPro: 4.088 ± 0.576
2.885ValGln: 2.885 ± 0.68
4.088ValArg: 4.088 ± 0.939
4.809ValSer: 4.809 ± 1.279
3.607ValThr: 3.607 ± 0.604
3.847ValVal: 3.847 ± 1.387
0.481ValTrp: 0.481 ± 0.177
2.885ValTyr: 2.885 ± 0.07
0.0ValXaa: 0.0 ± 0.0
Trp
0.962TrpAla: 0.962 ± 0.144
0.721TrpCys: 0.721 ± 0.155
0.721TrpAsp: 0.721 ± 0.155
0.481TrpGlu: 0.481 ± 0.267
0.962TrpPhe: 0.962 ± 0.372
0.962TrpGly: 0.962 ± 0.354
0.24TrpHis: 0.24 ± 0.133
0.481TrpIle: 0.481 ± 0.259
1.443TrpLys: 1.443 ± 0.311
2.404TrpLeu: 2.404 ± 0.611
0.962TrpMet: 0.962 ± 0.354
0.962TrpAsn: 0.962 ± 0.823
0.962TrpPro: 0.962 ± 0.534
0.721TrpGln: 0.721 ± 0.536
0.0TrpArg: 0.0 ± 0.0
1.683TrpSer: 1.683 ± 0.787
1.924TrpThr: 1.924 ± 0.709
1.202TrpVal: 1.202 ± 0.433
0.24TrpTrp: 0.24 ± 0.272
0.481TrpTyr: 0.481 ± 0.545
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.683TyrAla: 1.683 ± 1.358
0.721TyrCys: 0.721 ± 0.4
1.924TyrAsp: 1.924 ± 0.723
1.202TyrGlu: 1.202 ± 0.667
1.683TyrPhe: 1.683 ± 0.325
1.683TyrGly: 1.683 ± 0.668
1.443TyrHis: 1.443 ± 0.505
2.164TyrIle: 2.164 ± 0.479
1.683TyrLys: 1.683 ± 0.368
4.809TyrLeu: 4.809 ± 0.775
0.24TyrMet: 0.24 ± 0.133
1.683TyrAsn: 1.683 ± 0.668
2.645TyrPro: 2.645 ± 0.805
2.404TyrGln: 2.404 ± 0.192
1.924TyrArg: 1.924 ± 0.451
3.366TyrSer: 3.366 ± 1.185
1.683TyrThr: 1.683 ± 0.371
1.924TyrVal: 1.924 ± 0.553
0.481TyrTrp: 0.481 ± 0.267
1.443TyrTyr: 1.443 ± 0.505
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4160 amino acids)

Note: The error has been estimated by bootstrapping (100×) at the protein level.
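One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those values is reported as the error. This is an assumption about the procedure, not the database's actual implementation; the function names, the fixed random seed, and the use of the population standard deviation are all illustrative choices.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_frequencies(proteins):
    """Per mille frequency of each overlapping residue pair in a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequencies(sample).get(pair, 0.0))
    return pstdev(values)


# Toy input (hypothetical sequences, not the actual proteome).
toy = ["MKLLALLS", "GLLSTV", "MSLAAK"]
point = pair_frequencies(toy).get("LL", 0.0)
error = bootstrap_error(toy, "LL")
print(f"LL: {point:.3f} ± {error:.3f} per mille")
```

With only 3 proteins, as in this proteome, each resample contains few distinct proteins, so error estimates of this kind are expected to be coarse.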

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski