Amino acid dipeptide frequency for Shahe heteroptera virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
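
The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the one-letter sequence input, and the normalisation by the total number of dipeptides are assumptions made for this example, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes

# Expected frequency if all 400 dipeptides were equally likely: 2.5 per mille (0.25%).
EXPECTED = 1000 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides.

    Assumption: frequencies are normalised by the total number of overlapping
    dipeptides; pairs are counted within each protein and never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = (a + b for a, b in product(AMINO_ACIDS, repeat=2))
    return {p: (1000 * counts[p] / total if total else 0.0) for p in pairs}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"   # would be marked red in the table
    if freq < EXPECTED:
        return "under"  # would be marked blue in the table
    return "expected"
```

For example, `classify(2.899)` gives `"over"` and `classify(0.29)` gives `"under"`, since the uniform expectation is 2.5‰.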

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
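
As a small worked example of this unit conversion, the helpers below turn a per-mille table value into a plain fraction or a percentage. The function names are hypothetical and chosen only for this sketch.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value / 10
```

With these, the uniform expectation of 2.5‰ corresponds to 0.25%, and a table value of 2.899‰ corresponds to a fraction of about 0.002899.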

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.899AlaAla: 2.899 ± 3.3
1.74AlaCys: 1.74 ± 0.54
2.899AlaAsp: 2.899 ± 0.384
4.639AlaGlu: 4.639 ± 0.394
2.899AlaPhe: 2.899 ± 1.207
2.32AlaGly: 2.32 ± 1.234
0.29AlaHis: 0.29 ± 0.288
3.479AlaIle: 3.479 ± 1.228
4.349AlaLys: 4.349 ± 1.444
3.769AlaLeu: 3.769 ± 2.329
2.03AlaMet: 2.03 ± 1.119
1.74AlaAsn: 1.74 ± 0.909
1.74AlaPro: 1.74 ± 0.909
1.16AlaGln: 1.16 ± 0.539
3.189AlaArg: 3.189 ± 0.92
6.089AlaSer: 6.089 ± 2.148
4.349AlaThr: 4.349 ± 2.291
2.899AlaVal: 2.899 ± 0.945
0.29AlaTrp: 0.29 ± 0.156
0.87AlaTyr: 0.87 ± 1.352
0.0AlaXaa: 0.0 ± 0.0
Cys
1.74CysAla: 1.74 ± 0.453
0.0CysCys: 0.0 ± 0.0
2.609CysAsp: 2.609 ± 1.363
0.58CysGlu: 0.58 ± 0.576
1.45CysPhe: 1.45 ± 0.609
1.45CysGly: 1.45 ± 0.319
0.87CysHis: 0.87 ± 0.175
1.45CysIle: 1.45 ± 0.628
0.87CysLys: 0.87 ± 0.864
2.899CysLeu: 2.899 ± 0.959
0.29CysMet: 0.29 ± 0.288
1.74CysAsn: 1.74 ± 1.729
1.74CysPro: 1.74 ± 1.313
0.0CysGln: 0.0 ± 0.0
1.74CysArg: 1.74 ± 0.54
2.899CysSer: 2.899 ± 1.647
0.87CysThr: 0.87 ± 0.454
0.87CysVal: 0.87 ± 0.454
0.58CysTrp: 0.58 ± 0.576
1.16CysTyr: 1.16 ± 1.153
0.0CysXaa: 0.0 ± 0.0
Asp
1.74AspAla: 1.74 ± 0.478
2.03AspCys: 2.03 ± 0.632
4.349AspAsp: 4.349 ± 0.875
5.219AspGlu: 5.219 ± 2.046
2.32AspPhe: 2.32 ± 0.482
3.479AspGly: 3.479 ± 0.655
1.16AspHis: 1.16 ± 0.36
6.089AspIle: 6.089 ± 0.8
4.349AspLys: 4.349 ± 1.071
4.639AspLeu: 4.639 ± 0.801
0.58AspMet: 0.58 ± 0.313
2.03AspAsn: 2.03 ± 0.367
4.349AspPro: 4.349 ± 0.238
1.74AspGln: 1.74 ± 0.564
3.189AspArg: 3.189 ± 0.98
3.769AspSer: 3.769 ± 0.837
2.899AspThr: 2.899 ± 0.788
4.929AspVal: 4.929 ± 1.167
1.16AspTrp: 1.16 ± 0.279
2.899AspTyr: 2.899 ± 0.607
0.0AspXaa: 0.0 ± 0.0
Glu
2.609GluAla: 2.609 ± 0.307
0.58GluCys: 0.58 ± 0.18
3.189GluAsp: 3.189 ± 1.333
3.769GluGlu: 3.769 ± 0.348
4.059GluPhe: 4.059 ± 1.108
1.74GluGly: 1.74 ± 0.35
0.87GluHis: 0.87 ± 0.469
3.479GluIle: 3.479 ± 0.905
2.609GluLys: 2.609 ± 1.407
4.639GluLeu: 4.639 ± 1.523
1.45GluMet: 1.45 ± 0.319
1.74GluAsn: 1.74 ± 0.35
4.639GluPro: 4.639 ± 0.792
1.45GluGln: 1.45 ± 0.782
2.609GluArg: 2.609 ± 1.407
4.349GluSer: 4.349 ± 1.585
4.059GluThr: 4.059 ± 0.941
4.349GluVal: 4.349 ± 0.652
1.74GluTrp: 1.74 ± 0.564
1.74GluTyr: 1.74 ± 0.785
0.0GluXaa: 0.0 ± 0.0
Phe
2.32PheAla: 2.32 ± 0.3
1.45PheCys: 1.45 ± 1.441
0.87PheAsp: 0.87 ± 0.896
0.58PheGlu: 0.58 ± 0.313
2.899PhePhe: 2.899 ± 0.607
1.74PheGly: 1.74 ± 0.785
1.16PheHis: 1.16 ± 0.279
4.059PheIle: 4.059 ± 0.877
2.899PheLys: 2.899 ± 0.788
2.32PheLeu: 2.32 ± 0.3
2.03PheMet: 2.03 ± 1.14
2.609PheAsn: 2.609 ± 0.525
2.32PhePro: 2.32 ± 0.869
1.74PheGln: 1.74 ± 0.564
1.16PheArg: 1.16 ± 0.625
3.189PheSer: 3.189 ± 1.083
2.03PheThr: 2.03 ± 0.438
2.32PheVal: 2.32 ± 0.558
0.29PheTrp: 0.29 ± 0.156
1.16PheTyr: 1.16 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
2.609GlyAla: 2.609 ± 0.982
1.74GlyCys: 1.74 ± 1.313
2.899GlyAsp: 2.899 ± 1.256
2.609GlyGlu: 2.609 ± 0.661
2.899GlyPhe: 2.899 ± 1.071
2.899GlyGly: 2.899 ± 0.637
0.87GlyHis: 0.87 ± 0.175
3.189GlyIle: 3.189 ± 1.1
3.189GlyLys: 3.189 ± 0.389
3.769GlyLeu: 3.769 ± 0.969
0.58GlyMet: 0.58 ± 0.313
2.03GlyAsn: 2.03 ± 0.804
0.29GlyPro: 0.29 ± 0.156
1.45GlyGln: 1.45 ± 0.628
2.609GlyArg: 2.609 ± 0.525
5.219GlySer: 5.219 ± 2.356
2.32GlyThr: 2.32 ± 1.081
4.929GlyVal: 4.929 ± 0.24
1.16GlyTrp: 1.16 ± 0.739
2.609GlyTyr: 2.609 ± 0.307
0.0GlyXaa: 0.0 ± 0.0
His
1.16HisAla: 1.16 ± 0.62
0.58HisCys: 0.58 ± 0.18
0.58HisAsp: 0.58 ± 0.313
1.16HisGlu: 1.16 ± 0.539
0.58HisPhe: 0.58 ± 0.313
1.16HisGly: 1.16 ± 0.739
0.29HisHis: 0.29 ± 0.156
2.32HisIle: 2.32 ± 0.72
2.03HisLys: 2.03 ± 0.438
2.03HisLeu: 2.03 ± 0.438
0.29HisMet: 0.29 ± 0.156
0.0HisAsn: 0.0 ± 0.0
0.58HisPro: 0.58 ± 0.646
1.16HisGln: 1.16 ± 0.539
1.16HisArg: 1.16 ± 0.625
1.16HisSer: 1.16 ± 0.279
1.45HisThr: 1.45 ± 1.026
1.16HisVal: 1.16 ± 0.279
0.87HisTrp: 0.87 ± 0.469
2.32HisTyr: 2.32 ± 0.482
0.0HisXaa: 0.0 ± 0.0
Ile
4.929IleAla: 4.929 ± 2.064
2.32IleCys: 2.32 ± 1.478
4.059IleAsp: 4.059 ± 0.822
4.639IleGlu: 4.639 ± 0.394
1.45IlePhe: 1.45 ± 0.319
3.479IleGly: 3.479 ± 0.803
2.609IleHis: 2.609 ± 1.113
7.828IleIle: 7.828 ± 3.046
5.509IleLys: 5.509 ± 2.201
9.278IleLeu: 9.278 ± 1.146
1.74IleMet: 1.74 ± 0.478
2.609IleAsn: 2.609 ± 0.333
2.899IlePro: 2.899 ± 0.2
2.899IleGln: 2.899 ± 1.217
2.609IleArg: 2.609 ± 0.525
9.278IleSer: 9.278 ± 0.62
5.799IleThr: 5.799 ± 2.094
4.349IleVal: 4.349 ± 1.071
1.16IleTrp: 1.16 ± 0.36
2.03IleTyr: 2.03 ± 0.716
0.0IleXaa: 0.0 ± 0.0
Lys
4.059LysAla: 4.059 ± 1.675
0.29LysCys: 0.29 ± 0.288
3.769LysAsp: 3.769 ± 0.348
3.189LysGlu: 3.189 ± 1.333
2.03LysPhe: 2.03 ± 0.487
2.899LysGly: 2.899 ± 0.607
0.58LysHis: 0.58 ± 0.313
7.538LysIle: 7.538 ± 1.143
3.769LysLys: 3.769 ± 0.775
7.538LysLeu: 7.538 ± 2.357
2.03LysMet: 2.03 ± 0.536
2.609LysAsn: 2.609 ± 1.841
3.479LysPro: 3.479 ± 0.633
2.899LysGln: 2.899 ± 0.9
2.32LysArg: 2.32 ± 0.48
6.089LysSer: 6.089 ± 2.188
6.959LysThr: 6.959 ± 0.678
5.219LysVal: 5.219 ± 1.323
0.87LysTrp: 0.87 ± 0.638
3.189LysTyr: 3.189 ± 1.1
0.0LysXaa: 0.0 ± 0.0
Leu
5.219LeuAla: 5.219 ± 2.128
2.03LeuCys: 2.03 ± 0.716
6.379LeuAsp: 6.379 ± 1.733
5.219LeuGlu: 5.219 ± 0.706
2.609LeuPhe: 2.609 ± 0.666
4.349LeuGly: 4.349 ± 1.037
2.03LeuHis: 2.03 ± 0.487
7.538LeuIle: 7.538 ± 1.014
6.379LeuLys: 6.379 ± 0.953
5.799LeuLeu: 5.799 ± 0.411
2.609LeuMet: 2.609 ± 0.307
6.089LeuAsn: 6.089 ± 1.225
3.189LeuPro: 3.189 ± 1.231
4.349LeuGln: 4.349 ± 0.647
6.379LeuArg: 6.379 ± 2.264
5.219LeuSer: 5.219 ± 1.244
6.379LeuThr: 6.379 ± 0.523
4.059LeuVal: 4.059 ± 0.974
1.16LeuTrp: 1.16 ± 0.625
3.479LeuTyr: 3.479 ± 0.633
0.0LeuXaa: 0.0 ± 0.0
Met
1.16MetAla: 1.16 ± 0.739
0.87MetCys: 0.87 ± 0.614
0.87MetAsp: 0.87 ± 0.175
1.45MetGlu: 1.45 ± 0.782
0.29MetPhe: 0.29 ± 0.713
1.74MetGly: 1.74 ± 1.171
0.58MetHis: 0.58 ± 0.313
2.03MetIle: 2.03 ± 0.485
2.32MetLys: 2.32 ± 1.081
3.189MetLeu: 3.189 ± 0.912
1.74MetMet: 1.74 ± 0.453
0.87MetAsn: 0.87 ± 0.614
0.58MetPro: 0.58 ± 0.18
1.16MetGln: 1.16 ± 0.279
0.87MetArg: 0.87 ± 0.614
2.32MetSer: 2.32 ± 0.561
1.45MetThr: 1.45 ± 0.319
2.32MetVal: 2.32 ± 1.081
0.0MetTrp: 0.0 ± 0.0
0.58MetTyr: 0.58 ± 0.313
0.0MetXaa: 0.0 ± 0.0
Asn
2.609AsnAla: 2.609 ± 0.702
2.32AsnCys: 2.32 ± 1.081
2.03AsnAsp: 2.03 ± 0.716
1.16AsnGlu: 1.16 ± 0.279
1.74AsnPhe: 1.74 ± 0.564
1.74AsnGly: 1.74 ± 1.313
1.16AsnHis: 1.16 ± 0.539
2.609AsnIle: 2.609 ± 0.661
2.609AsnLys: 2.609 ± 0.307
4.929AsnLeu: 4.929 ± 1.044
0.87AsnMet: 0.87 ± 0.638
2.32AsnAsn: 2.32 ± 0.558
2.609AsnPro: 2.609 ± 0.692
2.03AsnGln: 2.03 ± 0.438
1.16AsnArg: 1.16 ± 0.625
2.609AsnSer: 2.609 ± 1.002
1.74AsnThr: 1.74 ± 0.35
2.32AsnVal: 2.32 ± 0.482
0.87AsnTrp: 0.87 ± 0.469
1.45AsnTyr: 1.45 ± 0.319
0.0AsnXaa: 0.0 ± 0.0
Pro
1.45ProAla: 1.45 ± 0.473
0.58ProCys: 0.58 ± 0.18
3.189ProAsp: 3.189 ± 0.912
3.769ProGlu: 3.769 ± 1.28
2.03ProPhe: 2.03 ± 0.367
3.189ProGly: 3.189 ± 0.838
0.87ProHis: 0.87 ± 0.175
2.609ProIle: 2.609 ± 0.525
2.899ProLys: 2.899 ± 1.217
3.769ProLeu: 3.769 ± 0.778
0.87ProMet: 0.87 ± 0.175
1.16ProAsn: 1.16 ± 0.279
0.87ProPro: 0.87 ± 0.614
2.32ProGln: 2.32 ± 1.251
1.74ProArg: 1.74 ± 0.785
2.899ProSer: 2.899 ± 0.2
3.479ProThr: 3.479 ± 0.827
2.03ProVal: 2.03 ± 0.632
0.58ProTrp: 0.58 ± 0.313
1.45ProTyr: 1.45 ± 0.664
0.0ProXaa: 0.0 ± 0.0
Gln
0.87GlnAla: 0.87 ± 0.469
1.16GlnCys: 1.16 ± 1.153
1.45GlnAsp: 1.45 ± 0.417
2.03GlnGlu: 2.03 ± 0.716
1.45GlnPhe: 1.45 ± 0.473
2.609GlnGly: 2.609 ± 0.307
0.58GlnHis: 0.58 ± 0.313
2.899GlnIle: 2.899 ± 0.607
2.899GlnLys: 2.899 ± 0.945
3.189GlnLeu: 3.189 ± 1.345
1.45GlnMet: 1.45 ± 0.609
1.45GlnAsn: 1.45 ± 0.417
0.87GlnPro: 0.87 ± 0.175
0.29GlnGln: 0.29 ± 0.288
1.74GlnArg: 1.74 ± 0.739
1.74GlnSer: 1.74 ± 0.35
1.45GlnThr: 1.45 ± 0.628
2.609GlnVal: 2.609 ± 1.023
0.58GlnTrp: 0.58 ± 0.18
0.87GlnTyr: 0.87 ± 0.454
0.0GlnXaa: 0.0 ± 0.0
Arg
3.189ArgAla: 3.189 ± 0.912
0.0ArgCys: 0.0 ± 0.0
3.479ArgAsp: 3.479 ± 1.488
2.32ArgGlu: 2.32 ± 0.561
2.03ArgPhe: 2.03 ± 0.804
1.74ArgGly: 1.74 ± 0.478
0.87ArgHis: 0.87 ± 0.175
4.929ArgIle: 4.929 ± 0.956
2.609ArgLys: 2.609 ± 1.023
4.929ArgLeu: 4.929 ± 2.267
1.16ArgMet: 1.16 ± 0.62
1.45ArgAsn: 1.45 ± 0.319
1.74ArgPro: 1.74 ± 0.739
0.87ArgGln: 0.87 ± 0.469
3.189ArgArg: 3.189 ± 0.389
5.509ArgSer: 5.509 ± 0.351
2.03ArgThr: 2.03 ± 0.367
3.189ArgVal: 3.189 ± 0.5
0.58ArgTrp: 0.58 ± 0.313
1.45ArgTyr: 1.45 ± 0.417
0.0ArgXaa: 0.0 ± 0.0
Ser
4.349SerAla: 4.349 ± 0.754
2.32SerCys: 2.32 ± 2.108
7.828SerAsp: 7.828 ± 2.371
4.639SerGlu: 4.639 ± 1.38
2.899SerPhe: 2.899 ± 0.9
3.479SerGly: 3.479 ± 0.234
1.16SerHis: 1.16 ± 0.36
6.959SerIle: 6.959 ± 1.225
6.379SerLys: 6.379 ± 0.779
8.698SerLeu: 8.698 ± 0.8
2.32SerMet: 2.32 ± 0.482
3.479SerAsn: 3.479 ± 1.488
2.609SerPro: 2.609 ± 0.333
2.32SerGln: 2.32 ± 0.3
4.059SerArg: 4.059 ± 1.348
6.089SerSer: 6.089 ± 0.779
6.089SerThr: 6.089 ± 1.205
2.899SerVal: 2.899 ± 0.637
0.87SerTrp: 0.87 ± 0.454
1.74SerTyr: 1.74 ± 0.909
0.0SerXaa: 0.0 ± 0.0
Thr
3.479ThrAla: 3.479 ± 1.228
2.609ThrCys: 2.609 ± 2.593
5.219ThrAsp: 5.219 ± 0.706
3.769ThrGlu: 3.769 ± 1.538
1.16ThrPhe: 1.16 ± 0.625
3.769ThrGly: 3.769 ± 1.016
2.609ThrHis: 2.609 ± 1.023
5.509ThrIle: 5.509 ± 1.374
5.509ThrLys: 5.509 ± 0.6
4.639ThrLeu: 4.639 ± 0.527
2.899ThrMet: 2.899 ± 0.545
2.32ThrAsn: 2.32 ± 0.869
2.32ThrPro: 2.32 ± 0.951
1.74ThrGln: 1.74 ± 0.564
2.899ThrArg: 2.899 ± 0.959
5.509ThrSer: 5.509 ± 2.315
5.799ThrThr: 5.799 ± 1.274
2.32ThrVal: 2.32 ± 1.478
1.16ThrTrp: 1.16 ± 0.62
2.03ThrTyr: 2.03 ± 0.438
0.0ThrXaa: 0.0 ± 0.0
Val
3.479ValAla: 3.479 ± 0.955
0.87ValCys: 0.87 ± 0.175
4.059ValAsp: 4.059 ± 0.877
2.899ValGlu: 2.899 ± 0.384
2.32ValPhe: 2.32 ± 0.858
2.609ValGly: 2.609 ± 0.661
1.45ValHis: 1.45 ± 0.628
4.639ValIle: 4.639 ± 0.963
6.089ValLys: 6.089 ± 1.152
6.959ValLeu: 6.959 ± 0.128
0.58ValMet: 0.58 ± 0.313
2.03ValAsn: 2.03 ± 1.193
2.32ValPro: 2.32 ± 0.869
1.74ValGln: 1.74 ± 0.35
2.899ValArg: 2.899 ± 0.384
4.059ValSer: 4.059 ± 0.877
3.769ValThr: 3.769 ± 1.838
4.349ValVal: 4.349 ± 0.654
1.45ValTrp: 1.45 ± 0.782
1.74ValTyr: 1.74 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
1.74TrpAla: 1.74 ± 0.478
1.16TrpCys: 1.16 ± 0.36
1.45TrpAsp: 1.45 ± 0.664
0.87TrpGlu: 0.87 ± 0.469
0.87TrpPhe: 0.87 ± 0.454
0.87TrpGly: 0.87 ± 0.469
0.0TrpHis: 0.0 ± 0.0
0.87TrpIle: 0.87 ± 0.175
1.16TrpLys: 1.16 ± 0.279
0.87TrpLeu: 0.87 ± 0.175
0.58TrpMet: 0.58 ± 0.186
0.58TrpAsn: 0.58 ± 0.313
0.0TrpPro: 0.0 ± 0.0
0.29TrpGln: 0.29 ± 0.156
0.58TrpArg: 0.58 ± 0.313
1.45TrpSer: 1.45 ± 0.417
1.16TrpThr: 1.16 ± 0.36
1.16TrpVal: 1.16 ± 0.279
0.0TrpTrp: 0.0 ± 0.0
0.29TrpTyr: 0.29 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.03TyrAla: 2.03 ± 0.716
1.16TyrCys: 1.16 ± 0.36
1.74TyrAsp: 1.74 ± 0.35
1.16TyrGlu: 1.16 ± 0.62
1.16TyrPhe: 1.16 ± 0.279
2.609TyrGly: 2.609 ± 0.661
2.03TyrHis: 2.03 ± 0.367
1.45TyrIle: 1.45 ± 0.628
3.189TyrLys: 3.189 ± 0.389
2.609TyrLeu: 2.609 ± 0.307
0.0TyrMet: 0.0 ± 0.0
2.03TyrAsn: 2.03 ± 0.716
2.609TyrPro: 2.609 ± 1.113
0.58TyrGln: 0.58 ± 0.646
1.16TyrArg: 1.16 ± 0.36
1.74TyrSer: 1.74 ± 0.35
3.189TyrThr: 3.189 ± 0.98
2.03TyrVal: 2.03 ± 0.485
0.58TyrTrp: 0.58 ± 0.18
1.45TyrTyr: 1.45 ± 0.628
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3450 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
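
A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the frequency is recomputed on each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation over the resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)
```

With only 3 proteins, as here, each resample often omits one or two of them entirely, which is one plausible reason why some errors in the table are as large as the values themselves.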

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski