Amino acid dipepetide frequency for Shayang Spider Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.44AlaAla: 3.44 ± 3.476
1.251AlaCys: 1.251 ± 0.36
1.407AlaAsp: 1.407 ± 0.645
3.596AlaGlu: 3.596 ± 1.692
2.345AlaPhe: 2.345 ± 0.405
2.033AlaGly: 2.033 ± 2.104
0.469AlaHis: 0.469 ± 0.32
3.596AlaIle: 3.596 ± 0.401
3.283AlaLys: 3.283 ± 1.078
4.534AlaLeu: 4.534 ± 0.814
1.563AlaMet: 1.563 ± 0.754
2.658AlaAsn: 2.658 ± 1.563
2.345AlaPro: 2.345 ± 3.393
1.72AlaGln: 1.72 ± 1.288
1.72AlaArg: 1.72 ± 0.865
3.127AlaSer: 3.127 ± 0.546
1.563AlaThr: 1.563 ± 0.476
1.407AlaVal: 1.407 ± 0.645
0.625AlaTrp: 0.625 ± 0.279
1.407AlaTyr: 1.407 ± 0.477
0.0AlaXaa: 0.0 ± 0.0
Cys
0.782CysAla: 0.782 ± 0.69
0.625CysCys: 0.625 ± 0.294
0.156CysAsp: 0.156 ± 0.077
1.251CysGlu: 1.251 ± 0.559
1.407CysPhe: 1.407 ± 0.53
1.094CysGly: 1.094 ± 1.062
0.469CysHis: 0.469 ± 0.558
1.876CysIle: 1.876 ± 0.648
1.251CysLys: 1.251 ± 0.777
1.72CysLeu: 1.72 ± 0.106
0.313CysMet: 0.313 ± 0.351
0.625CysAsn: 0.625 ± 0.279
1.407CysPro: 1.407 ± 0.53
0.625CysGln: 0.625 ± 0.157
0.782CysArg: 0.782 ± 0.373
1.251CysSer: 1.251 ± 0.559
0.625CysThr: 0.625 ± 0.505
0.625CysVal: 0.625 ± 0.307
0.469CysTrp: 0.469 ± 0.32
0.625CysTyr: 0.625 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
2.814AspAla: 2.814 ± 0.611
1.094AspCys: 1.094 ± 0.275
3.596AspAsp: 3.596 ± 1.187
4.534AspGlu: 4.534 ± 1.211
5.159AspPhe: 5.159 ± 1.03
2.345AspGly: 2.345 ± 1.538
1.094AspHis: 1.094 ± 0.673
4.065AspIle: 4.065 ± 0.445
4.534AspLys: 4.534 ± 0.829
5.159AspLeu: 5.159 ± 0.618
1.407AspMet: 1.407 ± 0.976
2.189AspAsn: 2.189 ± 0.455
1.094AspPro: 1.094 ± 0.18
0.938AspGln: 0.938 ± 0.196
2.814AspArg: 2.814 ± 0.422
5.785AspSer: 5.785 ± 1.517
1.876AspThr: 1.876 ± 0.393
3.44AspVal: 3.44 ± 0.458
1.251AspTrp: 1.251 ± 0.379
2.189AspTyr: 2.189 ± 0.576
0.0AspXaa: 0.0 ± 0.0
Glu
3.127GluAla: 3.127 ± 0.112
1.094GluCys: 1.094 ± 0.222
6.098GluAsp: 6.098 ± 0.363
7.505GluGlu: 7.505 ± 1.563
2.658GluPhe: 2.658 ± 0.346
3.752GluGly: 3.752 ± 0.352
1.094GluHis: 1.094 ± 0.222
6.723GluIle: 6.723 ± 1.449
5.785GluLys: 5.785 ± 0.792
9.225GluLeu: 9.225 ± 0.338
2.345GluMet: 2.345 ± 0.767
2.971GluAsn: 2.971 ± 0.787
1.563GluPro: 1.563 ± 0.922
3.44GluGln: 3.44 ± 0.461
1.72GluArg: 1.72 ± 0.43
6.567GluSer: 6.567 ± 1.382
4.847GluThr: 4.847 ± 0.47
4.534GluVal: 4.534 ± 1.262
0.625GluTrp: 0.625 ± 0.307
2.345GluTyr: 2.345 ± 0.371
0.0GluXaa: 0.0 ± 0.0
Phe
1.094PheAla: 1.094 ± 0.348
0.469PheCys: 0.469 ± 0.23
3.283PheAsp: 3.283 ± 0.826
3.127PheGlu: 3.127 ± 0.456
2.502PhePhe: 2.502 ± 1.561
3.283PheGly: 3.283 ± 0.52
0.782PheHis: 0.782 ± 0.312
4.221PheIle: 4.221 ± 1.073
5.472PheLys: 5.472 ± 1.484
3.127PheLeu: 3.127 ± 0.972
0.938PheMet: 0.938 ± 0.297
3.283PheAsn: 3.283 ± 1.088
0.313PhePro: 0.313 ± 0.351
1.876PheGln: 1.876 ± 0.177
1.876PheArg: 1.876 ± 0.573
3.909PheSer: 3.909 ± 0.548
1.72PheThr: 1.72 ± 1.099
2.345PheVal: 2.345 ± 0.793
0.156PheTrp: 0.156 ± 0.395
1.251PheTyr: 1.251 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
2.814GlyAla: 2.814 ± 4.077
0.782GlyCys: 0.782 ± 0.931
1.251GlyAsp: 1.251 ± 0.36
1.876GlyGlu: 1.876 ± 1.069
2.971GlyPhe: 2.971 ± 0.475
2.189GlyGly: 2.189 ± 1.426
1.094GlyHis: 1.094 ± 0.824
3.909GlyIle: 3.909 ± 1.19
4.065GlyLys: 4.065 ± 1.015
3.909GlyLeu: 3.909 ± 1.602
0.938GlyMet: 0.938 ± 0.254
2.814GlyAsn: 2.814 ± 0.666
0.782GlyPro: 0.782 ± 0.256
1.72GlyGln: 1.72 ± 0.142
2.033GlyArg: 2.033 ± 1.087
3.596GlySer: 3.596 ± 0.753
2.658GlyThr: 2.658 ± 0.774
3.283GlyVal: 3.283 ± 1.078
0.313GlyTrp: 0.313 ± 0.154
2.502GlyTyr: 2.502 ± 0.492
0.0GlyXaa: 0.0 ± 0.0
His
0.782HisAla: 0.782 ± 0.457
0.469HisCys: 0.469 ± 0.32
0.469HisAsp: 0.469 ± 0.127
0.625HisGlu: 0.625 ± 0.294
0.469HisPhe: 0.469 ± 0.32
0.782HisGly: 0.782 ± 0.69
0.782HisHis: 0.782 ± 0.69
1.72HisIle: 1.72 ± 0.505
0.938HisLys: 0.938 ± 0.419
1.72HisLeu: 1.72 ± 0.886
0.625HisMet: 0.625 ± 0.294
0.782HisAsn: 0.782 ± 0.212
0.625HisPro: 0.625 ± 0.157
0.625HisGln: 0.625 ± 0.294
1.094HisArg: 1.094 ± 0.392
1.094HisSer: 1.094 ± 0.673
1.407HisThr: 1.407 ± 0.53
1.094HisVal: 1.094 ± 0.18
0.313HisTrp: 0.313 ± 0.154
0.625HisTyr: 0.625 ± 0.449
0.0HisXaa: 0.0 ± 0.0
Ile
2.658IleAla: 2.658 ± 1.563
1.251IleCys: 1.251 ± 0.559
5.472IleAsp: 5.472 ± 0.217
6.879IleGlu: 6.879 ± 1.131
2.971IlePhe: 2.971 ± 0.787
2.814IleGly: 2.814 ± 0.164
1.251IleHis: 1.251 ± 0.36
5.159IleIle: 5.159 ± 0.472
7.817IleLys: 7.817 ± 1.523
5.785IleLeu: 5.785 ± 1.539
2.033IleMet: 2.033 ± 0.475
4.065IleAsn: 4.065 ± 0.742
2.189IlePro: 2.189 ± 0.225
1.876IleGln: 1.876 ± 0.326
4.69IleArg: 4.69 ± 0.386
8.755IleSer: 8.755 ± 2.545
3.127IleThr: 3.127 ± 1.155
4.065IleVal: 4.065 ± 0.659
0.782IleTrp: 0.782 ± 0.457
2.658IleTyr: 2.658 ± 0.346
0.0IleXaa: 0.0 ± 0.0
Lys
4.69LysAla: 4.69 ± 1.282
0.938LysCys: 0.938 ± 0.64
5.159LysAsp: 5.159 ± 1.303
8.599LysGlu: 8.599 ± 2.004
4.221LysPhe: 4.221 ± 1.073
4.534LysGly: 4.534 ± 0.894
1.72LysHis: 1.72 ± 0.505
5.159LysIle: 5.159 ± 0.862
7.505LysLys: 7.505 ± 1.506
8.286LysLeu: 8.286 ± 1.647
2.189LysMet: 2.189 ± 0.326
3.127LysAsn: 3.127 ± 0.423
3.283LysPro: 3.283 ± 0.396
2.658LysGln: 2.658 ± 1.099
4.378LysArg: 4.378 ± 0.26
5.159LysSer: 5.159 ± 1.368
6.254LysThr: 6.254 ± 1.709
5.629LysVal: 5.629 ± 1.106
1.251LysTrp: 1.251 ± 0.379
3.44LysTyr: 3.44 ± 0.439
0.0LysXaa: 0.0 ± 0.0
Leu
3.752LeuAla: 3.752 ± 0.729
0.938LeuCys: 0.938 ± 0.461
5.472LeuAsp: 5.472 ± 1.046
5.785LeuGlu: 5.785 ± 1.479
2.345LeuPhe: 2.345 ± 0.402
4.69LeuGly: 4.69 ± 1.374
1.72LeuHis: 1.72 ± 0.505
9.225LeuIle: 9.225 ± 0.862
9.068LeuLys: 9.068 ± 2.525
9.381LeuLeu: 9.381 ± 1.967
2.502LeuMet: 2.502 ± 0.689
7.348LeuAsn: 7.348 ± 0.599
3.752LeuPro: 3.752 ± 0.124
3.44LeuGln: 3.44 ± 0.86
4.221LeuArg: 4.221 ± 0.725
8.755LeuSer: 8.755 ± 1.417
6.254LeuThr: 6.254 ± 0.79
4.69LeuVal: 4.69 ± 1.339
0.469LeuTrp: 0.469 ± 0.321
2.033LeuTyr: 2.033 ± 0.504
0.0LeuXaa: 0.0 ± 0.0
Met
0.625MetAla: 0.625 ± 0.307
0.469MetCys: 0.469 ± 0.127
1.72MetAsp: 1.72 ± 0.142
0.938MetGlu: 0.938 ± 0.335
1.094MetPhe: 1.094 ± 0.18
1.251MetGly: 1.251 ± 0.42
0.625MetHis: 0.625 ± 0.279
1.876MetIle: 1.876 ± 0.882
2.502MetLys: 2.502 ± 0.712
2.033MetLeu: 2.033 ± 0.504
0.469MetMet: 0.469 ± 0.127
1.876MetAsn: 1.876 ± 0.393
0.938MetPro: 0.938 ± 0.641
0.938MetGln: 0.938 ± 0.632
0.625MetArg: 0.625 ± 0.307
3.752MetSer: 3.752 ± 0.124
1.563MetThr: 1.563 ± 0.056
0.938MetVal: 0.938 ± 0.277
0.0MetTrp: 0.0 ± 0.0
1.094MetTyr: 1.094 ± 0.596
0.0MetXaa: 0.0 ± 0.0
Asn
2.345AsnAla: 2.345 ± 1.131
1.407AsnCys: 1.407 ± 0.53
2.814AsnAsp: 2.814 ± 0.892
3.909AsnGlu: 3.909 ± 1.258
2.658AsnPhe: 2.658 ± 0.346
2.814AsnGly: 2.814 ± 1.455
0.625AsnHis: 0.625 ± 0.307
3.752AsnIle: 3.752 ± 0.508
5.472AsnLys: 5.472 ± 0.768
7.817AsnLeu: 7.817 ± 1.413
1.563AsnMet: 1.563 ± 0.768
4.69AsnAsn: 4.69 ± 0.546
1.72AsnPro: 1.72 ± 0.43
2.033AsnGln: 2.033 ± 0.704
2.814AsnArg: 2.814 ± 1.033
4.221AsnSer: 4.221 ± 0.915
2.971AsnThr: 2.971 ± 1.009
2.814AsnVal: 2.814 ± 1.154
0.625AsnTrp: 0.625 ± 0.157
2.189AsnTyr: 2.189 ± 0.576
0.0AsnXaa: 0.0 ± 0.0
Pro
2.033ProAla: 2.033 ± 2.549
0.0ProCys: 0.0 ± 0.0
2.814ProAsp: 2.814 ± 0.89
2.658ProGlu: 2.658 ± 0.425
1.407ProPhe: 1.407 ± 0.477
1.094ProGly: 1.094 ± 1.027
0.469ProHis: 0.469 ± 0.127
2.189ProIle: 2.189 ± 0.565
1.876ProLys: 1.876 ± 0.085
2.971ProLeu: 2.971 ± 0.523
0.469ProMet: 0.469 ± 0.23
2.033ProAsn: 2.033 ± 0.822
0.313ProPro: 0.313 ± 0.789
0.625ProGln: 0.625 ± 0.307
0.782ProArg: 0.782 ± 0.312
2.814ProSer: 2.814 ± 0.164
1.094ProThr: 1.094 ± 0.222
1.876ProVal: 1.876 ± 0.882
0.156ProTrp: 0.156 ± 0.077
1.563ProTyr: 1.563 ± 0.458
0.0ProXaa: 0.0 ± 0.0
Gln
2.345GlnAla: 2.345 ± 1.99
0.625GlnCys: 0.625 ± 0.279
1.876GlnAsp: 1.876 ± 0.393
2.189GlnGlu: 2.189 ± 0.565
1.407GlnPhe: 1.407 ± 0.082
1.563GlnGly: 1.563 ± 0.476
0.625GlnHis: 0.625 ± 0.294
2.971GlnIle: 2.971 ± 1.062
3.127GlnLys: 3.127 ± 0.972
3.127GlnLeu: 3.127 ± 1.137
1.094GlnMet: 1.094 ± 0.586
2.033GlnAsn: 2.033 ± 0.079
0.313GlnPro: 0.313 ± 0.154
1.407GlnGln: 1.407 ± 0.477
1.094GlnArg: 1.094 ± 1.018
2.345GlnSer: 2.345 ± 0.636
1.72GlnThr: 1.72 ± 0.43
1.407GlnVal: 1.407 ± 0.082
0.156GlnTrp: 0.156 ± 0.077
1.094GlnTyr: 1.094 ± 0.65
0.0GlnXaa: 0.0 ± 0.0
Arg
1.251ArgAla: 1.251 ± 1.034
0.469ArgCys: 0.469 ± 0.127
2.814ArgAsp: 2.814 ± 0.953
3.283ArgGlu: 3.283 ± 0.847
1.563ArgPhe: 1.563 ± 0.424
1.251ArgGly: 1.251 ± 0.379
1.094ArgHis: 1.094 ± 0.392
2.189ArgIle: 2.189 ± 0.326
4.534ArgLys: 4.534 ± 1.378
3.752ArgLeu: 3.752 ± 1.092
0.625ArgMet: 0.625 ± 0.307
4.378ArgAsn: 4.378 ± 3.639
2.033ArgPro: 2.033 ± 0.78
1.876ArgGln: 1.876 ± 1.686
2.814ArgArg: 2.814 ± 1.058
4.534ArgSer: 4.534 ± 0.894
2.502ArgThr: 2.502 ± 1.227
1.407ArgVal: 1.407 ± 0.285
0.313ArgTrp: 0.313 ± 0.351
2.033ArgTyr: 2.033 ± 0.079
0.0ArgXaa: 0.0 ± 0.0
Ser
3.909SerAla: 3.909 ± 0.618
2.033SerCys: 2.033 ± 0.808
6.254SerAsp: 6.254 ± 1.252
7.661SerGlu: 7.661 ± 1.865
2.658SerPhe: 2.658 ± 0.765
3.127SerGly: 3.127 ± 0.644
1.094SerHis: 1.094 ± 0.435
5.003SerIle: 5.003 ± 1.425
8.912SerLys: 8.912 ± 2.704
8.13SerLeu: 8.13 ± 1.369
2.502SerMet: 2.502 ± 0.298
4.69SerAsn: 4.69 ± 0.397
2.658SerPro: 2.658 ± 0.555
2.502SerGln: 2.502 ± 0.84
4.847SerArg: 4.847 ± 0.594
8.912SerSer: 8.912 ± 2.226
3.909SerThr: 3.909 ± 1.334
5.316SerVal: 5.316 ± 0.537
1.251SerTrp: 1.251 ± 0.777
3.44SerTyr: 3.44 ± 0.726
0.0SerXaa: 0.0 ± 0.0
Thr
2.189ThrAla: 2.189 ± 1.601
1.407ThrCys: 1.407 ± 1.195
3.127ThrAsp: 3.127 ± 0.185
4.847ThrGlu: 4.847 ± 1.039
2.502ThrPhe: 2.502 ± 0.732
2.033ThrGly: 2.033 ± 0.758
0.313ThrHis: 0.313 ± 0.372
5.472ThrIle: 5.472 ± 0.657
3.752ThrLys: 3.752 ± 0.692
5.785ThrLeu: 5.785 ± 0.344
1.876ThrMet: 1.876 ± 0.555
3.127ThrAsn: 3.127 ± 1.38
0.938ThrPro: 0.938 ± 0.277
1.563ThrGln: 1.563 ± 0.424
2.189ThrArg: 2.189 ± 0.319
4.847ThrSer: 4.847 ± 0.441
4.847ThrThr: 4.847 ± 0.441
3.44ThrVal: 3.44 ± 0.592
0.469ThrTrp: 0.469 ± 0.358
1.094ThrTyr: 1.094 ± 0.435
0.0ThrXaa: 0.0 ± 0.0
Val
2.189ValAla: 2.189 ± 0.225
1.094ValCys: 1.094 ± 0.392
1.72ValAsp: 1.72 ± 0.669
5.629ValGlu: 5.629 ± 1.013
2.658ValPhe: 2.658 ± 1.061
2.814ValGly: 2.814 ± 1.458
0.938ValHis: 0.938 ± 0.419
3.596ValIle: 3.596 ± 0.528
4.065ValLys: 4.065 ± 1.245
5.316ValLeu: 5.316 ± 0.827
0.782ValMet: 0.782 ± 0.212
3.127ValAsn: 3.127 ± 1.248
1.563ValPro: 1.563 ± 0.623
1.563ValGln: 1.563 ± 0.476
2.345ValArg: 2.345 ± 0.713
5.629ValSer: 5.629 ± 1.045
4.065ValThr: 4.065 ± 0.539
3.283ValVal: 3.283 ± 0.82
0.156ValTrp: 0.156 ± 0.077
1.563ValTyr: 1.563 ± 0.212
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.313TrpCys: 0.313 ± 0.426
0.313TrpAsp: 0.313 ± 0.14
0.938TrpGlu: 0.938 ± 0.876
0.469TrpPhe: 0.469 ± 0.32
0.469TrpGly: 0.469 ± 0.32
0.313TrpHis: 0.313 ± 0.351
0.938TrpIle: 0.938 ± 0.277
1.094TrpLys: 1.094 ± 0.372
0.938TrpLeu: 0.938 ± 1.156
0.156TrpMet: 0.156 ± 0.186
0.469TrpAsn: 0.469 ± 0.127
0.469TrpPro: 0.469 ± 0.127
0.0TrpGln: 0.0 ± 0.0
0.469TrpArg: 0.469 ± 0.23
1.094TrpSer: 1.094 ± 0.348
0.938TrpThr: 0.938 ± 0.419
0.469TrpVal: 0.469 ± 0.23
0.313TrpTrp: 0.313 ± 0.14
0.156TrpTyr: 0.156 ± 0.077
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.407TyrAla: 1.407 ± 0.529
1.251TyrCys: 1.251 ± 0.845
1.563TyrAsp: 1.563 ± 0.212
2.189TyrGlu: 2.189 ± 0.871
1.563TyrPhe: 1.563 ± 0.212
1.563TyrGly: 1.563 ± 0.575
0.469TyrHis: 0.469 ± 0.32
2.658TyrIle: 2.658 ± 0.674
3.127TyrLys: 3.127 ± 0.785
3.127TyrLeu: 3.127 ± 0.112
0.782TyrMet: 0.782 ± 0.668
2.971TyrAsn: 2.971 ± 0.708
0.938TyrPro: 0.938 ± 0.196
1.094TyrGln: 1.094 ± 0.596
1.407TyrArg: 1.407 ± 0.082
2.658TyrSer: 2.658 ± 1.099
1.876TyrThr: 1.876 ± 0.799
2.033TyrVal: 2.033 ± 0.52
0.625TyrTrp: 0.625 ± 0.294
1.251TyrTyr: 1.251 ± 0.314
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (6397 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski