Amino acid dipeptide frequency for Hubei diptera virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteins.
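As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues in each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter codes, and the toy sequences are assumptions made for this example; the page does not state the exact normalization used by Proteome-pI.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides found across all proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKLLSA", "MSLL"])
print(freqs["LL"])
```

The two toy sequences contain 8 dipeptides in total, 2 of which are LeuLeu ("LL").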

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
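The expected value under the uniform assumption is simple arithmetic: one over the number of possible dipeptides. The sketch below computes it in per mille and labels an observed value as over- or underrepresented. The helper names are hypothetical, and whether the page's colouring uses 400 or 441 as the denominator, or takes the error estimate into account, is not stated, so the threshold here is an assumption.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

print(expected_uniform_permille())  # 20 amino acids: 1000 / 400
print(classify(12.437))             # SerLeu value from the table
print(classify(0.265))              # AlaHis value from the table
```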

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
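For readers who want to work with the numbers directly, a minimal conversion sketch follows. The function names are made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

# LeuSer is listed as 11.379 per mille in the table.
print(permille_to_fraction(11.379))
print(permille_to_percent(11.379))
```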

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.117AlaAla: 2.117 ± 0.915
2.646AlaCys: 2.646 ± 1.633
2.646AlaAsp: 2.646 ± 0.866
3.705AlaGlu: 3.705 ± 0.828
1.852AlaPhe: 1.852 ± 1.143
3.705AlaGly: 3.705 ± 0.954
0.265AlaHis: 0.265 ± 0.163
2.646AlaIle: 2.646 ± 0.46
4.234AlaLys: 4.234 ± 1.782
5.557AlaLeu: 5.557 ± 2.306
0.529AlaMet: 0.529 ± 0.327
0.794AlaAsn: 0.794 ± 0.146
1.852AlaPro: 1.852 ± 0.753
1.588AlaGln: 1.588 ± 1.166
2.646AlaArg: 2.646 ± 0.187
6.351AlaSer: 6.351 ± 1.17
1.852AlaThr: 1.852 ± 0.305
2.382AlaVal: 2.382 ± 0.783
0.529AlaTrp: 0.529 ± 0.327
1.323AlaTyr: 1.323 ± 0.905
0.0AlaXaa: 0.0 ± 0.0
Cys
1.058CysAla: 1.058 ± 0.441
0.794CysCys: 0.794 ± 0.384
0.529CysAsp: 0.529 ± 0.136
1.588CysGlu: 1.588 ± 1.026
1.852CysPhe: 1.852 ± 0.348
0.529CysGly: 0.529 ± 0.136
0.794CysHis: 0.794 ± 0.787
1.588CysIle: 1.588 ± 0.651
1.588CysLys: 1.588 ± 0.768
3.175CysLeu: 3.175 ± 0.444
0.529CysMet: 0.529 ± 0.136
1.588CysAsn: 1.588 ± 0.768
2.646CysPro: 2.646 ± 0.376
0.794CysGln: 0.794 ± 0.52
0.529CysArg: 0.529 ± 0.136
3.969CysSer: 3.969 ± 1.18
2.382CysThr: 2.382 ± 1.152
1.588CysVal: 1.588 ± 0.768
0.265CysTrp: 0.265 ± 0.163
1.058CysTyr: 1.058 ± 0.271
0.0CysXaa: 0.0 ± 0.0
Asp
2.646AspAla: 2.646 ± 0.555
1.058AspCys: 1.058 ± 0.644
2.382AspAsp: 2.382 ± 1.47
4.763AspGlu: 4.763 ± 1.64
2.911AspPhe: 2.911 ± 0.691
2.382AspGly: 2.382 ± 0.895
1.058AspHis: 1.058 ± 0.279
3.705AspIle: 3.705 ± 0.292
4.763AspLys: 4.763 ± 0.8
5.557AspLeu: 5.557 ± 1.035
1.058AspMet: 1.058 ± 0.441
1.852AspAsn: 1.852 ± 0.348
1.323AspPro: 1.323 ± 0.433
1.588AspGln: 1.588 ± 0.407
0.794AspArg: 0.794 ± 0.49
5.028AspSer: 5.028 ± 1.103
0.794AspThr: 0.794 ± 0.146
3.175AspVal: 3.175 ± 0.748
1.058AspTrp: 1.058 ± 0.653
1.588AspTyr: 1.588 ± 0.98
0.0AspXaa: 0.0 ± 0.0
Glu
5.292GluAla: 5.292 ± 0.666
1.058GluCys: 1.058 ± 1.049
4.234GluAsp: 4.234 ± 1.44
6.616GluGlu: 6.616 ± 0.51
3.44GluPhe: 3.44 ± 0.551
2.911GluGly: 2.911 ± 1.017
1.058GluHis: 1.058 ± 0.279
2.911GluIle: 2.911 ± 0.691
4.234GluLys: 4.234 ± 0.216
7.674GluLeu: 7.674 ± 1.942
1.323GluMet: 1.323 ± 0.513
1.588GluAsn: 1.588 ± 0.651
1.852GluPro: 1.852 ± 0.534
2.911GluGln: 2.911 ± 0.267
3.969GluArg: 3.969 ± 0.731
6.351GluSer: 6.351 ± 1.918
3.969GluThr: 3.969 ± 0.866
6.086GluVal: 6.086 ± 2.066
1.323GluTrp: 1.323 ± 0.817
2.117GluTyr: 2.117 ± 0.316
0.0GluXaa: 0.0 ± 0.0
Phe
1.588PheAla: 1.588 ± 0.292
1.058PheCys: 1.058 ± 0.279
1.323PheAsp: 1.323 ± 0.414
3.705PheGlu: 3.705 ± 1.506
2.646PhePhe: 2.646 ± 0.925
1.588PheGly: 1.588 ± 0.292
1.058PheHis: 1.058 ± 0.279
3.44PheIle: 3.44 ± 0.698
1.852PheLys: 1.852 ± 0.534
3.969PheLeu: 3.969 ± 0.065
1.058PheMet: 1.058 ± 0.877
1.852PheAsn: 1.852 ± 0.348
2.646PhePro: 2.646 ± 0.866
1.058PheGln: 1.058 ± 1.095
2.117PheArg: 2.117 ± 0.542
6.086PheSer: 6.086 ± 0.937
2.382PheThr: 2.382 ± 0.538
1.588PheVal: 1.588 ± 0.97
0.529PheTrp: 0.529 ± 0.327
2.646PheTyr: 2.646 ± 1.24
0.0PheXaa: 0.0 ± 0.0
Gly
2.382GlyAla: 2.382 ± 1.114
2.117GlyCys: 2.117 ± 0.542
1.852GlyAsp: 1.852 ± 0.305
3.175GlyGlu: 3.175 ± 0.585
3.44GlyPhe: 3.44 ± 0.282
3.175GlyGly: 3.175 ± 1.157
1.588GlyHis: 1.588 ± 0.407
2.117GlyIle: 2.117 ± 0.65
2.911GlyLys: 2.911 ± 0.267
3.969GlyLeu: 3.969 ± 1.18
2.382GlyMet: 2.382 ± 0.4
1.588GlyAsn: 1.588 ± 0.97
1.852GlyPro: 1.852 ± 0.305
1.323GlyGln: 1.323 ± 0.23
2.646GlyArg: 2.646 ± 0.46
5.292GlySer: 5.292 ± 3.218
2.382GlyThr: 2.382 ± 1.777
3.175GlyVal: 3.175 ± 1.565
0.265GlyTrp: 0.265 ± 0.163
1.852GlyTyr: 1.852 ± 0.348
0.0GlyXaa: 0.0 ± 0.0
His
1.323HisAla: 1.323 ± 0.433
0.794HisCys: 0.794 ± 0.384
0.794HisAsp: 0.794 ± 0.384
1.588HisGlu: 1.588 ± 0.98
0.529HisPhe: 0.529 ± 0.136
0.529HisGly: 0.529 ± 0.327
0.794HisHis: 0.794 ± 0.49
1.058HisIle: 1.058 ± 0.441
0.529HisLys: 0.529 ± 0.327
1.323HisLeu: 1.323 ± 0.513
0.529HisMet: 0.529 ± 0.136
1.058HisAsn: 1.058 ± 0.279
1.058HisPro: 1.058 ± 0.279
1.323HisGln: 1.323 ± 0.433
1.058HisArg: 1.058 ± 0.653
1.588HisSer: 1.588 ± 0.449
1.588HisThr: 1.588 ± 0.768
1.852HisVal: 1.852 ± 0.534
0.265HisTrp: 0.265 ± 0.163
0.794HisTyr: 0.794 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
2.911IleAla: 2.911 ± 0.267
2.117IleCys: 2.117 ± 0.542
3.175IleAsp: 3.175 ± 0.401
4.499IleGlu: 4.499 ± 0.786
1.852IlePhe: 1.852 ± 0.951
3.969IleGly: 3.969 ± 0.843
0.265IleHis: 0.265 ± 0.163
3.175IleIle: 3.175 ± 2.198
5.292IleLys: 5.292 ± 0.374
5.557IleLeu: 5.557 ± 1.834
1.058IleMet: 1.058 ± 0.248
1.588IleAsn: 1.588 ± 1.07
2.646IlePro: 2.646 ± 1.026
1.852IleGln: 1.852 ± 1.26
2.911IleArg: 2.911 ± 0.691
6.88IleSer: 6.88 ± 0.278
1.588IleThr: 1.588 ± 0.374
3.969IleVal: 3.969 ± 0.873
0.794IleTrp: 0.794 ± 0.146
1.058IleTyr: 1.058 ± 0.653
0.0IleXaa: 0.0 ± 0.0
Lys
5.028LysAla: 5.028 ± 0.566
1.588LysCys: 1.588 ± 0.768
3.175LysAsp: 3.175 ± 1.94
4.499LysGlu: 4.499 ± 0.504
2.382LysPhe: 2.382 ± 0.783
2.646LysGly: 2.646 ± 0.376
1.588LysHis: 1.588 ± 0.292
2.382LysIle: 2.382 ± 0.538
5.557LysLys: 5.557 ± 0.431
6.351LysLeu: 6.351 ± 1.382
2.117LysMet: 2.117 ± 0.361
2.911LysAsn: 2.911 ± 0.848
2.911LysPro: 2.911 ± 1.122
2.382LysGln: 2.382 ± 0.839
3.44LysArg: 3.44 ± 0.139
6.351LysSer: 6.351 ± 0.404
4.499LysThr: 4.499 ± 0.485
6.351LysVal: 6.351 ± 0.889
1.058LysTrp: 1.058 ± 0.653
1.323LysTyr: 1.323 ± 0.669
0.0LysXaa: 0.0 ± 0.0
Leu
5.292LeuAla: 5.292 ± 0.962
3.705LeuCys: 3.705 ± 0.124
6.351LeuAsp: 6.351 ± 2.336
6.88LeuGlu: 6.88 ± 1.174
2.117LeuPhe: 2.117 ± 0.882
5.292LeuGly: 5.292 ± 1.081
1.323LeuHis: 1.323 ± 0.433
6.351LeuIle: 6.351 ± 2.1
7.409LeuLys: 7.409 ± 0.999
9.791LeuLeu: 9.791 ± 1.197
2.382LeuMet: 2.382 ± 0.475
5.028LeuAsn: 5.028 ± 0.63
4.499LeuPro: 4.499 ± 1.238
2.117LeuGln: 2.117 ± 0.361
3.175LeuArg: 3.175 ± 0.836
11.379LeuSer: 11.379 ± 1.398
6.086LeuThr: 6.086 ± 1.064
6.616LeuVal: 6.616 ± 1.967
1.588LeuTrp: 1.588 ± 0.407
2.117LeuTyr: 2.117 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
2.117MetAla: 2.117 ± 0.915
0.265MetCys: 0.265 ± 0.262
0.794MetAsp: 0.794 ± 0.49
1.852MetGlu: 1.852 ± 0.496
1.852MetPhe: 1.852 ± 1.143
2.117MetGly: 2.117 ± 0.35
0.529MetHis: 0.529 ± 0.327
2.646MetIle: 2.646 ± 0.98
2.382MetLys: 2.382 ± 0.228
1.588MetLeu: 1.588 ± 0.592
1.323MetMet: 1.323 ± 0.669
1.058MetAsn: 1.058 ± 0.271
0.794MetPro: 0.794 ± 0.49
0.794MetGln: 0.794 ± 0.146
2.911MetArg: 2.911 ± 0.606
1.323MetSer: 1.323 ± 0.414
2.911MetThr: 2.911 ± 0.848
1.588MetVal: 1.588 ± 0.407
0.529MetTrp: 0.529 ± 0.327
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.265AsnAla: 0.265 ± 0.163
1.323AsnCys: 1.323 ± 0.809
1.323AsnAsp: 1.323 ± 0.433
1.058AsnGlu: 1.058 ± 0.279
3.705AsnPhe: 3.705 ± 0.712
1.588AsnGly: 1.588 ± 0.768
1.852AsnHis: 1.852 ± 0.534
1.588AsnIle: 1.588 ± 0.78
3.175AsnLys: 3.175 ± 0.574
3.969AsnLeu: 3.969 ± 1.563
1.588AsnMet: 1.588 ± 0.78
1.058AsnAsn: 1.058 ± 0.627
1.323AsnPro: 1.323 ± 0.433
1.058AsnGln: 1.058 ± 0.653
1.058AsnArg: 1.058 ± 0.627
6.88AsnSer: 6.88 ± 0.817
1.058AsnThr: 1.058 ± 0.279
3.175AsnVal: 3.175 ± 1.301
0.529AsnTrp: 0.529 ± 0.327
2.382AsnTyr: 2.382 ± 0.439
0.0AsnXaa: 0.0 ± 0.0
Pro
1.058ProAla: 1.058 ± 0.627
0.529ProCys: 0.529 ± 0.136
4.499ProAsp: 4.499 ± 0.504
3.705ProGlu: 3.705 ± 1.123
0.529ProPhe: 0.529 ± 0.534
2.646ProGly: 2.646 ± 0.678
1.323ProHis: 1.323 ± 0.433
1.852ProIle: 1.852 ± 0.348
2.646ProLys: 2.646 ± 0.98
3.175ProLeu: 3.175 ± 0.836
2.117ProMet: 2.117 ± 0.542
1.852ProAsn: 1.852 ± 0.496
1.588ProPro: 1.588 ± 0.768
1.058ProGln: 1.058 ± 0.279
2.117ProArg: 2.117 ± 0.361
3.44ProSer: 3.44 ± 0.739
2.382ProThr: 2.382 ± 0.777
2.646ProVal: 2.646 ± 0.187
0.265ProTrp: 0.265 ± 0.262
0.794ProTyr: 0.794 ± 0.535
0.0ProXaa: 0.0 ± 0.0
Gln
0.794GlnAla: 0.794 ± 0.146
0.529GlnCys: 0.529 ± 0.524
2.117GlnAsp: 2.117 ± 0.35
2.382GlnGlu: 2.382 ± 0.475
1.588GlnPhe: 1.588 ± 0.407
2.382GlnGly: 2.382 ± 1.47
1.058GlnHis: 1.058 ± 0.279
1.323GlnIle: 1.323 ± 0.414
1.323GlnLys: 1.323 ± 0.49
1.852GlnLeu: 1.852 ± 0.753
1.058GlnMet: 1.058 ± 0.644
2.117GlnAsn: 2.117 ± 0.558
0.265GlnPro: 0.265 ± 0.262
3.175GlnGln: 3.175 ± 0.401
1.058GlnArg: 1.058 ± 0.583
1.323GlnSer: 1.323 ± 0.23
0.794GlnThr: 0.794 ± 0.384
3.44GlnVal: 3.44 ± 1.505
0.529GlnTrp: 0.529 ± 0.136
2.117GlnTyr: 2.117 ± 0.882
0.0GlnXaa: 0.0 ± 0.0
Arg
2.117ArgAla: 2.117 ± 0.558
1.058ArgCys: 1.058 ± 0.644
1.852ArgAsp: 1.852 ± 0.753
2.117ArgGlu: 2.117 ± 0.558
2.117ArgPhe: 2.117 ± 0.915
2.117ArgGly: 2.117 ± 0.7
0.265ArgHis: 0.265 ± 0.163
3.705ArgIle: 3.705 ± 0.642
3.969ArgLys: 3.969 ± 0.689
3.969ArgLeu: 3.969 ± 0.799
1.852ArgMet: 1.852 ± 1.587
2.646ArgAsn: 2.646 ± 0.678
1.323ArgPro: 1.323 ± 0.513
2.117ArgGln: 2.117 ± 0.361
2.646ArgArg: 2.646 ± 0.555
4.234ArgSer: 4.234 ± 0.216
3.175ArgThr: 3.175 ± 0.585
4.499ArgVal: 4.499 ± 0.377
0.265ArgTrp: 0.265 ± 0.163
0.529ArgTyr: 0.529 ± 0.327
0.0ArgXaa: 0.0 ± 0.0
Ser
5.028SerAla: 5.028 ± 0.859
4.234SerCys: 4.234 ± 1.422
5.822SerAsp: 5.822 ± 0.646
8.732SerGlu: 8.732 ± 2.594
4.234SerPhe: 4.234 ± 0.216
4.234SerGly: 4.234 ± 0.822
2.117SerHis: 2.117 ± 0.65
7.145SerIle: 7.145 ± 0.668
5.822SerLys: 5.822 ± 0.436
12.437SerLeu: 12.437 ± 2.093
1.588SerMet: 1.588 ± 0.98
5.028SerAsn: 5.028 ± 0.916
3.969SerPro: 3.969 ± 0.799
1.323SerGln: 1.323 ± 0.49
4.763SerArg: 4.763 ± 0.714
10.585SerSer: 10.585 ± 0.713
6.351SerThr: 6.351 ± 0.503
6.086SerVal: 6.086 ± 1.67
1.323SerTrp: 1.323 ± 0.817
2.382SerTyr: 2.382 ± 0.4
0.0SerXaa: 0.0 ± 0.0
Thr
1.852ThrAla: 1.852 ± 0.944
0.794ThrCys: 0.794 ± 0.146
2.382ThrAsp: 2.382 ± 0.783
2.646ThrGlu: 2.646 ± 1.271
2.382ThrPhe: 2.382 ± 0.228
2.646ThrGly: 2.646 ± 0.678
1.323ThrHis: 1.323 ± 0.414
3.705ThrIle: 3.705 ± 0.539
2.646ThrLys: 2.646 ± 0.523
8.997ThrLeu: 8.997 ± 0.971
2.911ThrMet: 2.911 ± 0.5
2.117ThrAsn: 2.117 ± 0.947
2.382ThrPro: 2.382 ± 0.4
1.058ThrGln: 1.058 ± 0.271
2.646ThrArg: 2.646 ± 0.555
4.499ThrSer: 4.499 ± 0.161
3.705ThrThr: 3.705 ± 1.142
4.499ThrVal: 4.499 ± 0.786
0.529ThrTrp: 0.529 ± 0.524
2.911ThrTyr: 2.911 ± 1.122
0.0ThrXaa: 0.0 ± 0.0
Val
3.969ValAla: 3.969 ± 1.213
2.117ValCys: 2.117 ± 0.542
2.382ValAsp: 2.382 ± 0.475
5.557ValGlu: 5.557 ± 0.916
2.911ValPhe: 2.911 ± 0.218
3.44ValGly: 3.44 ± 0.663
1.323ValHis: 1.323 ± 0.905
3.44ValIle: 3.44 ± 1.146
5.028ValLys: 5.028 ± 0.63
6.086ValLeu: 6.086 ± 1.272
2.382ValMet: 2.382 ± 0.6
2.117ValAsn: 2.117 ± 0.361
3.705ValPro: 3.705 ± 0.696
2.382ValGln: 2.382 ± 0.439
3.705ValArg: 3.705 ± 0.124
8.203ValSer: 8.203 ± 1.428
5.292ValThr: 5.292 ± 2.204
5.292ValVal: 5.292 ± 0.511
0.794ValTrp: 0.794 ± 0.384
1.588ValTyr: 1.588 ± 0.374
0.0ValXaa: 0.0 ± 0.0
Trp
0.529TrpAla: 0.529 ± 0.327
0.529TrpCys: 0.529 ± 0.136
1.058TrpAsp: 1.058 ± 0.279
0.265TrpGlu: 0.265 ± 0.262
0.794TrpPhe: 0.794 ± 0.49
0.265TrpGly: 0.265 ± 0.163
0.529TrpHis: 0.529 ± 0.327
0.529TrpIle: 0.529 ± 0.136
0.794TrpLys: 0.794 ± 0.384
1.323TrpLeu: 1.323 ± 0.23
0.794TrpMet: 0.794 ± 0.49
1.058TrpAsn: 1.058 ± 0.653
0.0TrpPro: 0.0 ± 0.0
0.265TrpGln: 0.265 ± 0.163
0.529TrpArg: 0.529 ± 0.136
1.323TrpSer: 1.323 ± 0.433
1.058TrpThr: 1.058 ± 0.279
1.058TrpVal: 1.058 ± 0.653
0.0TrpTrp: 0.0 ± 0.0
0.265TrpTyr: 0.265 ± 0.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.852TyrAla: 1.852 ± 0.907
0.529TyrCys: 0.529 ± 0.534
0.794TyrAsp: 0.794 ± 0.146
1.588TyrGlu: 1.588 ± 0.407
1.058TyrPhe: 1.058 ± 0.271
1.323TyrGly: 1.323 ± 0.669
0.265TyrHis: 0.265 ± 0.163
1.852TyrIle: 1.852 ± 0.753
2.382TyrLys: 2.382 ± 1.478
3.175TyrLeu: 3.175 ± 0.813
0.529TyrMet: 0.529 ± 0.327
1.058TyrAsn: 1.058 ± 0.627
1.588TyrPro: 1.588 ± 0.407
1.058TyrGln: 1.058 ± 0.653
1.852TyrArg: 1.852 ± 0.348
2.117TyrSer: 2.117 ± 0.7
2.382TyrThr: 2.382 ± 0.783
2.911TyrVal: 2.911 ± 0.663
0.529TyrTrp: 0.529 ± 0.136
1.588TyrTyr: 1.588 ± 0.651
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3780 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
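A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is a generic sketch, not the database's actual procedure, which the page does not describe. The function names, the use of the population standard deviation, and the fixed seed are all assumptions.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) across the given sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKLLSA", "MSLLLL", "MKSA"]
print(pair_frequency(toy, "LL"), bootstrap_error(toy, "LL"))
```

With only 3 proteins, as in this proteome, each replicate draws from very few distinct units, so error estimates of this kind should be read with caution.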

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski