Amino acid dipepetide frequency for Hubei myriapoda virus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.77AlaAla: 1.77 ± 1.13
1.011AlaCys: 1.011 ± 0.598
2.781AlaAsp: 2.781 ± 1.131
4.551AlaGlu: 4.551 ± 0.901
2.276AlaPhe: 2.276 ± 0.29
2.276AlaGly: 2.276 ± 1.309
2.781AlaHis: 2.781 ± 0.859
4.046AlaIle: 4.046 ± 1.02
3.034AlaLys: 3.034 ± 0.757
6.574AlaLeu: 6.574 ± 1.862
3.034AlaMet: 3.034 ± 2.2
2.276AlaAsn: 2.276 ± 1.043
1.011AlaPro: 1.011 ± 1.809
2.528AlaGln: 2.528 ± 1.137
2.781AlaArg: 2.781 ± 0.998
3.54AlaSer: 3.54 ± 0.774
3.54AlaThr: 3.54 ± 1.55
3.034AlaVal: 3.034 ± 0.336
1.264AlaTrp: 1.264 ± 0.223
2.023AlaTyr: 2.023 ± 1.916
0.0AlaXaa: 0.0 ± 0.0
Cys
1.011CysAla: 1.011 ± 0.756
0.0CysCys: 0.0 ± 0.0
0.506CysAsp: 0.506 ± 0.204
0.506CysGlu: 0.506 ± 0.204
0.759CysPhe: 0.759 ± 0.348
0.506CysGly: 0.506 ± 0.299
0.506CysHis: 0.506 ± 0.299
1.517CysIle: 1.517 ± 0.257
0.759CysLys: 0.759 ± 0.489
2.023CysLeu: 2.023 ± 0.502
0.253CysMet: 0.253 ± 0.149
1.264CysAsn: 1.264 ± 0.687
2.023CysPro: 2.023 ± 0.566
0.506CysGln: 0.506 ± 0.299
1.011CysArg: 1.011 ± 0.251
2.023CysSer: 2.023 ± 0.557
0.253CysThr: 0.253 ± 0.299
0.506CysVal: 0.506 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.759CysTyr: 0.759 ± 0.448
0.0CysXaa: 0.0 ± 0.0
Asp
3.54AspAla: 3.54 ± 0.331
0.759AspCys: 0.759 ± 0.394
4.298AspAsp: 4.298 ± 1.077
2.781AspGlu: 2.781 ± 0.815
2.781AspPhe: 2.781 ± 0.787
2.276AspGly: 2.276 ± 0.29
1.517AspHis: 1.517 ± 0.678
5.815AspIle: 5.815 ± 2.871
2.023AspLys: 2.023 ± 0.557
4.804AspLeu: 4.804 ± 0.679
0.759AspMet: 0.759 ± 0.834
2.528AspAsn: 2.528 ± 0.262
0.759AspPro: 0.759 ± 0.448
2.528AspGln: 2.528 ± 0.566
1.77AspArg: 1.77 ± 1.35
2.781AspSer: 2.781 ± 0.748
2.276AspThr: 2.276 ± 0.764
3.287AspVal: 3.287 ± 0.695
1.517AspTrp: 1.517 ± 0.257
3.287AspTyr: 3.287 ± 0.432
0.0AspXaa: 0.0 ± 0.0
Glu
2.781GluAla: 2.781 ± 1.131
1.011GluCys: 1.011 ± 0.251
2.528GluAsp: 2.528 ± 0.424
2.276GluGlu: 2.276 ± 0.666
2.023GluPhe: 2.023 ± 0.566
3.034GluGly: 3.034 ± 1.547
2.023GluHis: 2.023 ± 0.502
7.08GluIle: 7.08 ± 1.552
3.287GluLys: 3.287 ± 0.528
8.344GluLeu: 8.344 ± 1.355
1.517GluMet: 1.517 ± 0.257
4.804GluAsn: 4.804 ± 0.625
1.011GluPro: 1.011 ± 0.784
2.528GluGln: 2.528 ± 0.923
4.046GluArg: 4.046 ± 1.689
5.057GluSer: 5.057 ± 0.561
2.781GluThr: 2.781 ± 1.644
2.276GluVal: 2.276 ± 1.181
0.253GluTrp: 0.253 ± 0.149
1.77GluTyr: 1.77 ± 0.806
0.0GluXaa: 0.0 ± 0.0
Phe
2.781PheAla: 2.781 ± 1.425
1.011PheCys: 1.011 ± 0.527
3.287PheAsp: 3.287 ± 0.928
3.793PheGlu: 3.793 ± 0.801
1.011PhePhe: 1.011 ± 0.46
3.54PheGly: 3.54 ± 0.941
1.517PheHis: 1.517 ± 0.545
3.54PheIle: 3.54 ± 0.623
1.264PheLys: 1.264 ± 0.223
6.321PheLeu: 6.321 ± 1.201
1.011PheMet: 1.011 ± 0.407
5.057PheAsn: 5.057 ± 0.255
1.264PhePro: 1.264 ± 0.662
1.77PheGln: 1.77 ± 0.565
1.517PheArg: 1.517 ± 0.678
2.781PheSer: 2.781 ± 0.601
1.517PheThr: 1.517 ± 0.392
2.276PheVal: 2.276 ± 0.964
0.759PheTrp: 0.759 ± 0.448
1.517PheTyr: 1.517 ± 0.392
0.0PheXaa: 0.0 ± 0.0
Gly
2.276GlyAla: 2.276 ± 1.258
0.253GlyCys: 0.253 ± 0.149
2.276GlyAsp: 2.276 ± 0.653
1.517GlyGlu: 1.517 ± 0.611
3.287GlyPhe: 3.287 ± 0.6
2.276GlyGly: 2.276 ± 1.389
1.264GlyHis: 1.264 ± 0.223
3.54GlyIle: 3.54 ± 2.156
1.517GlyLys: 1.517 ± 0.978
6.574GlyLeu: 6.574 ± 1.199
1.264GlyMet: 1.264 ± 0.223
2.276GlyAsn: 2.276 ± 0.964
0.506GlyPro: 0.506 ± 0.299
2.781GlyGln: 2.781 ± 0.565
2.528GlyArg: 2.528 ± 0.847
2.528GlySer: 2.528 ± 0.738
2.781GlyThr: 2.781 ± 1.407
3.034GlyVal: 3.034 ± 1.645
0.506GlyTrp: 0.506 ± 0.299
1.264GlyTyr: 1.264 ± 0.223
0.0GlyXaa: 0.0 ± 0.0
His
1.264HisAla: 1.264 ± 0.56
0.759HisCys: 0.759 ± 0.196
1.77HisAsp: 1.77 ± 0.577
1.011HisGlu: 1.011 ± 0.283
2.276HisPhe: 2.276 ± 1.014
0.759HisGly: 0.759 ± 0.394
0.506HisHis: 0.506 ± 0.204
1.264HisIle: 1.264 ± 0.782
1.517HisLys: 1.517 ± 0.678
3.287HisLeu: 3.287 ± 0.967
1.264HisMet: 1.264 ± 0.408
1.77HisAsn: 1.77 ± 0.688
1.264HisPro: 1.264 ± 0.696
1.517HisGln: 1.517 ± 1.943
1.264HisArg: 1.264 ± 0.747
2.781HisSer: 2.781 ± 0.377
0.759HisThr: 0.759 ± 0.448
1.264HisVal: 1.264 ± 0.386
0.506HisTrp: 0.506 ± 0.473
0.759HisTyr: 0.759 ± 0.394
0.0HisXaa: 0.0 ± 0.0
Ile
6.321IleAla: 6.321 ± 2.038
1.011IleCys: 1.011 ± 0.46
3.793IleAsp: 3.793 ± 0.54
5.815IleGlu: 5.815 ± 1.051
3.034IlePhe: 3.034 ± 1.955
3.54IleGly: 3.54 ± 2.152
2.781IleHis: 2.781 ± 0.748
7.332IleIle: 7.332 ± 0.818
4.298IleLys: 4.298 ± 0.796
8.597IleLeu: 8.597 ± 1.335
3.034IleMet: 3.034 ± 0.71
5.057IleAsn: 5.057 ± 1.305
3.793IlePro: 3.793 ± 0.602
3.287IleGln: 3.287 ± 0.697
4.046IleArg: 4.046 ± 1.403
7.585IleSer: 7.585 ± 1.701
7.332IleThr: 7.332 ± 1.04
3.287IleVal: 3.287 ± 0.805
0.759IleTrp: 0.759 ± 0.348
1.77IleTyr: 1.77 ± 0.688
0.0IleXaa: 0.0 ± 0.0
Lys
2.781LysAla: 2.781 ± 0.815
0.506LysCys: 0.506 ± 0.597
3.793LysAsp: 3.793 ± 0.896
3.793LysGlu: 3.793 ± 0.98
2.528LysPhe: 2.528 ± 0.968
1.517LysGly: 1.517 ± 0.815
0.759LysHis: 0.759 ± 0.196
7.332LysIle: 7.332 ± 1.63
1.264LysLys: 1.264 ± 0.687
7.838LysLeu: 7.838 ± 1.147
1.011LysMet: 1.011 ± 0.407
2.023LysAsn: 2.023 ± 0.183
2.276LysPro: 2.276 ± 1.306
2.528LysGln: 2.528 ± 0.741
4.046LysArg: 4.046 ± 0.571
2.528LysSer: 2.528 ± 1.374
4.046LysThr: 4.046 ± 1.179
2.528LysVal: 2.528 ± 0.653
0.0LysTrp: 0.0 ± 0.0
1.77LysTyr: 1.77 ± 0.806
0.0LysXaa: 0.0 ± 0.0
Leu
7.838LeuAla: 7.838 ± 2.2
1.77LeuCys: 1.77 ± 0.565
4.298LeuAsp: 4.298 ± 1.237
6.321LeuGlu: 6.321 ± 0.561
5.815LeuPhe: 5.815 ± 1.148
3.54LeuGly: 3.54 ± 0.662
3.034LeuHis: 3.034 ± 1.543
9.355LeuIle: 9.355 ± 1.596
10.619LeuLys: 10.619 ± 0.449
13.906LeuLeu: 13.906 ± 3.053
1.517LeuMet: 1.517 ± 0.743
7.585LeuAsn: 7.585 ± 1.132
5.563LeuPro: 5.563 ± 0.355
3.54LeuGln: 3.54 ± 0.953
6.068LeuArg: 6.068 ± 1.506
6.574LeuSer: 6.574 ± 1.052
7.08LeuThr: 7.08 ± 1.032
5.31LeuVal: 5.31 ± 0.55
2.023LeuTrp: 2.023 ± 0.527
3.54LeuTyr: 3.54 ± 0.704
0.0LeuXaa: 0.0 ± 0.0
Met
1.517MetAla: 1.517 ± 1.079
0.506MetCys: 0.506 ± 0.378
1.011MetAsp: 1.011 ± 0.46
1.517MetGlu: 1.517 ± 0.392
1.011MetPhe: 1.011 ± 0.646
1.011MetGly: 1.011 ± 0.283
1.011MetHis: 1.011 ± 0.837
2.023MetIle: 2.023 ± 0.39
2.023MetLys: 2.023 ± 0.668
2.781MetLeu: 2.781 ± 0.377
1.517MetMet: 1.517 ± 0.678
0.759MetAsn: 0.759 ± 0.489
0.759MetPro: 0.759 ± 0.629
1.011MetGln: 1.011 ± 0.251
2.023MetArg: 2.023 ± 0.833
2.528MetSer: 2.528 ± 1.018
1.517MetThr: 1.517 ± 1.39
1.264MetVal: 1.264 ± 0.747
0.506MetTrp: 0.506 ± 0.299
0.759MetTyr: 0.759 ± 0.394
0.0MetXaa: 0.0 ± 0.0
Asn
2.781AsnAla: 2.781 ± 1.131
1.011AsnCys: 1.011 ± 0.407
4.046AsnAsp: 4.046 ± 1.02
4.046AsnGlu: 4.046 ± 0.78
3.287AsnPhe: 3.287 ± 0.926
1.517AsnGly: 1.517 ± 0.285
1.517AsnHis: 1.517 ± 0.678
4.804AsnIle: 4.804 ± 0.625
3.793AsnLys: 3.793 ± 2.444
6.827AsnLeu: 6.827 ± 1.603
1.011AsnMet: 1.011 ± 0.283
3.287AsnAsn: 3.287 ± 0.677
3.287AsnPro: 3.287 ± 0.528
2.781AsnGln: 2.781 ± 0.815
3.034AsnArg: 3.034 ± 0.336
2.528AsnSer: 2.528 ± 0.773
3.034AsnThr: 3.034 ± 1.091
2.276AsnVal: 2.276 ± 0.637
1.517AsnTrp: 1.517 ± 0.98
1.77AsnTyr: 1.77 ± 0.565
0.0AsnXaa: 0.0 ± 0.0
Pro
3.287ProAla: 3.287 ± 1.013
0.253ProCys: 0.253 ± 0.299
2.276ProAsp: 2.276 ± 1.293
2.023ProGlu: 2.023 ± 0.183
3.034ProPhe: 3.034 ± 0.336
1.77ProGly: 1.77 ± 0.624
0.253ProHis: 0.253 ± 0.149
2.023ProIle: 2.023 ± 0.566
2.023ProLys: 2.023 ± 0.557
4.298ProLeu: 4.298 ± 0.89
0.506ProMet: 0.506 ± 0.299
2.276ProAsn: 2.276 ± 0.29
2.276ProPro: 2.276 ± 1.983
0.759ProGln: 0.759 ± 0.671
2.276ProArg: 2.276 ± 0.979
4.046ProSer: 4.046 ± 1.491
3.034ProThr: 3.034 ± 2.483
2.023ProVal: 2.023 ± 0.566
0.0ProTrp: 0.0 ± 0.0
1.264ProTyr: 1.264 ± 0.569
0.0ProXaa: 0.0 ± 0.0
Gln
2.528GlnAla: 2.528 ± 0.866
0.506GlnCys: 0.506 ± 0.204
0.759GlnAsp: 0.759 ± 0.196
2.781GlnGlu: 2.781 ± 0.815
2.023GlnPhe: 2.023 ± 0.566
1.517GlnGly: 1.517 ± 1.241
0.759GlnHis: 0.759 ± 0.896
3.54GlnIle: 3.54 ± 0.463
4.298GlnLys: 4.298 ± 1.602
4.551GlnLeu: 4.551 ± 0.675
0.759GlnMet: 0.759 ± 0.448
3.034GlnAsn: 3.034 ± 1.338
1.77GlnPro: 1.77 ± 0.565
1.517GlnGln: 1.517 ± 0.67
2.276GlnArg: 2.276 ± 0.659
2.023GlnSer: 2.023 ± 0.725
2.276GlnThr: 2.276 ± 0.579
2.528GlnVal: 2.528 ± 0.653
0.0GlnTrp: 0.0 ± 0.0
2.023GlnTyr: 2.023 ± 0.941
0.0GlnXaa: 0.0 ± 0.0
Arg
3.034ArgAla: 3.034 ± 1.047
1.264ArgCys: 1.264 ± 0.747
2.276ArgAsp: 2.276 ± 1.309
3.793ArgGlu: 3.793 ± 1.726
1.77ArgPhe: 1.77 ± 0.688
3.54ArgGly: 3.54 ± 0.952
1.011ArgHis: 1.011 ± 0.598
4.046ArgIle: 4.046 ± 1.132
2.276ArgLys: 2.276 ± 0.588
7.08ArgLeu: 7.08 ± 1.339
1.517ArgMet: 1.517 ± 0.787
2.528ArgAsn: 2.528 ± 0.851
1.264ArgPro: 1.264 ± 1.268
3.287ArgGln: 3.287 ± 1.482
2.781ArgArg: 2.781 ± 0.565
3.287ArgSer: 3.287 ± 0.91
1.77ArgThr: 1.77 ± 0.806
2.276ArgVal: 2.276 ± 0.659
1.011ArgTrp: 1.011 ± 0.598
1.517ArgTyr: 1.517 ± 1.05
0.0ArgXaa: 0.0 ± 0.0
Ser
3.793SerAla: 3.793 ± 1.205
2.528SerCys: 2.528 ± 0.424
3.793SerAsp: 3.793 ± 2.06
2.528SerGlu: 2.528 ± 1.126
3.793SerPhe: 3.793 ± 1.038
4.298SerGly: 4.298 ± 1.077
1.011SerHis: 1.011 ± 0.46
6.068SerIle: 6.068 ± 2.111
3.54SerLys: 3.54 ± 0.793
7.08SerLeu: 7.08 ± 0.108
1.77SerMet: 1.77 ± 0.451
5.057SerAsn: 5.057 ± 0.858
2.781SerPro: 2.781 ± 0.748
2.781SerGln: 2.781 ± 0.306
2.276SerArg: 2.276 ± 0.979
5.31SerSer: 5.31 ± 1.481
3.287SerThr: 3.287 ± 1.103
3.287SerVal: 3.287 ± 0.69
1.011SerTrp: 1.011 ± 0.598
1.77SerTyr: 1.77 ± 0.577
0.0SerXaa: 0.0 ± 0.0
Thr
3.793ThrAla: 3.793 ± 1.276
0.506ThrCys: 0.506 ± 0.299
2.781ThrAsp: 2.781 ± 0.306
5.815ThrGlu: 5.815 ± 1.647
2.528ThrPhe: 2.528 ± 0.809
2.276ThrGly: 2.276 ± 1.052
2.276ThrHis: 2.276 ± 0.686
5.057ThrIle: 5.057 ± 0.997
2.276ThrLys: 2.276 ± 0.565
5.31ThrLeu: 5.31 ± 1.043
1.264ThrMet: 1.264 ± 0.725
1.77ThrAsn: 1.77 ± 0.688
2.781ThrPro: 2.781 ± 0.787
2.023ThrGln: 2.023 ± 1.054
3.54ThrArg: 3.54 ± 1.178
5.057ThrSer: 5.057 ± 0.992
2.023ThrThr: 2.023 ± 0.557
3.034ThrVal: 3.034 ± 1.047
0.759ThrTrp: 0.759 ± 0.489
2.023ThrTyr: 2.023 ± 0.566
0.0ThrXaa: 0.0 ± 0.0
Val
2.023ValAla: 2.023 ± 0.502
0.759ValCys: 0.759 ± 0.489
3.793ValAsp: 3.793 ± 0.602
3.793ValGlu: 3.793 ± 1.439
1.77ValPhe: 1.77 ± 0.843
2.023ValGly: 2.023 ± 0.607
1.77ValHis: 1.77 ± 0.577
3.287ValIle: 3.287 ± 1.57
3.034ValLys: 3.034 ± 1.15
4.551ValLeu: 4.551 ± 1.045
1.77ValMet: 1.77 ± 0.248
2.276ValAsn: 2.276 ± 1.089
1.264ValPro: 1.264 ± 0.743
1.264ValGln: 1.264 ± 0.662
2.276ValArg: 2.276 ± 0.659
2.276ValSer: 2.276 ± 0.29
4.551ValThr: 4.551 ± 1.958
2.276ValVal: 2.276 ± 0.579
0.759ValTrp: 0.759 ± 0.842
2.528ValTyr: 2.528 ± 0.424
0.0ValXaa: 0.0 ± 0.0
Trp
0.506TrpAla: 0.506 ± 0.378
0.0TrpCys: 0.0 ± 0.0
0.506TrpAsp: 0.506 ± 0.299
0.759TrpGlu: 0.759 ± 0.448
0.759TrpPhe: 0.759 ± 0.784
0.253TrpGly: 0.253 ± 0.149
0.253TrpHis: 0.253 ± 0.149
2.023TrpIle: 2.023 ± 1.053
0.759TrpLys: 0.759 ± 0.348
2.023TrpLeu: 2.023 ± 0.183
0.506TrpMet: 0.506 ± 0.299
1.011TrpAsn: 1.011 ± 0.251
1.517TrpPro: 1.517 ± 0.392
0.253TrpGln: 0.253 ± 0.419
0.253TrpArg: 0.253 ± 0.149
0.506TrpSer: 0.506 ± 0.299
1.011TrpThr: 1.011 ± 0.784
0.253TrpVal: 0.253 ± 0.149
0.0TrpTrp: 0.0 ± 0.0
0.506TrpTyr: 0.506 ± 0.299
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.506TyrAla: 0.506 ± 0.905
1.264TyrCys: 1.264 ± 0.386
1.517TyrAsp: 1.517 ± 0.285
1.011TyrGlu: 1.011 ± 0.251
1.77TyrPhe: 1.77 ± 0.397
2.781TyrGly: 2.781 ± 0.815
1.011TyrHis: 1.011 ± 0.598
2.781TyrIle: 2.781 ± 0.306
1.264TyrLys: 1.264 ± 0.386
2.781TyrLeu: 2.781 ± 0.776
1.517TyrMet: 1.517 ± 0.565
1.517TyrAsn: 1.517 ± 0.545
2.528TyrPro: 2.528 ± 1.043
2.276TyrGln: 2.276 ± 0.588
1.517TyrArg: 1.517 ± 1.134
2.023TyrSer: 2.023 ± 1.196
2.023TyrThr: 2.023 ± 0.566
2.023TyrVal: 2.023 ± 0.566
0.506TyrTrp: 0.506 ± 0.204
2.023TyrTyr: 2.023 ± 0.566
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3956 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski