Amino acid dipeptide frequency for Chinese broad-headed pond turtle arterivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
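As an illustration of what such a calculation involves, here is a minimal Python sketch. It is not the database's own code: the function name dipeptide_frequencies, the choice to count overlapping pairs, the rule that pairs never span two proteins, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example. The two short sequences are made up.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Two made-up toy sequences, only to show the call.
freqs = dipeptide_frequencies(["MALLS", "ALSB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The result uses one-letter codes (AL for AlaLeu), whereas the table on this page uses three-letter codes.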

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
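The arithmetic behind the uniform expectation and the per mille conversion can be sketched as follows. The helper names are hypothetical. The plain greater-than or less-than comparison is also an assumption: the page does not state what threshold, if any, the red and blue marking uses.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def to_fraction(value_per_mille):
    """Convert a per mille table value to a plain fraction."""
    return value_per_mille * 1e-3

def classify(value_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

print(expected_per_mille())      # 20 amino acids, 400 dipeptides
print(expected_per_mille(21))    # with Xaa, 441 dipeptides
print(classify(10.771))          # LeuSer value from the table
print(classify(0.0))             # TrpTrp value from the table
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to 1000/441, roughly 2.27‰.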

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.957 ± 1.147
AlaCys: 3.168 ± 2.163
AlaAsp: 1.056 ± 0.463
AlaGlu: 1.267 ± 0.643
AlaPhe: 3.59 ± 1.327
AlaGly: 2.957 ± 0.99
AlaHis: 1.478 ± 0.49
AlaIle: 4.435 ± 1.23
AlaLys: 2.112 ± 1.351
AlaLeu: 5.069 ± 1.129
AlaMet: 0.634 ± 0.865
AlaAsn: 2.534 ± 1.273
AlaPro: 2.323 ± 0.965
AlaGln: 1.056 ± 0.246
AlaArg: 2.112 ± 2.81
AlaSer: 5.069 ± 1.279
AlaThr: 2.746 ± 1.046
AlaVal: 3.801 ± 1.561
AlaTrp: 0.845 ± 0.201
AlaTyr: 2.112 ± 0.711
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.112 ± 0.37
CysCys: 0.422 ± 0.214
CysAsp: 1.901 ± 0.437
CysGlu: 0.634 ± 0.208
CysPhe: 2.323 ± 0.815
CysGly: 3.168 ± 1.235
CysHis: 1.478 ± 1.347
CysIle: 2.323 ± 0.957
CysLys: 1.478 ± 0.517
CysLeu: 3.801 ± 1.553
CysMet: 0.634 ± 0.322
CysAsn: 1.901 ± 0.625
CysPro: 1.69 ± 0.6
CysGln: 0.211 ± 0.107
CysArg: 1.267 ± 0.61
CysSer: 4.435 ± 0.796
CysThr: 2.112 ± 1.066
CysVal: 2.534 ± 0.919
CysTrp: 0.211 ± 0.344
CysTyr: 2.534 ± 0.504
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.746 ± 0.969
AspCys: 1.69 ± 0.858
AspAsp: 2.534 ± 0.919
AspGlu: 1.478 ± 0.412
AspPhe: 1.901 ± 0.597
AspGly: 2.957 ± 0.746
AspHis: 1.056 ± 1.289
AspIle: 2.112 ± 0.651
AspLys: 2.323 ± 0.563
AspLeu: 4.857 ± 0.97
AspMet: 1.056 ± 0.536
AspAsn: 1.056 ± 0.246
AspPro: 3.379 ± 1.017
AspGln: 0.845 ± 0.201
AspArg: 1.901 ± 0.437
AspSer: 1.901 ± 0.965
AspThr: 3.379 ± 1.353
AspVal: 4.435 ± 1.113
AspTrp: 0.634 ± 0.322
AspTyr: 1.267 ± 0.716
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.224 ± 0.701
GluCys: 1.267 ± 0.322
GluAsp: 3.379 ± 1.341
GluGlu: 6.125 ± 1.588
GluPhe: 1.901 ± 0.658
GluGly: 2.112 ± 0.599
GluHis: 2.112 ± 0.599
GluIle: 1.69 ± 0.255
GluLys: 4.224 ± 1.081
GluLeu: 4.013 ± 0.62
GluMet: 1.056 ± 0.536
GluAsn: 2.112 ± 0.846
GluPro: 1.478 ± 0.929
GluGln: 1.478 ± 0.412
GluArg: 0.845 ± 0.716
GluSer: 2.746 ± 0.732
GluThr: 3.168 ± 0.739
GluVal: 3.59 ± 1.227
GluTrp: 0.634 ± 0.322
GluTyr: 0.845 ± 0.201
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.478 ± 0.49
PheCys: 2.746 ± 0.881
PheAsp: 2.746 ± 0.625
PheGlu: 2.323 ± 0.815
PhePhe: 0.634 ± 0.74
PheGly: 2.957 ± 0.42
PheHis: 1.69 ± 1.467
PheIle: 2.534 ± 0.644
PheLys: 2.323 ± 0.572
PheLeu: 4.646 ± 2.589
PheMet: 1.478 ± 0.648
PheAsn: 3.801 ± 1.25
PhePro: 1.056 ± 0.246
PheGln: 1.478 ± 1.702
PheArg: 1.901 ± 0.609
PheSer: 4.013 ± 1.009
PheThr: 2.957 ± 0.42
PheVal: 5.069 ± 0.87
PheTrp: 0.211 ± 0.107
PheTyr: 1.478 ± 0.723
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.59 ± 0.739
GlyCys: 3.379 ± 1.341
GlyAsp: 3.379 ± 1.715
GlyGlu: 4.224 ± 1.766
GlyPhe: 3.168 ± 0.497
GlyGly: 2.746 ± 1.126
GlyHis: 1.478 ± 0.396
GlyIle: 3.379 ± 0.557
GlyLys: 2.957 ± 0.935
GlyLeu: 5.069 ± 1.08
GlyMet: 0.422 ± 0.214
GlyAsn: 2.746 ± 0.986
GlyPro: 4.013 ± 1.009
GlyGln: 0.845 ± 1.055
GlyArg: 2.112 ± 0.667
GlySer: 6.547 ± 0.73
GlyThr: 5.069 ± 1.208
GlyVal: 5.491 ± 1.182
GlyTrp: 0.422 ± 0.214
GlyTyr: 2.323 ± 2.028
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.056 ± 0.669
HisCys: 1.056 ± 0.536
HisAsp: 1.056 ± 0.246
HisGlu: 1.267 ± 0.417
HisPhe: 0.845 ± 0.775
HisGly: 1.478 ± 0.412
HisHis: 0.845 ± 0.201
HisIle: 2.534 ± 1.185
HisLys: 0.845 ± 0.353
HisLeu: 2.746 ± 2.331
HisMet: 0.211 ± 0.107
HisAsn: 1.901 ± 0.355
HisPro: 1.267 ± 0.322
HisGln: 0.845 ± 1.034
HisArg: 0.634 ± 0.322
HisSer: 2.323 ± 0.755
HisThr: 2.746 ± 1.511
HisVal: 2.323 ± 0.592
HisTrp: 0.0 ± 0.0
HisTyr: 1.056 ± 0.864
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.112 ± 1.591
IleCys: 1.056 ± 0.246
IleAsp: 2.112 ± 1.034
IleGlu: 2.323 ± 0.278
IlePhe: 2.323 ± 0.572
IleGly: 3.59 ± 0.546
IleHis: 1.901 ± 0.625
IleIle: 5.702 ± 1.061
IleLys: 2.957 ± 1.457
IleLeu: 6.125 ± 1.082
IleMet: 1.69 ± 0.667
IleAsn: 2.323 ± 0.592
IlePro: 4.646 ± 0.871
IleGln: 1.478 ± 0.75
IleArg: 4.435 ± 0.89
IleSer: 5.28 ± 0.981
IleThr: 4.857 ± 0.957
IleVal: 4.646 ± 0.518
IleTrp: 0.845 ± 0.526
IleTyr: 2.957 ± 0.42
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.224 ± 1.766
LysCys: 0.845 ± 0.429
LysAsp: 1.478 ± 0.412
LysGlu: 1.478 ± 1.285
LysPhe: 2.323 ± 0.563
LysGly: 3.168 ± 1.725
LysHis: 1.478 ± 0.412
LysIle: 3.379 ± 0.844
LysLys: 3.168 ± 1.534
LysLeu: 4.013 ± 1.011
LysMet: 0.422 ± 0.242
LysAsn: 1.69 ± 0.509
LysPro: 4.013 ± 1.414
LysGln: 1.056 ± 1.146
LysArg: 2.112 ± 2.468
LysSer: 4.224 ± 0.513
LysThr: 3.168 ± 1.258
LysVal: 2.534 ± 0.895
LysTrp: 0.634 ± 0.848
LysTyr: 2.323 ± 0.648
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.547 ± 1.034
LeuCys: 2.323 ± 0.978
LeuAsp: 4.013 ± 0.895
LeuGlu: 4.857 ± 0.563
LeuPhe: 3.59 ± 2.164
LeuGly: 5.28 ± 0.836
LeuHis: 2.534 ± 1.185
LeuIle: 5.491 ± 0.883
LeuLys: 2.534 ± 0.637
LeuLeu: 8.448 ± 0.985
LeuMet: 1.056 ± 0.246
LeuAsn: 4.435 ± 0.545
LeuPro: 5.491 ± 0.793
LeuGln: 1.901 ± 0.639
LeuArg: 4.224 ± 0.622
LeuSer: 10.771 ± 1.567
LeuThr: 5.28 ± 1.18
LeuVal: 7.181 ± 1.357
LeuTrp: 0.422 ± 0.263
LeuTyr: 4.646 ± 0.556
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.056 ± 0.669
MetCys: 0.845 ± 0.201
MetAsp: 0.211 ± 0.344
MetGlu: 1.056 ± 0.536
MetPhe: 0.422 ± 0.263
MetGly: 1.69 ± 0.858
MetHis: 0.422 ± 0.214
MetIle: 0.634 ± 0.208
MetLys: 1.056 ± 0.708
MetLeu: 1.69 ± 0.509
MetMet: 0.211 ± 0.107
MetAsn: 0.422 ± 0.706
MetPro: 1.056 ± 0.916
MetGln: 0.0 ± 0.0
MetArg: 0.634 ± 0.363
MetSer: 1.267 ± 0.322
MetThr: 0.845 ± 0.445
MetVal: 1.267 ± 0.643
MetTrp: 0.211 ± 0.107
MetTyr: 0.845 ± 0.526
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.478 ± 0.49
AsnCys: 1.478 ± 0.412
AsnAsp: 1.056 ± 0.246
AsnGlu: 1.69 ± 0.403
AsnPhe: 2.957 ± 0.505
AsnGly: 3.801 ± 1.105
AsnHis: 1.056 ± 0.463
AsnIle: 3.801 ± 1.255
AsnLys: 3.168 ± 0.676
AsnLeu: 4.646 ± 1.121
AsnMet: 0.634 ± 0.208
AsnAsn: 3.168 ± 1.143
AsnPro: 1.901 ± 0.777
AsnGln: 1.901 ± 0.641
AsnArg: 1.69 ± 1.004
AsnSer: 3.379 ± 0.82
AsnThr: 2.957 ± 1.827
AsnVal: 4.435 ± 0.51
AsnTrp: 0.211 ± 0.107
AsnTyr: 1.478 ± 0.75
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.534 ± 0.942
ProCys: 0.845 ± 0.429
ProAsp: 2.112 ± 0.621
ProGlu: 4.013 ± 2.037
ProPhe: 2.957 ± 0.675
ProGly: 4.857 ± 0.976
ProHis: 1.056 ± 0.536
ProIle: 3.59 ± 0.694
ProLys: 2.957 ± 1.197
ProLeu: 5.069 ± 1.288
ProMet: 0.845 ± 0.429
ProAsn: 1.478 ± 0.412
ProPro: 3.379 ± 0.682
ProGln: 1.478 ± 0.595
ProArg: 3.168 ± 2.187
ProSer: 5.069 ± 1.802
ProThr: 5.069 ± 1.82
ProVal: 3.379 ± 1.419
ProTrp: 0.634 ± 0.322
ProTyr: 1.478 ± 1.127
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.056 ± 1.812
GlnCys: 1.478 ± 0.723
GlnAsp: 0.845 ± 0.429
GlnGlu: 2.112 ± 0.621
GlnPhe: 1.267 ± 0.79
GlnGly: 1.901 ± 0.641
GlnHis: 0.634 ± 0.208
GlnIle: 1.056 ± 0.246
GlnLys: 2.323 ± 0.805
GlnLeu: 1.267 ± 0.643
GlnMet: 0.0 ± 0.0
GlnAsn: 0.845 ± 0.896
GlnPro: 1.056 ± 1.812
GlnGln: 1.267 ± 0.835
GlnArg: 1.69 ± 0.584
GlnSer: 1.478 ± 0.412
GlnThr: 1.056 ± 0.944
GlnVal: 2.323 ± 2.156
GlnTrp: 0.422 ± 0.214
GlnTyr: 0.845 ± 0.201
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.59 ± 1.627
ArgCys: 0.845 ± 0.429
ArgAsp: 1.69 ± 1.432
ArgGlu: 1.69 ± 0.403
ArgPhe: 3.168 ± 1.061
ArgGly: 2.323 ± 0.787
ArgHis: 1.478 ± 0.621
ArgIle: 2.323 ± 0.572
ArgLys: 0.0 ± 0.0
ArgLeu: 4.013 ± 0.9
ArgMet: 0.211 ± 0.107
ArgAsn: 2.746 ± 1.639
ArgPro: 2.746 ± 0.79
ArgGln: 0.845 ± 0.727
ArgArg: 2.534 ± 1.448
ArgSer: 4.224 ± 3.155
ArgThr: 2.957 ± 0.563
ArgVal: 3.59 ± 1.117
ArgTrp: 0.211 ± 0.107
ArgTyr: 2.323 ± 0.755
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.379 ± 1.258
SerCys: 5.491 ± 1.625
SerAsp: 5.28 ± 1.899
SerGlu: 4.646 ± 0.569
SerPhe: 4.646 ± 1.629
SerGly: 6.547 ± 1.101
SerHis: 2.534 ± 1.715
SerIle: 5.069 ± 1.667
SerLys: 4.435 ± 1.195
SerLeu: 8.448 ± 2.836
SerMet: 1.478 ± 0.604
SerAsn: 3.168 ± 0.676
SerPro: 5.28 ± 1.216
SerGln: 2.534 ± 0.725
SerArg: 3.168 ± 1.323
SerSer: 9.926 ± 0.947
SerThr: 5.702 ± 1.506
SerVal: 6.125 ± 0.76
SerTrp: 1.69 ± 0.674
SerTyr: 4.224 ± 1.464
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.957 ± 0.865
ThrCys: 2.534 ± 0.295
ThrAsp: 2.746 ± 0.969
ThrGlu: 3.59 ± 0.81
ThrPhe: 2.957 ± 0.677
ThrGly: 3.379 ± 0.928
ThrHis: 1.267 ± 0.417
ThrIle: 3.59 ± 0.514
ThrLys: 2.957 ± 0.866
ThrLeu: 4.857 ± 2.814
ThrMet: 1.056 ± 0.66
ThrAsn: 4.224 ± 1.766
ThrPro: 6.125 ± 1.192
ThrGln: 2.323 ± 0.874
ThrArg: 4.013 ± 0.973
ThrSer: 7.181 ± 1.35
ThrThr: 3.59 ± 0.833
ThrVal: 4.435 ± 1.187
ThrTrp: 1.056 ± 0.463
ThrTyr: 2.534 ± 0.644
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.168 ± 0.684
ValCys: 2.534 ± 0.295
ValAsp: 3.801 ± 0.604
ValGlu: 1.69 ± 0.703
ValPhe: 4.013 ± 1.578
ValGly: 5.28 ± 1.659
ValHis: 1.267 ± 0.624
ValIle: 7.181 ± 1.317
ValLys: 2.957 ± 0.865
ValLeu: 7.603 ± 2.671
ValMet: 1.056 ± 0.536
ValAsn: 3.59 ± 0.994
ValPro: 2.323 ± 0.648
ValGln: 1.69 ± 1.318
ValArg: 3.59 ± 0.603
ValSer: 7.814 ± 1.099
ValThr: 6.547 ± 1.484
ValVal: 8.659 ± 1.367
ValTrp: 0.845 ± 0.429
ValTyr: 3.59 ± 1.616
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.211 ± 0.107
TrpAsp: 0.634 ± 0.322
TrpGlu: 0.845 ± 0.353
TrpPhe: 0.634 ± 0.208
TrpGly: 0.634 ± 0.322
TrpHis: 0.211 ± 0.107
TrpIle: 0.422 ± 0.214
TrpLys: 0.634 ± 0.208
TrpLeu: 0.845 ± 0.201
TrpMet: 0.422 ± 0.634
TrpAsn: 0.211 ± 0.344
TrpPro: 0.845 ± 0.429
TrpGln: 0.634 ± 0.848
TrpArg: 0.422 ± 0.263
TrpSer: 1.056 ± 0.536
TrpThr: 1.267 ± 0.417
TrpVal: 0.634 ± 0.322
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.267 ± 0.292
TyrCys: 2.957 ± 0.58
TyrAsp: 1.478 ± 0.49
TyrGlu: 1.69 ± 0.667
TyrPhe: 1.901 ± 0.609
TyrGly: 2.534 ± 0.961
TyrHis: 1.056 ± 0.246
TyrIle: 2.112 ± 0.592
TyrLys: 2.323 ± 1.248
TyrLeu: 3.801 ± 0.877
TyrMet: 1.056 ± 0.864
TyrAsn: 2.534 ± 1.739
TyrPro: 1.901 ± 0.597
TyrGln: 1.056 ± 0.361
TyrArg: 1.056 ± 0.246
TyrSer: 5.069 ± 0.692
TyrThr: 1.901 ± 0.437
TyrVal: 2.746 ± 0.674
TyrTrp: 0.422 ± 0.214
TyrTyr: 2.112 ± 0.492
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (4736 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
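A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is taken as the error. This is only an illustration under stated assumptions. The function names, the fixed random seed, the use of the sample standard deviation as the error measure, and the toy sequences are not taken from the database.

```python
import random
import statistics
from collections import Counter

def pair_freqs(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs, per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, not single residues.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_freqs(resampled).get(pair, 0.0))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(samples)

# Made-up toy proteome, only to show the call.
proteins = ["MALLS", "ALSAL", "LLLLL"]
print(bootstrap_error(proteins, "LL"))
```

Because the resampling unit is the protein, a proteome with only a few proteins (5 here) can show errors that are large relative to the values themselves.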

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski