Amino acid dipepetide frequency for Hainan oligodon formosanus arterivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.746AlaAla: 7.746 ± 2.82
1.834AlaCys: 1.834 ± 0.834
3.057AlaAsp: 3.057 ± 1.606
2.854AlaGlu: 2.854 ± 1.499
3.261AlaPhe: 3.261 ± 0.936
4.892AlaGly: 4.892 ± 0.735
1.834AlaHis: 1.834 ± 0.879
4.892AlaIle: 4.892 ± 1.118
3.465AlaLys: 3.465 ± 1.407
5.096AlaLeu: 5.096 ± 1.331
1.834AlaMet: 1.834 ± 0.566
2.854AlaAsn: 2.854 ± 1.54
3.669AlaPro: 3.669 ± 1.97
2.446AlaGln: 2.446 ± 0.827
3.669AlaArg: 3.669 ± 0.717
6.726AlaSer: 6.726 ± 0.972
5.096AlaThr: 5.096 ± 2.47
4.688AlaVal: 4.688 ± 1.016
0.204AlaTrp: 0.204 ± 0.647
2.242AlaTyr: 2.242 ± 1.924
0.0AlaXaa: 0.0 ± 0.0
Cys
3.465CysAla: 3.465 ± 1.157
1.631CysCys: 1.631 ± 0.664
1.427CysAsp: 1.427 ± 0.749
0.815CysGlu: 0.815 ± 0.449
1.631CysPhe: 1.631 ± 1.348
2.65CysGly: 2.65 ± 0.39
1.631CysHis: 1.631 ± 0.664
1.631CysIle: 1.631 ± 0.856
1.019CysLys: 1.019 ± 0.498
4.077CysLeu: 4.077 ± 1.216
0.815CysMet: 0.815 ± 0.548
1.223CysAsn: 1.223 ± 1.179
1.834CysPro: 1.834 ± 0.634
0.408CysGln: 0.408 ± 0.547
2.242CysArg: 2.242 ± 1.662
1.834CysSer: 1.834 ± 0.567
2.038CysThr: 2.038 ± 1.337
1.631CysVal: 1.631 ± 1.777
0.0CysTrp: 0.0 ± 0.0
1.019CysTyr: 1.019 ± 0.535
0.0CysXaa: 0.0 ± 0.0
Asp
3.057AspAla: 3.057 ± 0.856
1.223AspCys: 1.223 ± 0.642
2.65AspAsp: 2.65 ± 1.197
1.631AspGlu: 1.631 ± 0.554
1.631AspPhe: 1.631 ± 0.856
3.261AspGly: 3.261 ± 1.304
1.631AspHis: 1.631 ± 0.572
1.834AspIle: 1.834 ± 0.964
1.834AspLys: 1.834 ± 0.985
5.3AspLeu: 5.3 ± 1.51
1.631AspMet: 1.631 ± 1.462
2.038AspAsn: 2.038 ± 0.812
3.873AspPro: 3.873 ± 1.327
2.446AspGln: 2.446 ± 0.868
1.631AspArg: 1.631 ± 0.636
3.057AspSer: 3.057 ± 0.92
2.65AspThr: 2.65 ± 0.39
3.669AspVal: 3.669 ± 1.461
0.204AspTrp: 0.204 ± 0.107
1.631AspTyr: 1.631 ± 0.632
0.0AspXaa: 0.0 ± 0.0
Glu
2.854GluAla: 2.854 ± 0.915
0.611GluCys: 0.611 ± 0.321
3.873GluAsp: 3.873 ± 1.655
3.057GluGlu: 3.057 ± 1.159
1.834GluPhe: 1.834 ± 0.879
2.854GluGly: 2.854 ± 1.155
2.446GluHis: 2.446 ± 0.906
2.854GluIle: 2.854 ± 1.499
2.854GluLys: 2.854 ± 0.758
2.65GluLeu: 2.65 ± 0.783
1.019GluMet: 1.019 ± 0.535
2.65GluAsn: 2.65 ± 0.728
2.446GluPro: 2.446 ± 0.799
1.834GluGln: 1.834 ± 0.964
1.834GluArg: 1.834 ± 0.591
3.057GluSer: 3.057 ± 0.587
2.446GluThr: 2.446 ± 0.908
3.057GluVal: 3.057 ± 1.159
0.611GluTrp: 0.611 ± 0.321
1.019GluTyr: 1.019 ± 0.404
0.0GluXaa: 0.0 ± 0.0
Phe
5.3PheAla: 5.3 ± 1.9
1.834PheCys: 1.834 ± 0.614
1.834PheAsp: 1.834 ± 0.964
1.834PheGlu: 1.834 ± 0.634
3.873PhePhe: 3.873 ± 0.939
5.096PheGly: 5.096 ± 0.658
1.019PheHis: 1.019 ± 1.262
2.242PheIle: 2.242 ± 0.892
2.854PheLys: 2.854 ± 0.797
5.3PheLeu: 5.3 ± 1.037
1.019PheMet: 1.019 ± 0.808
3.057PheAsn: 3.057 ± 0.919
1.834PhePro: 1.834 ± 1.095
1.223PheGln: 1.223 ± 0.642
1.631PheArg: 1.631 ± 0.554
3.465PheSer: 3.465 ± 0.82
4.892PheThr: 4.892 ± 0.971
2.446PheVal: 2.446 ± 0.871
0.408PheTrp: 0.408 ± 0.214
1.834PheTyr: 1.834 ± 0.732
0.0PheXaa: 0.0 ± 0.0
Gly
2.65GlyAla: 2.65 ± 0.579
2.242GlyCys: 2.242 ± 1.628
3.057GlyAsp: 3.057 ± 0.856
2.242GlyGlu: 2.242 ± 0.811
5.707GlyPhe: 5.707 ± 1.203
3.669GlyGly: 3.669 ± 1.002
1.427GlyHis: 1.427 ± 0.829
3.669GlyIle: 3.669 ± 0.575
4.077GlyLys: 4.077 ± 1.417
5.3GlyLeu: 5.3 ± 1.953
1.427GlyMet: 1.427 ± 1.331
1.427GlyAsn: 1.427 ± 0.585
3.465GlyPro: 3.465 ± 1.147
1.427GlyGln: 1.427 ± 0.608
3.057GlyArg: 3.057 ± 1.253
4.892GlySer: 4.892 ± 0.732
3.261GlyThr: 3.261 ± 1.738
4.892GlyVal: 4.892 ± 2.083
1.019GlyTrp: 1.019 ± 0.552
2.65GlyTyr: 2.65 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
1.427HisAla: 1.427 ± 0.815
1.631HisCys: 1.631 ± 0.856
1.427HisAsp: 1.427 ± 0.596
1.223HisGlu: 1.223 ± 1.328
2.446HisPhe: 2.446 ± 0.761
1.631HisGly: 1.631 ± 1.148
0.815HisHis: 0.815 ± 0.949
2.65HisIle: 2.65 ± 1.624
1.223HisLys: 1.223 ± 0.911
4.077HisLeu: 4.077 ± 1.4
0.204HisMet: 0.204 ± 0.107
1.834HisAsn: 1.834 ± 0.959
1.834HisPro: 1.834 ± 1.134
0.408HisGln: 0.408 ± 0.474
1.427HisArg: 1.427 ± 1.051
2.038HisSer: 2.038 ± 0.812
2.038HisThr: 2.038 ± 0.873
0.611HisVal: 0.611 ± 0.485
0.611HisTrp: 0.611 ± 0.781
1.223HisTyr: 1.223 ± 0.45
0.0HisXaa: 0.0 ± 0.0
Ile
3.261IleAla: 3.261 ± 0.931
3.261IleCys: 3.261 ± 1.075
2.446IleAsp: 2.446 ± 0.868
2.65IleGlu: 2.65 ± 0.438
2.242IlePhe: 2.242 ± 0.615
1.834IleGly: 1.834 ± 0.732
2.038IleHis: 2.038 ± 0.873
4.688IleIle: 4.688 ± 1.344
2.446IleLys: 2.446 ± 0.73
5.503IleLeu: 5.503 ± 0.933
1.427IleMet: 1.427 ± 0.574
1.834IleAsn: 1.834 ± 1.085
3.669IlePro: 3.669 ± 0.969
1.834IleGln: 1.834 ± 0.567
2.65IleArg: 2.65 ± 0.976
5.503IleSer: 5.503 ± 2.521
4.688IleThr: 4.688 ± 1.413
4.077IleVal: 4.077 ± 0.97
0.611IleTrp: 0.611 ± 0.321
2.854IleTyr: 2.854 ± 1.419
0.0IleXaa: 0.0 ± 0.0
Lys
4.688LysAla: 4.688 ± 1.979
1.834LysCys: 1.834 ± 1.414
2.446LysAsp: 2.446 ± 0.979
3.057LysGlu: 3.057 ± 1.159
1.834LysPhe: 1.834 ± 0.567
2.446LysGly: 2.446 ± 0.613
1.834LysHis: 1.834 ± 0.868
4.484LysIle: 4.484 ± 1.926
4.484LysLys: 4.484 ± 1.527
4.28LysLeu: 4.28 ± 0.969
0.815LysMet: 0.815 ± 0.833
2.446LysAsn: 2.446 ± 1.285
3.669LysPro: 3.669 ± 0.897
1.427LysGln: 1.427 ± 0.486
1.834LysArg: 1.834 ± 0.708
3.873LysSer: 3.873 ± 0.721
3.057LysThr: 3.057 ± 0.654
3.057LysVal: 3.057 ± 0.876
0.815LysTrp: 0.815 ± 0.449
2.446LysTyr: 2.446 ± 0.664
0.0LysXaa: 0.0 ± 0.0
Leu
8.357LeuAla: 8.357 ± 0.555
3.261LeuCys: 3.261 ± 0.625
3.669LeuAsp: 3.669 ± 1.009
4.892LeuGlu: 4.892 ± 1.016
3.465LeuPhe: 3.465 ± 1.023
4.077LeuGly: 4.077 ± 1.521
1.834LeuHis: 1.834 ± 2.06
5.707LeuIle: 5.707 ± 1.59
4.28LeuLys: 4.28 ± 0.867
11.415LeuLeu: 11.415 ± 4.392
1.631LeuMet: 1.631 ± 0.572
4.28LeuAsn: 4.28 ± 1.665
6.115LeuPro: 6.115 ± 1.871
3.465LeuGln: 3.465 ± 1.069
4.688LeuArg: 4.688 ± 1.84
7.542LeuSer: 7.542 ± 1.407
6.115LeuThr: 6.115 ± 1.061
4.892LeuVal: 4.892 ± 1.158
1.019LeuTrp: 1.019 ± 0.635
1.834LeuTyr: 1.834 ± 1.446
0.0LeuXaa: 0.0 ± 0.0
Met
2.242MetAla: 2.242 ± 0.744
0.204MetCys: 0.204 ± 0.647
1.427MetAsp: 1.427 ± 0.749
1.019MetGlu: 1.019 ± 0.542
0.611MetPhe: 0.611 ± 0.557
1.019MetGly: 1.019 ± 1.424
0.0MetHis: 0.0 ± 0.0
1.223MetIle: 1.223 ± 0.911
0.815MetLys: 0.815 ± 0.428
2.038MetLeu: 2.038 ± 1.203
0.408MetMet: 0.408 ± 0.594
1.223MetAsn: 1.223 ± 1.364
1.019MetPro: 1.019 ± 1.282
0.815MetGln: 0.815 ± 0.428
0.408MetArg: 0.408 ± 0.214
1.019MetSer: 1.019 ± 0.535
1.223MetThr: 1.223 ± 0.561
1.427MetVal: 1.427 ± 1.761
0.0MetTrp: 0.0 ± 0.0
0.815MetTyr: 0.815 ± 0.548
0.0MetXaa: 0.0 ± 0.0
Asn
3.057AsnAla: 3.057 ± 1.589
1.427AsnCys: 1.427 ± 1.431
1.631AsnAsp: 1.631 ± 0.836
1.019AsnGlu: 1.019 ± 0.535
3.057AsnPhe: 3.057 ± 1.906
3.669AsnGly: 3.669 ± 0.809
2.038AsnHis: 2.038 ± 2.131
2.65AsnIle: 2.65 ± 1.544
2.038AsnLys: 2.038 ± 0.774
4.688AsnLeu: 4.688 ± 1.18
0.815AsnMet: 0.815 ± 0.428
1.427AsnAsn: 1.427 ± 1.1
1.834AsnPro: 1.834 ± 0.759
2.854AsnGln: 2.854 ± 1.482
1.223AsnArg: 1.223 ± 0.642
3.465AsnSer: 3.465 ± 1.601
2.65AsnThr: 2.65 ± 1.255
2.038AsnVal: 2.038 ± 0.517
0.611AsnTrp: 0.611 ± 0.426
1.019AsnTyr: 1.019 ± 0.542
0.0AsnXaa: 0.0 ± 0.0
Pro
4.688ProAla: 4.688 ± 0.95
0.408ProCys: 0.408 ± 0.214
1.631ProAsp: 1.631 ± 0.674
3.669ProGlu: 3.669 ± 1.417
2.65ProPhe: 2.65 ± 1.392
4.077ProGly: 4.077 ± 1.667
1.223ProHis: 1.223 ± 0.566
2.446ProIle: 2.446 ± 0.72
4.484ProLys: 4.484 ± 1.465
3.873ProLeu: 3.873 ± 0.966
0.611ProMet: 0.611 ± 0.567
2.446ProAsn: 2.446 ± 1.0
1.834ProPro: 1.834 ± 0.964
1.631ProGln: 1.631 ± 0.74
2.65ProArg: 2.65 ± 1.025
7.134ProSer: 7.134 ± 1.133
3.057ProThr: 3.057 ± 1.073
5.3ProVal: 5.3 ± 0.864
0.0ProTrp: 0.0 ± 0.0
1.019ProTyr: 1.019 ± 0.535
0.0ProXaa: 0.0 ± 0.0
Gln
1.019GlnAla: 1.019 ± 0.895
0.815GlnCys: 0.815 ± 0.492
1.631GlnAsp: 1.631 ± 0.856
3.057GlnGlu: 3.057 ± 0.856
3.057GlnPhe: 3.057 ± 1.159
2.65GlnGly: 2.65 ± 0.681
1.223GlnHis: 1.223 ± 0.527
1.631GlnIle: 1.631 ± 1.026
2.038GlnLys: 2.038 ± 0.812
3.261GlnLeu: 3.261 ± 2.46
0.611GlnMet: 0.611 ± 0.41
1.019GlnAsn: 1.019 ± 0.404
2.242GlnPro: 2.242 ± 1.022
1.223GlnGln: 1.223 ± 1.413
2.65GlnArg: 2.65 ± 0.971
1.834GlnSer: 1.834 ± 0.497
2.242GlnThr: 2.242 ± 1.272
1.631GlnVal: 1.631 ± 0.664
0.408GlnTrp: 0.408 ± 0.214
1.223GlnTyr: 1.223 ± 1.065
0.0GlnXaa: 0.0 ± 0.0
Arg
3.057ArgAla: 3.057 ± 1.27
1.631ArgCys: 1.631 ± 0.664
2.242ArgAsp: 2.242 ± 1.197
2.242ArgGlu: 2.242 ± 1.178
1.834ArgPhe: 1.834 ± 0.964
3.057ArgGly: 3.057 ± 0.984
1.427ArgHis: 1.427 ± 0.749
2.446ArgIle: 2.446 ± 1.617
2.854ArgLys: 2.854 ± 1.499
2.242ArgLeu: 2.242 ± 0.831
0.815ArgMet: 0.815 ± 0.449
2.038ArgAsn: 2.038 ± 1.332
2.65ArgPro: 2.65 ± 0.853
3.057ArgGln: 3.057 ± 1.732
3.261ArgArg: 3.261 ± 2.078
3.057ArgSer: 3.057 ± 1.057
2.854ArgThr: 2.854 ± 1.454
2.854ArgVal: 2.854 ± 1.499
0.408ArgTrp: 0.408 ± 0.54
0.815ArgTyr: 0.815 ± 0.449
0.0ArgXaa: 0.0 ± 0.0
Ser
3.669SerAla: 3.669 ± 0.799
2.65SerCys: 2.65 ± 0.976
4.688SerAsp: 4.688 ± 1.789
3.873SerGlu: 3.873 ± 0.721
4.077SerPhe: 4.077 ± 1.735
4.688SerGly: 4.688 ± 1.295
3.057SerHis: 3.057 ± 1.481
5.3SerIle: 5.3 ± 1.765
4.484SerLys: 4.484 ± 1.964
7.746SerLeu: 7.746 ± 1.496
1.427SerMet: 1.427 ± 1.043
4.28SerAsn: 4.28 ± 0.851
3.465SerPro: 3.465 ± 1.329
2.242SerGln: 2.242 ± 1.676
2.446SerArg: 2.446 ± 0.944
6.115SerSer: 6.115 ± 0.572
6.93SerThr: 6.93 ± 1.518
4.28SerVal: 4.28 ± 1.139
0.611SerTrp: 0.611 ± 0.485
1.631SerTyr: 1.631 ± 0.987
0.0SerXaa: 0.0 ± 0.0
Thr
5.911ThrAla: 5.911 ± 1.108
2.446ThrCys: 2.446 ± 0.872
2.854ThrAsp: 2.854 ± 0.627
2.446ThrGlu: 2.446 ± 0.87
4.28ThrPhe: 4.28 ± 1.079
3.873ThrGly: 3.873 ± 1.03
2.446ThrHis: 2.446 ± 3.037
3.261ThrIle: 3.261 ± 1.124
3.669ThrLys: 3.669 ± 1.068
5.503ThrLeu: 5.503 ± 2.314
1.427ThrMet: 1.427 ± 1.694
2.038ThrAsn: 2.038 ± 0.517
4.688ThrPro: 4.688 ± 1.216
2.854ThrGln: 2.854 ± 0.998
2.854ThrArg: 2.854 ± 0.627
6.319ThrSer: 6.319 ± 2.228
5.911ThrThr: 5.911 ± 2.789
3.669ThrVal: 3.669 ± 1.06
0.611ThrTrp: 0.611 ± 0.485
2.446ThrTyr: 2.446 ± 1.207
0.0ThrXaa: 0.0 ± 0.0
Val
3.465ValAla: 3.465 ± 1.345
2.854ValCys: 2.854 ± 0.808
4.077ValAsp: 4.077 ± 0.81
2.038ValGlu: 2.038 ± 1.071
3.057ValPhe: 3.057 ± 1.262
4.28ValGly: 4.28 ± 1.317
1.427ValHis: 1.427 ± 1.208
3.057ValIle: 3.057 ± 0.782
3.873ValLys: 3.873 ± 0.764
5.707ValLeu: 5.707 ± 2.28
0.611ValMet: 0.611 ± 0.321
2.446ValAsn: 2.446 ± 1.285
2.65ValPro: 2.65 ± 1.392
2.038ValGln: 2.038 ± 1.164
2.038ValArg: 2.038 ± 0.747
3.873ValSer: 3.873 ± 0.875
5.3ValThr: 5.3 ± 1.953
6.523ValVal: 6.523 ± 0.621
1.223ValTrp: 1.223 ± 0.576
2.65ValTyr: 2.65 ± 0.71
0.0ValXaa: 0.0 ± 0.0
Trp
1.019TrpAla: 1.019 ± 0.535
0.204TrpCys: 0.204 ± 0.107
0.0TrpAsp: 0.0 ± 0.0
0.611TrpGlu: 0.611 ± 0.509
0.408TrpPhe: 0.408 ± 0.601
0.204TrpGly: 0.204 ± 0.107
0.611TrpHis: 0.611 ± 0.321
0.611TrpIle: 0.611 ± 0.485
0.815TrpLys: 0.815 ± 0.401
0.611TrpLeu: 0.611 ± 0.426
0.204TrpMet: 0.204 ± 0.107
0.408TrpAsn: 0.408 ± 0.214
0.815TrpPro: 0.815 ± 0.548
0.0TrpGln: 0.0 ± 0.0
0.815TrpArg: 0.815 ± 0.778
0.408TrpSer: 0.408 ± 0.214
0.611TrpThr: 0.611 ± 0.321
0.408TrpVal: 0.408 ± 0.54
0.0TrpTrp: 0.0 ± 0.0
0.815TrpTyr: 0.815 ± 1.348
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.427TyrAla: 1.427 ± 0.486
1.019TyrCys: 1.019 ± 0.635
0.815TyrAsp: 0.815 ± 0.761
1.631TyrGlu: 1.631 ± 1.395
2.038TyrPhe: 2.038 ± 0.789
1.631TyrGly: 1.631 ± 0.664
0.815TyrHis: 0.815 ± 0.539
2.242TyrIle: 2.242 ± 0.955
1.223TyrLys: 1.223 ± 0.566
3.873TyrLeu: 3.873 ± 0.987
0.0TyrMet: 0.0 ± 0.0
2.242TyrAsn: 2.242 ± 1.073
1.223TyrPro: 1.223 ± 0.527
2.038TyrGln: 2.038 ± 1.649
1.834TyrArg: 1.834 ± 0.868
2.446TyrSer: 2.446 ± 1.053
2.446TyrThr: 2.446 ± 1.742
2.038TyrVal: 2.038 ± 0.581
0.204TyrTrp: 0.204 ± 0.107
1.223TyrTyr: 1.223 ± 0.45
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4907 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski