Amino acid dipeptide frequency for Beihai hermit crab virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
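The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code, pairs are assumed to be counted within each protein only, and any letter outside the 20 standard residues is assumed to map to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # 20 standard residues, one-letter code
EXPECTED_PERMILLE = 1000.0 / 400       # 2.5 per mille if all 400 pairs were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue is treated as Xaa (X).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"       # would be marked red
    if freq_permille < expected:
        return "under"      # would be marked blue
    return "expected"


if __name__ == "__main__":
    toy = ["AAAL", "ALA"]   # made-up sequences, not taken from this proteome
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With values from the table, `classify(9.201)` (AlaAla) gives "over" and `classify(0.263)` (CysGlu) gives "under".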

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
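As a small worked example of this unit conversion, the sketch below turns a per mille value from the table into a plain fraction and compares it with the uniform expectation of 1/400. The helper names `permille_to_fraction` and `fold_vs_uniform` are hypothetical and only serve the illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille, n_dipeptides=400):
    """Ratio of the observed fraction to the uniform expectation 1/n_dipeptides."""
    return permille_to_fraction(value_permille) * n_dipeptides


if __name__ == "__main__":
    ala_ala = 9.201  # AlaAla value from the table, in per mille
    print(f"{permille_to_fraction(ala_ala):.6f}")  # prints 0.009201
    print(f"{fold_vs_uniform(ala_ala):.2f}")       # prints 3.68
```

A ratio above 1 corresponds to a dipeptide that is more frequent than the 2.5‰ expected under the uniform assumption, and a ratio below 1 to an underrepresented one.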

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid of the dipeptide; columns: second amino acid. Each cell gives frequency ± error in ‰.

     Ala          Cys          Asp          Glu          Phe          Gly          His          Ile          Lys          Leu          Met          Asn          Pro          Gln          Arg          Ser          Thr          Val          Trp          Tyr          Xaa
Ala  9.201±5.032  0.789±0.3    4.732±1.026  3.417±1.0    2.892±1.954  4.469±2.039  2.629±1.115  2.366±0.413  2.103±1.016  9.989±0.624  0.789±0.658  3.417±2.683  1.052±1.313  3.943±2.075  4.206±0.259  5.521±1.036  6.572±3.488  2.629±2.339  1.577±0.597  2.366±1.175  0.0±0.0
Cys  1.314±0.472  0.0±0.0      1.052±0.366  0.263±0.15   0.789±0.247  1.314±0.125  0.526±0.3    1.052±0.308  0.789±0.45   1.577±0.493  0.0±0.0      0.0±0.0      2.366±0.758  1.84±1.051   1.314±0.56   0.0±0.0      1.052±1.123  1.052±0.366  0.0±0.0      0.263±0.15   0.0±0.0
Asp  3.417±0.468  1.052±0.6    3.155±1.41   1.314±0.47   2.629±0.388  1.314±0.125  2.892±1.262  3.155±0.816  0.789±0.51   6.572±1.575  0.263±0.29   3.417±0.468  2.103±0.456  1.84±0.582   3.943±1.404  2.366±0.604  2.366±0.547  3.417±0.77   1.314±0.493  2.103±1.192  0.0±0.0
Glu  2.366±0.471  1.052±0.6    2.629±0.925  3.417±0.951  3.417±1.166  3.68±1.133   0.526±0.3    4.732±1.717  1.577±0.9    4.469±0.743  1.84±0.746   1.84±0.732   1.314±1.492  1.314±0.517  3.155±0.807  4.206±0.842  2.892±0.769  3.417±1.01   1.052±0.308  1.052±0.308  0.0±0.0
Phe  2.892±0.987  1.577±0.392  2.629±0.349  2.892±0.985  1.052±0.536  2.366±0.871  1.052±0.308  1.577±0.493  1.314±0.125  4.469±1.238  1.052±0.225  1.84±0.583   2.103±0.249  1.84±0.683   3.417±1.743  2.629±0.615  2.103±0.825  2.103±0.372  0.263±0.15   1.314±0.125  0.0±0.0
Gly  3.417±1.692  1.052±0.366  2.892±0.528  2.629±0.877  3.155±0.816  5.783±0.767  1.314±0.493  4.995±0.723  1.052±0.225  8.149±1.736  1.84±0.39    1.84±0.285   2.892±1.444  3.417±1.167  2.103±0.573  2.103±0.404  4.469±1.018  5.521±0.813  1.314±0.472  0.526±0.3    0.0±0.0
His  2.629±0.497  1.052±0.308  1.052±1.123  1.052±0.308  0.526±0.301  0.263±0.15   2.366±0.758  0.526±0.3    0.789±0.45   6.309±2.403  0.789±0.3    1.052±0.225  1.84±0.571   2.892±0.562  3.68±1.303   3.943±1.092  2.103±1.492  1.84±1.144   0.526±0.3    1.052±0.536  0.0±0.0
Ile  4.206±1.518  0.789±0.361  2.629±0.492  2.366±0.289  2.629±0.833  4.995±0.645  2.892±1.2    4.469±1.328  2.892±1.305  6.572±0.839  1.052±0.6    1.052±0.308  3.68±0.322   3.943±0.992  3.417±1.361  3.155±1.246  3.943±0.977  2.629±0.492  0.0±0.0      1.314±0.417  0.0±0.0
Lys  1.052±1.154  0.0±0.0      1.84±0.651   1.314±0.549  2.629±0.5    2.892±0.744  0.263±0.358  2.103±1.201  1.577±0.161  4.732±1.685  1.314±0.549  1.052±0.366  1.577±0.779  1.84±0.663   1.314±0.517  3.155±0.465  3.155±0.647  2.366±0.717  1.052±0.6    1.314±0.125  0.0±0.0
Leu  8.149±1.908  1.577±0.161  4.732±0.707  6.835±1.168  3.943±1.214  5.258±1.466  4.995±1.988  6.309±1.99   5.783±0.706  9.989±1.005  3.943±1.343  6.046±1.608  7.098±1.909  3.943±0.482  6.046±2.404  8.412±3.899  7.361±1.524  7.886±1.025  2.892±0.81   2.366±0.572  0.0±0.0
Met  1.84±1.024   0.263±0.358  1.84±0.732   1.052±0.6    0.789±0.247  1.84±0.732   0.526±0.3    2.366±1.019  0.526±0.3    2.629±0.719  1.052±0.308  1.577±0.722  0.263±0.37   0.263±0.15   1.314±0.668  3.68±0.766   2.103±0.449  1.84±0.732   0.263±0.15   0.263±0.37   0.0±0.0
Asn  2.366±1.107  0.526±0.3    2.103±0.872  1.314±0.75   1.577±0.597  1.052±0.603  2.103±0.851  2.892±0.246  1.052±0.366  5.783±0.809  1.052±0.474  1.314±0.75   3.155±0.495  1.314±0.472  2.103±0.615  2.366±0.74   2.103±0.743  1.84±0.645   1.052±0.225  1.84±0.292   0.0±0.0
Pro  4.206±1.074  1.052±0.6    2.629±0.925  3.68±1.687   2.366±0.799  3.155±0.474  2.629±0.58   2.629±0.986  2.366±1.529  6.046±2.007  0.789±0.45   2.103±0.426  2.892±0.927  2.103±0.683  2.103±0.634  2.629±1.046  3.417±0.801  2.629±0.57   0.789±0.615  1.314±0.56   0.0±0.0
Gln  4.732±1.851  1.052±0.225  1.577±0.599  2.103±0.404  2.103±0.615  2.629±0.719  1.314±0.56   1.577±0.508  2.103±0.449  3.943±0.834  1.314±0.493  2.629±0.89   1.84±0.583   2.366±0.717  2.629±0.492  2.892±1.355  3.155±0.361  3.68±1.09    1.052±0.366  2.103±0.872  0.0±0.0
Arg  4.469±1.422  0.263±0.15   4.206±0.362  3.155±0.816  1.052±0.6    4.469±0.691  1.314±0.125  5.521±0.502  2.366±0.604  6.309±1.988  1.577±0.545  2.103±0.825  2.103±0.634  1.84±0.387   4.469±1.669  2.892±0.675  3.155±0.647  2.892±0.48   1.314±0.865  2.366±0.831  0.0±0.0
Ser  3.68±1.252   2.103±0.683  2.366±1.411  3.68±1.756   2.892±0.927  4.469±1.263  2.629±1.71   2.366±0.987  2.629±0.492  8.412±1.307  0.789±0.355  2.629±0.58   6.835±1.387  4.732±1.376  2.892±0.528  6.309±1.912  3.68±0.841   4.732±2.804  1.314±0.417  2.892±0.84   0.0±0.0
Thr  5.783±2.141  0.789±0.794  2.103±0.426  2.892±0.959  2.892±1.64   4.206±1.076  3.155±0.311  3.68±1.164   1.052±0.68   7.361±2.646  1.577±0.392  1.84±1.253   3.417±0.468  2.629±0.58   2.629±0.925  4.995±1.262  1.84±0.292   4.206±1.527  2.103±0.825  2.366±0.572  0.0±0.0
Val  6.046±2.096  0.263±0.358  3.155±0.911  4.206±0.508  1.84±1.144   4.732±1.207  1.577±0.722  3.155±0.986  4.206±1.076  5.783±0.959  1.577±0.597  1.314±0.472  2.366±0.899  1.84±0.285   3.417±0.838  7.361±1.556  2.892±1.13   2.892±0.246  0.263±0.15   1.314±0.549  0.0±0.0
Trp  1.577±0.161  0.789±0.247  1.052±0.225  0.789±0.45   0.0±0.0      0.789±0.615  0.0±0.0      1.577±0.545  1.052±0.6    1.84±0.537   2.103±0.872  1.314±0.417  0.789±0.247  1.052±0.6    0.526±0.268  0.789±0.361  1.052±0.308  2.103±0.743  0.263±0.15   0.0±0.0      0.0±0.0
Tyr  1.314±0.56   0.526±0.3    1.052±0.225  1.577±0.779  1.314±0.472  0.789±0.3    1.577±0.459  1.577±0.722  0.526±0.301  2.629±0.89   1.314±0.582  0.526±0.716  1.84±1.051   1.84±1.051   3.155±1.452  2.892±0.805  2.103±0.372  0.526±0.301  1.052±0.6    0.789±0.3    0.0±0.0
Xaa  0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0      0.0±0.0
Statistics based on 4 proteins (3805 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
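One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is an assumption-laden illustration, not the database's actual procedure: the function names, the use of the sample standard deviation as the error, and the fixed random seed are all choices made for this example.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs in all proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(resampled, pair))
    # Assumption: the reported error is the sample standard deviation of the estimates.
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["AALA", "LLLL", "ALAL", "AAAA"]  # made-up one-letter sequences
    print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 4 proteins in this proteome, each resample can differ strongly from the original set, which is consistent with the large errors seen for some cells in the table.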

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski