Amino acid dipeptide frequency for Beihai paphia shell virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random and with equal probability, each of the 400 would therefore appear with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
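As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal sketch under stated assumptions, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter sequence encoding, and the toy sequences are all hypothetical, and the exact normalisation used for the table is not stated in the source.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted inside each protein only, so a pair never
    spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code (not the viral proteome).
toy = ["MKAAL", "AALK"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation: 1000 / 400 = 2.5 per mille for each dipeptide.
uniform = 1000.0 / 400
```

On real proteomes the observed values scatter widely around the uniform expectation, which is what the colour coding in the table is meant to highlight.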

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain a fraction.
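The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and treating 2.5‰ (1000/400) as the threshold between over- and underrepresented dipeptides is an assumption based on the description above, not a documented rule of the database.

```python
import math


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def versus_uniform(value_per_mille, n_dipeptides=400):
    """Classify a frequency against the uniform expectation 1000 / n per mille."""
    expected = 1000.0 / n_dipeptides
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "equal"


# Values taken from the table: AlaAla = 5.885, AlaCys = 1.839 (per mille).
ala_ala_fraction = per_mille_to_fraction(5.885)
ala_ala_class = versus_uniform(5.885)
ala_cys_class = versus_uniform(1.839)
```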

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.885 ± 1.566
AlaCys: 1.839 ± 0.344
AlaAsp: 5.149 ± 0.852
AlaGlu: 5.149 ± 0.852
AlaPhe: 2.207 ± 1.08
AlaGly: 6.62 ± 0.095
AlaHis: 1.471 ± 0.72
AlaIle: 2.942 ± 0.227
AlaLys: 3.678 ± 0.979
AlaLeu: 6.252 ± 0.836
AlaMet: 1.103 ± 0.54
AlaAsn: 4.046 ± 2.466
AlaPro: 3.31 ± 1.715
AlaGln: 1.103 ± 0.016
AlaArg: 2.942 ± 0.783
AlaSer: 6.62 ± 0.651
AlaThr: 6.252 ± 1.386
AlaVal: 4.781 ± 0.117
AlaTrp: 1.839 ± 0.212
AlaTyr: 3.31 ± 0.508
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.839 ± 0.9
CysCys: 0.0 ± 0.0
CysAsp: 1.103 ± 0.016
CysGlu: 0.368 ± 0.18
CysPhe: 0.368 ± 0.376
CysGly: 0.736 ± 0.196
CysHis: 0.368 ± 0.18
CysIle: 0.368 ± 0.18
CysLys: 0.736 ± 0.196
CysLeu: 0.368 ± 0.18
CysMet: 0.368 ± 0.18
CysAsn: 0.368 ± 0.18
CysPro: 1.103 ± 0.54
CysGln: 0.368 ± 0.376
CysArg: 0.736 ± 0.196
CysSer: 1.471 ± 0.392
CysThr: 1.103 ± 0.016
CysVal: 1.103 ± 0.54
CysTrp: 1.471 ± 0.164
CysTyr: 0.736 ± 0.36
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.207 ± 0.524
AspCys: 1.103 ± 0.016
AspAsp: 4.781 ± 0.672
AspGlu: 3.678 ± 0.132
AspPhe: 5.149 ± 0.297
AspGly: 2.574 ± 0.704
AspHis: 0.736 ± 0.196
AspIle: 4.046 ± 0.312
AspLys: 4.781 ± 0.672
AspLeu: 4.413 ± 0.619
AspMet: 1.471 ± 0.164
AspAsn: 2.207 ± 0.587
AspPro: 3.678 ± 1.535
AspGln: 1.471 ± 0.164
AspArg: 3.31 ± 0.508
AspSer: 4.781 ± 0.439
AspThr: 4.413 ± 1.048
AspVal: 4.413 ± 0.492
AspTrp: 0.368 ± 0.18
AspTyr: 2.207 ± 1.08
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.413 ± 0.063
GluCys: 0.736 ± 0.36
GluAsp: 6.252 ± 0.275
GluGlu: 5.149 ± 1.408
GluPhe: 3.678 ± 1.244
GluGly: 1.839 ± 0.9
GluHis: 1.103 ± 0.54
GluIle: 3.678 ± 0.423
GluLys: 3.31 ± 0.603
GluLeu: 5.149 ± 0.297
GluMet: 0.736 ± 0.196
GluAsn: 2.942 ± 0.328
GluPro: 2.942 ± 0.227
GluGln: 2.942 ± 0.783
GluArg: 2.942 ± 0.884
GluSer: 5.885 ± 1.212
GluThr: 2.942 ± 0.227
GluVal: 6.252 ± 0.275
GluTrp: 1.103 ± 0.572
GluTyr: 1.471 ± 0.392
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.942 ± 0.328
PheCys: 0.0 ± 0.0
PheAsp: 2.942 ± 0.328
PheGlu: 5.885 ± 0.656
PhePhe: 3.31 ± 1.159
PheGly: 6.62 ± 0.095
PheHis: 1.471 ± 0.392
PheIle: 2.207 ± 0.032
PheLys: 3.31 ± 0.048
PheLeu: 4.413 ± 2.159
PheMet: 0.368 ± 0.18
PheAsn: 1.103 ± 0.016
PhePro: 1.103 ± 0.016
PheGln: 1.471 ± 0.392
PheArg: 1.103 ± 0.572
PheSer: 1.839 ± 0.767
PheThr: 3.678 ± 0.423
PheVal: 3.678 ± 0.688
PheTrp: 0.0 ± 0.0
PheTyr: 1.471 ± 0.392
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.413 ± 0.619
GlyCys: 1.103 ± 0.016
GlyAsp: 2.942 ± 0.328
GlyGlu: 5.885 ± 0.455
GlyPhe: 3.31 ± 0.603
GlyGly: 3.31 ± 0.508
GlyHis: 1.471 ± 0.164
GlyIle: 1.471 ± 0.164
GlyLys: 2.207 ± 1.08
GlyLeu: 3.31 ± 0.508
GlyMet: 1.471 ± 0.72
GlyAsn: 4.413 ± 0.619
GlyPro: 1.471 ± 0.164
GlyGln: 0.736 ± 0.196
GlyArg: 1.839 ± 0.212
GlySer: 4.413 ± 0.492
GlyThr: 1.839 ± 0.212
GlyVal: 6.252 ± 0.836
GlyTrp: 0.736 ± 0.196
GlyTyr: 3.31 ± 1.159
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.839 ± 0.212
HisCys: 1.103 ± 0.54
HisAsp: 0.368 ± 0.18
HisGlu: 1.839 ± 0.344
HisPhe: 0.368 ± 0.18
HisGly: 1.471 ± 0.72
HisHis: 0.368 ± 0.18
HisIle: 1.471 ± 0.392
HisLys: 0.736 ± 0.196
HisLeu: 1.103 ± 0.016
HisMet: 1.471 ± 0.164
HisAsn: 0.736 ± 0.36
HisPro: 1.471 ± 0.72
HisGln: 1.103 ± 0.54
HisArg: 1.471 ± 0.164
HisSer: 2.574 ± 0.148
HisThr: 1.839 ± 0.212
HisVal: 0.736 ± 0.36
HisTrp: 1.103 ± 0.016
HisTyr: 1.103 ± 0.016
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.678 ± 0.423
IleCys: 0.736 ± 0.36
IleAsp: 4.046 ± 0.312
IleGlu: 2.207 ± 0.032
IlePhe: 2.574 ± 0.148
IleGly: 1.471 ± 0.947
IleHis: 1.839 ± 0.344
IleIle: 2.207 ± 0.524
IleLys: 2.207 ± 0.524
IleLeu: 4.781 ± 0.439
IleMet: 2.207 ± 0.032
IleAsn: 3.31 ± 0.048
IlePro: 1.839 ± 0.212
IleGln: 2.574 ± 0.704
IleArg: 2.207 ± 1.08
IleSer: 3.678 ± 0.423
IleThr: 3.31 ± 0.048
IleVal: 5.149 ± 0.815
IleTrp: 0.0 ± 0.0
IleTyr: 0.736 ± 0.196
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.781 ± 1.784
LysCys: 0.368 ± 0.18
LysAsp: 2.942 ± 0.884
LysGlu: 4.781 ± 0.117
LysPhe: 2.942 ± 0.884
LysGly: 3.678 ± 0.688
LysHis: 2.207 ± 0.032
LysIle: 4.781 ± 0.439
LysLys: 6.62 ± 3.239
LysLeu: 4.046 ± 0.312
LysMet: 0.736 ± 0.752
LysAsn: 2.574 ± 0.704
LysPro: 4.781 ± 0.672
LysGln: 1.103 ± 0.016
LysArg: 3.31 ± 0.603
LysSer: 4.413 ± 2.159
LysThr: 3.31 ± 0.508
LysVal: 1.839 ± 0.9
LysTrp: 0.0 ± 0.0
LysTyr: 2.574 ± 0.148
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.356 ± 1.402
LeuCys: 2.207 ± 0.524
LeuAsp: 8.091 ± 0.487
LeuGlu: 4.781 ± 0.117
LeuPhe: 1.471 ± 0.72
LeuGly: 3.678 ± 0.688
LeuHis: 1.103 ± 0.572
LeuIle: 4.781 ± 0.672
LeuLys: 5.517 ± 0.477
LeuLeu: 5.149 ± 0.259
LeuMet: 2.574 ± 0.704
LeuAsn: 2.942 ± 0.884
LeuPro: 4.046 ± 0.799
LeuGln: 1.839 ± 0.344
LeuArg: 5.149 ± 0.815
LeuSer: 6.988 ± 0.471
LeuThr: 4.781 ± 0.672
LeuVal: 5.149 ± 1.408
LeuTrp: 0.368 ± 0.376
LeuTyr: 2.942 ± 0.328
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.839 ± 0.9
MetCys: 0.736 ± 0.196
MetAsp: 1.103 ± 0.54
MetGlu: 2.574 ± 0.704
MetPhe: 0.736 ± 0.196
MetGly: 1.471 ± 0.164
MetHis: 0.736 ± 0.36
MetIle: 1.103 ± 0.54
MetLys: 0.368 ± 0.376
MetLeu: 1.471 ± 0.164
MetMet: 0.368 ± 0.376
MetAsn: 1.103 ± 0.016
MetPro: 2.942 ± 1.895
MetGln: 0.736 ± 0.36
MetArg: 1.103 ± 0.54
MetSer: 0.736 ± 0.36
MetThr: 2.942 ± 1.895
MetVal: 0.368 ± 0.376
MetTrp: 0.368 ± 0.376
MetTyr: 0.736 ± 0.196
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.942 ± 0.783
AsnCys: 0.736 ± 0.196
AsnAsp: 2.942 ± 0.884
AsnGlu: 1.839 ± 0.767
AsnPhe: 2.574 ± 0.148
AsnGly: 3.31 ± 1.159
AsnHis: 0.368 ± 0.18
AsnIle: 2.574 ± 1.26
AsnLys: 2.574 ± 1.26
AsnLeu: 4.781 ± 1.551
AsnMet: 0.736 ± 0.676
AsnAsn: 4.046 ± 0.243
AsnPro: 4.046 ± 0.799
AsnGln: 0.736 ± 0.196
AsnArg: 0.736 ± 0.752
AsnSer: 1.839 ± 0.767
AsnThr: 2.574 ± 0.407
AsnVal: 3.31 ± 0.048
AsnTrp: 1.471 ± 0.164
AsnTyr: 3.31 ± 0.508
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.839 ± 1.323
ProCys: 0.0 ± 0.0
ProAsp: 2.207 ± 0.032
ProGlu: 2.574 ± 0.148
ProPhe: 2.574 ± 0.407
ProGly: 2.207 ± 0.587
ProHis: 2.574 ± 0.148
ProIle: 3.678 ± 0.979
ProLys: 2.942 ± 0.884
ProLeu: 5.885 ± 0.656
ProMet: 2.207 ± 0.423
ProAsn: 2.574 ± 0.148
ProPro: 1.471 ± 0.947
ProGln: 1.839 ± 1.323
ProArg: 1.471 ± 0.164
ProSer: 3.678 ± 0.132
ProThr: 3.31 ± 1.159
ProVal: 4.781 ± 0.672
ProTrp: 1.103 ± 0.572
ProTyr: 2.574 ± 2.075
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.942 ± 0.783
GlnCys: 1.103 ± 0.016
GlnAsp: 0.736 ± 0.196
GlnGlu: 0.736 ± 0.752
GlnPhe: 0.736 ± 0.196
GlnGly: 1.471 ± 0.392
GlnHis: 1.103 ± 0.54
GlnIle: 2.207 ± 0.524
GlnLys: 2.207 ± 0.524
GlnLeu: 3.678 ± 0.423
GlnMet: 0.736 ± 0.196
GlnAsn: 0.368 ± 0.376
GlnPro: 0.736 ± 0.36
GlnGln: 2.207 ± 0.032
GlnArg: 1.839 ± 0.767
GlnSer: 2.942 ± 0.884
GlnThr: 1.839 ± 0.344
GlnVal: 3.31 ± 0.048
GlnTrp: 0.368 ± 0.18
GlnTyr: 0.368 ± 0.376
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.781 ± 0.439
ArgCys: 0.0 ± 0.0
ArgAsp: 0.736 ± 0.36
ArgGlu: 2.207 ± 0.587
ArgPhe: 2.574 ± 0.407
ArgGly: 3.678 ± 0.423
ArgHis: 1.839 ± 0.344
ArgIle: 2.207 ± 0.032
ArgLys: 3.678 ± 1.244
ArgLeu: 4.781 ± 0.439
ArgMet: 0.736 ± 0.36
ArgAsn: 1.471 ± 0.164
ArgPro: 1.471 ± 0.947
ArgGln: 1.839 ± 0.212
ArgArg: 2.942 ± 0.884
ArgSer: 1.839 ± 0.212
ArgThr: 3.31 ± 0.508
ArgVal: 2.942 ± 0.328
ArgTrp: 0.736 ± 0.752
ArgTyr: 1.103 ± 0.572
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.149 ± 0.297
SerCys: 1.471 ± 0.164
SerAsp: 5.149 ± 0.852
SerGlu: 6.988 ± 1.026
SerPhe: 3.678 ± 0.688
SerGly: 3.31 ± 1.064
SerHis: 1.471 ± 0.72
SerIle: 4.046 ± 0.799
SerLys: 5.517 ± 1.032
SerLeu: 5.885 ± 1.011
SerMet: 2.207 ± 1.143
SerAsn: 2.942 ± 0.227
SerPro: 3.31 ± 0.048
SerGln: 2.207 ± 0.587
SerArg: 2.574 ± 0.407
SerSer: 6.988 ± 2.863
SerThr: 2.574 ± 0.407
SerVal: 4.781 ± 0.117
SerTrp: 0.0 ± 0.0
SerTyr: 2.942 ± 0.783
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.885 ± 1.566
ThrCys: 0.736 ± 0.36
ThrAsp: 2.574 ± 0.963
ThrGlu: 4.046 ± 0.868
ThrPhe: 5.149 ± 0.815
ThrGly: 2.207 ± 0.032
ThrHis: 0.736 ± 0.36
ThrIle: 2.207 ± 0.587
ThrLys: 4.046 ± 1.424
ThrLeu: 5.885 ± 0.656
ThrMet: 1.839 ± 0.767
ThrAsn: 3.678 ± 0.423
ThrPro: 3.31 ± 1.715
ThrGln: 2.574 ± 0.704
ThrArg: 2.207 ± 0.032
ThrSer: 3.678 ± 0.979
ThrThr: 7.356 ± 0.847
ThrVal: 4.413 ± 1.73
ThrTrp: 1.103 ± 0.572
ThrTyr: 1.471 ± 0.72
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.62 ± 0.461
ValCys: 0.368 ± 0.18
ValAsp: 3.678 ± 0.979
ValGlu: 3.31 ± 0.508
ValPhe: 3.31 ± 0.508
ValGly: 4.046 ± 0.243
ValHis: 1.839 ± 0.344
ValIle: 2.942 ± 0.884
ValLys: 2.207 ± 0.524
ValLeu: 5.517 ± 1.032
ValMet: 1.471 ± 0.164
ValAsn: 3.678 ± 0.979
ValPro: 5.885 ± 1.212
ValGln: 4.413 ± 0.063
ValArg: 5.517 ± 1.032
ValSer: 4.781 ± 0.995
ValThr: 4.413 ± 0.619
ValVal: 2.574 ± 0.148
ValTrp: 1.103 ± 0.572
ValTyr: 2.942 ± 0.227
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.471 ± 0.947
TrpCys: 0.0 ± 0.0
TrpAsp: 1.839 ± 0.212
TrpGlu: 0.368 ± 0.18
TrpPhe: 0.0 ± 0.0
TrpGly: 0.736 ± 0.196
TrpHis: 0.0 ± 0.0
TrpIle: 1.471 ± 0.392
TrpLys: 1.471 ± 0.164
TrpLeu: 1.471 ± 0.72
TrpMet: 0.0 ± 0.0
TrpAsn: 0.736 ± 0.196
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.103 ± 1.127
TrpSer: 1.103 ± 0.016
TrpThr: 0.736 ± 0.752
TrpVal: 1.103 ± 0.572
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.368 ± 0.18
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.678 ± 0.688
TyrCys: 0.736 ± 0.752
TyrAsp: 2.207 ± 0.524
TyrGlu: 1.103 ± 0.54
TyrPhe: 2.207 ± 1.699
TyrGly: 1.471 ± 0.392
TyrHis: 1.471 ± 0.392
TyrIle: 0.368 ± 0.376
TyrLys: 3.678 ± 0.688
TyrLeu: 2.574 ± 0.407
TyrMet: 0.368 ± 0.18
TyrAsn: 2.574 ± 0.407
TyrPro: 2.574 ± 0.148
TyrGln: 0.368 ± 0.18
TyrArg: 0.368 ± 0.18
TyrSer: 2.942 ± 0.783
TyrThr: 2.574 ± 0.407
TyrVal: 3.678 ± 0.423
TyrTrp: 0.736 ± 0.36
TyrTyr: 1.103 ± 0.572
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2720 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
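A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is reported. This is only a sketch under assumptions. The function names, the one-letter toy sequences, the fixed seed, and the choice of the standard deviation as the reported error are all hypothetical; the source states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the error is the standard deviation of the resampled values.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome; identical proteins give zero spread.
identical_error = bootstrap_error(["AAK", "AAK"], "AA")
mixed_error = bootstrap_error(["AAAA", "KKKK"], "AA")
```

With only two proteins, as in this proteome, each resample can take just a handful of distinct compositions, so the estimated errors are coarse and should be read with caution.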

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski